tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Fausto Cruzeiro de Moraes (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-876) Signed pdf parsing
Date Fri, 16 Mar 2012 20:53:39 GMT

    [ https://issues.apache.org/jira/browse/TIKA-876?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13231616#comment-13231616
] 

Fausto Cruzeiro de Moraes commented on TIKA-876:
------------------------------------------------

Hi Nick

I am running Tika over two files: PDFnotsigned.pdf (original pdf document, application/pdf)
and PDFsigned.pdf.p7s (digitally signed document, application/pkcs7-signature).

1 - When running the statement: java -jar tika-app-1.0.jar -t PDFnotsigned.pdf > PDFnotsigned.pdf.txt,
i get an output file with the expected content

2 - When running the statement: When running the statement: java -jar tika-app-1.0.jar -t
PDFsigned.pdf > PDFsigned.pdf.txt, i get an output file with no content at all, just 0Kb.

As far as I am concerned, there is no default tika filter related to application/pkcs7-signature
mimetype...

Thanks





                
> Signed pdf parsing
> ------------------
>
>                 Key: TIKA-876
>                 URL: https://issues.apache.org/jira/browse/TIKA-876
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>    Affects Versions: 1.0
>         Environment: Java 6.0, Ubuntu
>            Reporter: Fausto Cruzeiro de Moraes
>              Labels: features
>             Fix For: 1.0
>
>   Original Estimate: 168h
>  Remaining Estimate: 168h
>
> Is there an estimated date for implementing default parsing for signed documents, like
signed pdf files (pk7s format), for example?

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message