tika-dev mailing list archives

From "Fausto Cruzeiro de Moraes (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-876) Signed pdf parsing
Date Fri, 16 Mar 2012 20:53:39 GMT

    [ https://issues.apache.org/jira/browse/TIKA-876?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13231616#comment-13231616 ]

Fausto Cruzeiro de Moraes commented on TIKA-876:

Hi Nick

I am running Tika over two files: PDFnotsigned.pdf (original pdf document, application/pdf)
and PDFsigned.pdf.p7s (digitally signed document, application/pkcs7-signature).

1 - When I run the command: java -jar tika-app-1.0.jar -t PDFnotsigned.pdf > PDFnotsigned.pdf.txt,
I get an output file with the expected content.

2 - When I run the command: java -jar tika-app-1.0.jar -t PDFsigned.pdf > PDFsigned.pdf.txt,
I get an output file with no content at all (0 KB).

As far as I can tell, there is no default Tika parser for application/pkcs7-signature.
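
To illustrate why the two files are treated differently, here is a minimal sketch (not Tika's actual detection code) that tells a plain PDF from a DER-encoded PKCS#7 SignedData envelope by looking at the leading bytes. The class name EnvelopeSniffer and its methods are hypothetical, and the returned strings simply reuse the media types named above. It assumes the signed file is binary DER/BER, not PEM text.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.Arrays;

// Hypothetical helper for illustration only; not part of Tika.
final class EnvelopeSniffer {

    // DER encoding of the PKCS#7 signedData OID 1.2.840.113549.1.7.2
    private static final byte[] SIGNED_DATA_OID = {
        0x06, 0x09, 0x2A, (byte) 0x86, 0x48, (byte) 0x86,
        (byte) 0xF7, 0x0D, 0x01, 0x07, 0x02
    };

    // Every PDF file starts with this ASCII marker.
    private static final byte[] PDF_MAGIC =
        "%PDF-".getBytes(StandardCharsets.US_ASCII);

    // True if `data` contains `prefix` starting at `offset`.
    static boolean startsWith(byte[] data, int offset, byte[] prefix) {
        if (offset < 0 || offset + prefix.length > data.length) {
            return false;
        }
        for (int i = 0; i < prefix.length; i++) {
            if (data[offset + i] != prefix[i]) {
                return false;
            }
        }
        return true;
    }

    // Classifies the first bytes of a file.
    static String sniff(byte[] head) {
        if (startsWith(head, 0, PDF_MAGIC)) {
            return "application/pdf";
        }
        // A PKCS#7 ContentInfo is an ASN.1 SEQUENCE (tag 0x30) whose
        // first element is the content-type OID.
        if (head.length > 2 && head[0] == 0x30) {
            int lenByte = head[1] & 0xFF;
            // Short form and indefinite form (0x80) use one length byte;
            // long form uses 1 + (lenByte & 0x7F) length bytes.
            int offset = (lenByte <= 0x80) ? 2 : 2 + (lenByte & 0x7F);
            if (startsWith(head, offset, SIGNED_DATA_OID)) {
                return "application/pkcs7-signature";
            }
        }
        return "application/octet-stream";
    }

    public static void main(String[] args) throws IOException {
        try (InputStream in = Files.newInputStream(Paths.get(args[0]))) {
            byte[] head = new byte[32];
            int n = in.read(head); // a single read is enough for a sketch
            System.out.println(sniff(Arrays.copyOf(head, Math.max(n, 0))));
        }
    }
}
```

In a signed file of this kind the original PDF bytes sit inside the envelope, so a tool that only recognises the outer type and has no parser for it would produce no text.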


> Signed pdf parsing
> ------------------
>                 Key: TIKA-876
>                 URL: https://issues.apache.org/jira/browse/TIKA-876
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>    Affects Versions: 1.0
>         Environment: Java 6.0, Ubuntu
>            Reporter: Fausto Cruzeiro de Moraes
>              Labels: features
>             Fix For: 1.0
>   Original Estimate: 168h
>  Remaining Estimate: 168h
> Is there an estimated date for implementing default parsing for signed documents, like signed pdf files (pk7s format), for example?


