tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Joachim Zittmayr (JIRA)" <j...@apache.org>
Subject [jira] Created: (TIKA-223) PDFParser causes Problems when using encrypted PDF documents
Date Wed, 06 May 2009 14:15:30 GMT
PDFParser causes Problems when using encrypted PDF documents
------------------------------------------------------------

                 Key: TIKA-223
                 URL: https://issues.apache.org/jira/browse/TIKA-223
             Project: Tika
          Issue Type: Bug
          Components: parser
    Affects Versions: 0.3
         Environment: Java 1.5.x on MAC, WIN, LIN
            Reporter: Joachim Zittmayr
             Fix For: 0.4


The PDFParser.parse() method decrypts the document for the metadata already and then passes
it over to PDF2XHTML.process(), which in turn calls the inherited getText(). This calls writeText(),
which tries to decrypt the PDDocument again, but this will fail as it is already decrypted.
The solution would be to override  writeText(), without the document.isEncrypted check.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message