tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Nick Burch (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-850) Consistent way to supply document passwords to parsers
Date Wed, 25 Jan 2012 17:10:41 GMT

    [ https://issues.apache.org/jira/browse/TIKA-850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13193124#comment-13193124
] 

Nick Burch commented on TIKA-850:
---------------------------------

Currently, the objects set onto the ParseContext are:
 * Detector.class
 * DocumentSelector.class
 * EmbeddedDocumentExtractor.class
 * Locale.class
 * MimeConfig.class
 * Parser.class

The ones set onto the Metadata for use by parsers are:
 * RESOURCE_NAME_KEY (resourceName)
 * CONTENT_TYPE (Content-Type)
 * PASSWORD (org.apache.pdfbox.tika.password) *PDF Only*
 * TIKA_MIME_FILE (tika.mime.file);
 * MIME_TYPE_MAGIC (mime.type.magic);

                
> Consistent way to supply document passwords to parsers
> ------------------------------------------------------
>
>                 Key: TIKA-850
>                 URL: https://issues.apache.org/jira/browse/TIKA-850
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.0
>            Reporter: Nick Burch
>
> Currently, PDF document passwords are supplied to the parser via a special key on the
Metadata object, while the Office Parser has a TODO and only supports the default password
> We should update all the parsers that support encrypted documents (currently PDF, Office
OLE2 and Office OOXML) to receive the password in a consistent way

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message