tika-dev mailing list archives

From "William Palmer (JIRA)" <j...@apache.org>
Subject [jira] [Created] (TIKA-1232) Add PDF version to PDFParser output
Date Wed, 05 Feb 2014 14:06:10 GMT
William Palmer created TIKA-1232:

             Summary: Add PDF version to PDFParser output
                 Key: TIKA-1232
                 URL: https://issues.apache.org/jira/browse/TIKA-1232
             Project: Tika
          Issue Type: Improvement
          Components: parser
    Affects Versions: 1.5
         Environment: JDK6
            Reporter: William Palmer
            Priority: Minor
         Attachments: pdfversion.patch

I'd like to identify the PDF version of files. This is not currently reported by the PDFParser,
although the information is available via PDFBox.  I have attached a patch that adds the format
version to the Metadata object.

However, I am not familiar enough with the Tika source to know whether an alternative metadata
key should be used or this new one added.
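
For illustration only, and not the contents of pdfversion.patch: a minimal, JDK6-compatible
sketch of the idea using only the standard library. It assumes the version can be read from
the "%PDF-x.y" header near the start of the file, whereas the patch obtains it via PDFBox.
The class name, the method names, the plain Map standing in for Tika's Metadata object, and
the key "pdf:version" are all hypothetical. A header scan also ignores a /Version entry in
the document catalog, which can override the header in PDF 1.4 and later.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.HashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class PdfVersionSketch {
    // Hypothetical metadata key; the issue leaves the choice of key open.
    static final String PDF_VERSION_KEY = "pdf:version";

    // Matches a header such as "%PDF-1.4" and captures "1.4".
    private static final Pattern HEADER = Pattern.compile("%PDF-(\\d\\.\\d)");

    /** Returns the version found in the first 1024 bytes, or null if there is none. */
    static String readPdfVersion(InputStream in) throws IOException {
        byte[] buf = new byte[1024];
        int total = 0;
        int n;
        // read() may return fewer bytes than requested, so loop until full or EOF.
        while (total < buf.length
                && (n = in.read(buf, total, buf.length - total)) != -1) {
            total += n;
        }
        // ISO-8859-1 maps every byte to one char, so binary data cannot break decoding.
        String head = new String(buf, 0, total, "ISO-8859-1");
        Matcher m = HEADER.matcher(head);
        return m.find() ? m.group(1) : null;
    }

    /** Stores the version under PDF_VERSION_KEY when a header is present. */
    static void addPdfVersion(Map<String, String> metadata, InputStream in)
            throws IOException {
        String version = readPdfVersion(in);
        if (version != null) {
            metadata.put(PDF_VERSION_KEY, version);
        }
    }

    public static void main(String[] args) throws IOException {
        // A stand-in for the start of a real PDF file.
        byte[] sample = "%PDF-1.4\n%rest of file".getBytes("ISO-8859-1");
        Map<String, String> metadata = new HashMap<String, String>();
        addPdfVersion(metadata, new ByteArrayInputStream(sample));
        System.out.println(metadata.get(PDF_VERSION_KEY)); // prints "1.4"
    }
}
```

Leaving the key unset when no header is found, rather than storing an empty value, lets
callers tell "unknown" apart from a reported version.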

Comments welcome.

