tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tim Allison (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-1451) Add Recursive Metadata Parser Wrapper output to tika-app and gui
Date Fri, 24 Oct 2014 11:15:34 GMT

    [ https://issues.apache.org/jira/browse/TIKA-1451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14182674#comment-14182674

Tim Allison commented on TIKA-1451:

Thank you, Chris.  The credit goes to [~jukkaz] and [~gagravarr] for the recursive parser
example!  I'm grateful to now have an out-of-the-box format (w/ serializers and deserializers)
that captures embedded document metadata.

As I was working on this, I was starting to think that we might want to add some "tika:" prefixed
properties to TikaCoreProperties to capture metadata generated during processing, such as:
tika:content, tika:parse_time_millis, tika:exception, tika:parsed_by (instead of our current
X-Parsed-By).  In effect, move the RecursiveParserWrapper properties to TikaCoreProperties
and add some others as necessary.

> Add Recursive Metadata Parser Wrapper output to tika-app and gui
> ----------------------------------------------------------------
>                 Key: TIKA-1451
>                 URL: https://issues.apache.org/jira/browse/TIKA-1451
>             Project: Tika
>          Issue Type: Improvement
>            Reporter: Tim Allison
>            Priority: Minor
>             Fix For: 1.7
>         Attachments: integrate_recursive_metadata_wrapper.patch
> It would be helpful to expose the output of the recursive metadata parser wrapper in
the gui and in the command line for tika-app.

This message was sent by Atlassian JIRA

View raw message