[ https://issues.apache.org/jira/browse/TIKA-1451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14182674#comment-14182674
]
Tim Allison commented on TIKA-1451:
-----------------------------------
Thank you, Chris. The credit goes to [~jukkaz] and [~gagravarr] for the recursive parser
example! I'm grateful to now have an out-of-the-box format (w/ serializers and deserializers)
that captures embedded document metadata.
As I was working on this, I was starting to think that we might want to add some "tika:" prefixed
properties to TikaCoreProperties to capture metadata generated during processing, such as:
tika:content, tika:parse_time_millis, tika:exception, tika:parsed_by (instead of our current
X-Parsed-By). In effect, move the RecursiveParserWrapper properties to TikaCoreProperties
and add some others as necessary.
> Add Recursive Metadata Parser Wrapper output to tika-app and gui
> ----------------------------------------------------------------
>
> Key: TIKA-1451
> URL: https://issues.apache.org/jira/browse/TIKA-1451
> Project: Tika
> Issue Type: Improvement
> Reporter: Tim Allison
> Priority: Minor
> Fix For: 1.7
>
> Attachments: integrate_recursive_metadata_wrapper.patch
>
>
> It would be helpful to expose the output of the recursive metadata parser wrapper in
the gui and in the command line for tika-app.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
|