tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Nick Burch (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-213) JSON output from Tika CLI
Date Sun, 10 Apr 2011 18:32:06 GMT

    [ https://issues.apache.org/jira/browse/TIKA-213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13018129#comment-13018129

Nick Burch commented on TIKA-213:

The patch looks a good start (thanks Selva!). A couple of things we might want to tweak are:
* I'm not sure the removal of quotes in the values is correct, shouldn't we escape it?
* Numbers could be output without quoting
* If we have several values for one metadata field, we should probably output it as key:array
rather than multiple key:value entries

Some of these changes might be easier with a json library, anyone know if jackson for example
would help with them?

> JSON output from Tika CLI
> -------------------------
>                 Key: TIKA-213
>                 URL: https://issues.apache.org/jira/browse/TIKA-213
>             Project: Tika
>          Issue Type: New Feature
>          Components: cli
>            Reporter: Jukka Zitting
>            Assignee: Chris A. Mattmann
>            Priority: Minor
>         Attachments: json_output_option.patch
> From the ApacheCon: CouchDB seems interested in Tika, and they'd like to see an option
for producing JSON output from the Tika CLI.

This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

View raw message