tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jukka Zitting (JIRA)" <j...@apache.org>
Subject [jira] Updated: (TIKA-126) Add Parser.parse(InputStream, Metadata) for metadata extraction
Date Fri, 19 Sep 2008 22:16:47 GMT

     [ https://issues.apache.org/jira/browse/TIKA-126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Jukka Zitting updated TIKA-126:

    Fix Version/s:     (was: 0.2-incubating)

> Add Parser.parse(InputStream, Metadata) for metadata extraction
> ---------------------------------------------------------------
>                 Key: TIKA-126
>                 URL: https://issues.apache.org/jira/browse/TIKA-126
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>            Reporter: Jukka Zitting
>            Assignee: Jukka Zitting
> In some cases a client is just interested in the parsed metadata and not the extracted
text content. It is easy to ignore the text content by just passing a dummy DefaultHandler
to the existing parse() method, but many parsers could avoid a lot of work if they knew in
advance that the text content is not needed.
> Thus I want to add a parse(InputStream, Metadata) signature to the Parser interface.
I'll also add an AbstractParser base class with a trivial implementation of that method:
>     public abstract AbstractParser implements Parser {
>         public void parse(InputStream stream, Metadata metadata) {
>             parse(stream, new DefaultHandler(), metadata);
>         }
>     }

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message