tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ken Krugler (JIRA)" <j...@apache.org>
Subject [jira] Commented: (TIKA-288) Support override parsers in AutoDetectParser
Date Sun, 11 Oct 2009 16:08:31 GMT

    [ https://issues.apache.org/jira/browse/TIKA-288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12764477#action_12764477

Ken Krugler commented on TIKA-288:

Hi Jukka,

If overriding in TikaConfig, would you suggest a TikaConfig.override(class, Parser), or something


-- Ken

> Support override parsers in AutoDetectParser
> --------------------------------------------
>                 Key: TIKA-288
>                 URL: https://issues.apache.org/jira/browse/TIKA-288
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 0.4
>            Reporter: Ken Krugler
>            Priority: Minor
> In some situations, being able to specify an alternative parser is useful even when the
general parser framework/full set of parsers is desired.
> For example, when processing HTML documents the current HtmlParser doesn't pass through
all of the tags that a vertical crawler might want.
> I'm proposing an alternative constructor, something like:
> public AutoDetectParser(Map<class, Parser>)
> where class would be the class of the standard Tika parser, and Parser is the override.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message