nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chris A. Mattmann (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (NUTCH-2052) Enhance index-static to allow configurable delimiters
Date Thu, 02 Jul 2015 21:44:04 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-2052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14612579#comment-14612579
] 

Chris A. Mattmann commented on NUTCH-2052:
------------------------------------------

Thanks Peter! I think someone on the 2.x side can forward port this to 2.x so we should be
good. I'll try your PR again against the latest trunk and try to get it in there. Thanks much!

> Enhance index-static to allow configurable delimiters
> -----------------------------------------------------
>
>                 Key: NUTCH-2052
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2052
>             Project: Nutch
>          Issue Type: Improvement
>          Components: indexer
>    Affects Versions: 1.10
>            Reporter: Peter Ciuffetti
>            Assignee: Chris A. Mattmann
>             Fix For: 1.11
>
>
> The index-static plugin has a set of fixed-value delimiters that control the parsing
of the property index.static.
> comma is used to separate fields
> colon is used to separate field name from field value
> space is used to separate multiple values in the field
> This set of choices makes it impossible to have a fixed field value containing a space,
comma or colon.
> The proposed enhancement is to allow configuration properties to override any of these
defaults.
> index.static.fieldsep (default comma)
> index.static.keysep (default colon)
> index.static.valuesep (default space)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message