[ https://issues.apache.org/jira/browse/NUTCH-901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-901:
--------------------------------
Attachment: NUTCH-901-trunk.998961.patch
Here's also a patch for 2.0 trunk. I could not test the code because i haven't managed to
compile trunk as of yet.
> Make index-more plug-in configurable
> ------------------------------------
>
> Key: NUTCH-901
> URL: https://issues.apache.org/jira/browse/NUTCH-901
> Project: Nutch
> Issue Type: Improvement
> Components: indexer
> Affects Versions: 1.2, 2.0
> Reporter: Markus Jelsma
> Fix For: 2.0
>
> Attachments: NUTCH-901-MarkusJelsma.998958.patch, NUTCH-901-trunk.998961.patch
>
>
> In my case, i don't want the index-more plug-in to split content-types on slash. Tokenization
is something a Solr instance should take care of. Instead of removing the code (which would
break compatibility for users that rely on it), we need a way to configure the plug-in not
to split the content-type.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
|