nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sujen Shah (JIRA)" <>
Subject [jira] [Commented] (NUTCH-2206) Provide example scoring.similarity.stopword.file
Date Tue, 26 Jan 2016 19:33:40 GMT


Sujen Shah commented on NUTCH-2206:

Ohh yes, will do it now, missed it in the patch. 

> Provide example scoring.similarity.stopword.file
> ------------------------------------------------
>                 Key: NUTCH-2206
>                 URL:
>             Project: Nutch
>          Issue Type: Bug
>          Components: plugin, scoring
>    Affects Versions: 1.11
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>             Fix For: 1.12
>         Attachments: NUTCH-2206.patch
> The scoring-similarity plugin does not provide an example file for the property scoring.similarity.stopword.file.
> This is an issue for a number of reasons, namely 
>  * A user does not know what it is meant to look like, and
>  * We always check of this file and will [throw an exception if it is not found|],
this may not be picked up by the user until much later.
> I suggest a simple fix here, simply include the [standard English stop words taken from
Lucene's StopAnalyzer|].
The comments will help people to easily customize the list to whatever they require. 

This message was sent by Atlassian JIRA

View raw message