nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Lewis John McGibbney (JIRA)" <>
Subject [jira] [Created] (NUTCH-2206) Provide example scoring.similarity.stopword.file
Date Tue, 26 Jan 2016 01:52:40 GMT
Lewis John McGibbney created NUTCH-2206:

             Summary: Provide example scoring.similarity.stopword.file
                 Key: NUTCH-2206
             Project: Nutch
          Issue Type: Bug
          Components: plugin, scoring
    Affects Versions: 1.11
            Reporter: Lewis John McGibbney
            Assignee: Lewis John McGibbney
             Fix For: 1.12

The scoring-similarity plugin does not provide an example file for the property scoring.similarity.stopword.file.
This is an issue for a number of reasons, namely 
 * A user does not know what it is meant to look like, and
 * We always check of this file and will [throw an exception if it is not found|],
this may not be picked up by the user until much later.

I suggest a simple fix here, simply include the [standard English stop words taken from Lucene's
The comments will help people to easily customize the list to whatever they require. 

This message was sent by Atlassian JIRA

View raw message