nutch-dev mailing list archives

From "Markus Jelsma (JIRA)" <j...@apache.org>
Subject [jira] [Closed] (NUTCH-877) Allow setting of slop values for non-quote phrase queries on query-basic plugin
Date Fri, 01 Apr 2011 14:33:05 GMT

     [ https://issues.apache.org/jira/browse/NUTCH-877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Markus Jelsma closed NUTCH-877.
-------------------------------


> Allow setting of slop values for non-quote phrase queries on query-basic plugin
> -------------------------------------------------------------------------------
>
>                 Key: NUTCH-877
>                 URL: https://issues.apache.org/jira/browse/NUTCH-877
>             Project: Nutch
>          Issue Type: Improvement
>          Components: searcher
>    Affects Versions: 1.2
>         Environment: All
>            Reporter: Dennis Kubes
>            Assignee: Dennis Kubes
>             Fix For: 1.2
>
>         Attachments: NUTCH-877-1-20100809.patch
>
>
> Patch adds a configuration variable for setting slop values on phrase queries.  The default
> slop value, which currently can't be changed through configuration, is Integer.MAX_VALUE.
> It produces something like this, which doesn't seem right to me.  If you are searching for
> a phrase you usually want it within a certain distance:
> 2.9141337E-4 = weight(content:"my phrase"~2147483647 in 1029), product of:
>     * 0.07163286 = queryWeight(content:"my phrase"~2147483647), product of:
>           o 9.657982 = idf(content: my=13470 phrase=534)
>           o 0.0074169594 = queryNorm
> This patch adds the query.phrase.slop configuration value to the nutch-default.xml file.
> It has a default setting of 5.
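
For reference, an entry of this kind in nutch-default.xml would presumably look like the sketch below. The property name and the default of 5 come from the description above; the description text is an assumption and is not taken from the patch.

```xml
<!-- Sketch only: name and value are from the issue text; the description wording is assumed. -->
<property>
  <name>query.phrase.slop</name>
  <value>5</value>
  <description>Assumed wording: slop applied to phrase queries built from
  non-quoted query terms.</description>
</property>
```

A site-specific value would normally be set in nutch-site.xml, which overrides nutch-default.xml.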

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
