nutch-dev mailing list archives

From "Sebastian Nagel (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (NUTCH-1867) CrawlDbReader: use setFloat to pass min score
Date Sun, 05 Oct 2014 20:41:33 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-1867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14159693#comment-14159693 ]

Sebastian Nagel commented on NUTCH-1867:
----------------------------------------

> do we need to state on the conf property description that this is of type float? or is
> this not required?
The property is only used to pass the min score to the mapper. It is neither listed nor documented
in nutch-default.xml because setting it in a config file has no effect: it is always overwritten,
either by the command-line value or by 0.0.

> CrawlDbReader: use setFloat to pass min score
> ---------------------------------------------
>
>                 Key: NUTCH-1867
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1867
>             Project: Nutch
>          Issue Type: Improvement
>          Components: crawldb
>    Affects Versions: 1.9
>            Reporter: Sebastian Nagel
>            Priority: Trivial
>             Fix For: 1.10
>
>         Attachments: NUTCH-1867-v1.patch
>
>
> The float "min" score value in the CrawlDbTopNMapper is passed via the property "db.reader.topn.min"
> as a long (multiplied by 1 million). The comment "no setFloat() in the API" is no longer valid:
> the method exists in [Configuration|https://hadoop.apache.org/docs/current/api/org/apache/hadoop/conf/Configuration.html|Configuration]
> and should be used. Reported by [~lewismc], see NUTCH-1857.
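
For illustration, here is a minimal sketch of the difference between the two ways of passing the min score. It is not the code from the patch. The Conf class is a hypothetical stand-in that only mimics the setLong/getLong/setFloat/getFloat methods of Hadoop's Configuration so that the example runs without Hadoop on the classpath, and the use of Math.round for the old scaling is an assumption about how the long encoding was done.

```java
import java.util.HashMap;
import java.util.Map;

public class MinScoreDemo {

    /** Hypothetical stand-in for Hadoop's Configuration: values are kept as strings. */
    static class Conf {
        private final Map<String, String> props = new HashMap<>();

        void setLong(String name, long value) {
            props.put(name, Long.toString(value));
        }

        long getLong(String name, long defaultValue) {
            String v = props.get(name);
            return v == null ? defaultValue : Long.parseLong(v);
        }

        void setFloat(String name, float value) {
            props.put(name, Float.toString(value));
        }

        float getFloat(String name, float defaultValue) {
            String v = props.get(name);
            return v == null ? defaultValue : Float.parseFloat(v);
        }
    }

    static final String KEY = "db.reader.topn.min";

    /** Old style (assumed shape): scale by 1 million, store as long, scale back on read. */
    static float roundTripAsLong(float min) {
        Conf conf = new Conf();
        conf.setLong(KEY, Math.round(1000000.0 * min));
        return (float) (conf.getLong(KEY, 0L) / 1000000.0);
    }

    /** New style: store and read the float directly. */
    static float roundTripAsFloat(float min) {
        Conf conf = new Conf();
        conf.setFloat(KEY, min);
        return conf.getFloat(KEY, 0.0f);
    }

    public static void main(String[] args) {
        // A value with more than six decimal places loses digits in the long encoding.
        float min = 0.00000125f;
        System.out.println("as long:  " + roundTripAsLong(min));
        System.out.println("as float: " + roundTripAsFloat(min));
    }
}
```

Besides removing the scaling constant from both the job setup and the mapper, the float variant keeps digits beyond the sixth decimal place, which the long encoding rounds away.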



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
