nutch-dev mailing list archives

From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (NUTCH-2245) Developed the NGram Model on the existing Unigram Cosine Similarity Model
Date Mon, 04 Apr 2016 07:57:26 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-2245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15223795#comment-15223795 ]

Hudson commented on NUTCH-2245:
-------------------------------

FAILURE: Integrated in Nutch-trunk #3358 (See [https://builds.apache.org/job/Nutch-trunk/3358/])
Fix for NUTCH-2245 contributed by Bhavya Sanghavi (sujen: rev 2c42680823079faf87705df4d0698dcf8b43ef66)
* src/plugin/scoring-similarity/src/java/org/apache/nutch/scoring/similarity/cosine/CosineSimilarity.java
* src/plugin/scoring-similarity/src/java/org/apache/nutch/scoring/similarity/cosine/Model.java
* conf/nutch-default.xml
* src/plugin/scoring-similarity/src/java/org/apache/nutch/scoring/similarity/util/LuceneTokenizer.java
Add changes record for NUTCH-2245 Developed the NGram Model on the (sujen: rev b62f43fda3648bd7a37da550a180c3da5a1e3986)
* CHANGES.txt


> Developed the NGram Model on the existing Unigram Cosine Similarity Model
> -------------------------------------------------------------------------
>
>                 Key: NUTCH-2245
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2245
>             Project: Nutch
>          Issue Type: New Feature
>          Components: plugin, scoring
>            Reporter: Bhavya Sanghavi
>            Assignee: Sujen Shah
>            Priority: Minor
>              Labels: memex
>             Fix For: 1.12
>
>
> Extends the existing unigram cosine similarity model with an n-gram model, so that the
> user can choose the window size (the n-gram length) used to score the similarity between
> web pages and the gold standard.
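
To illustrate the idea described in the issue, here is a minimal sketch of cosine similarity over word n-grams. It is not the code from the commit above: the class name NGramCosineSketch, the method names, and the minN/maxN parameters are hypothetical, and plain whitespace tokenization stands in for the plugin's LuceneTokenizer. With minN = maxN = 1 it reduces to the unigram case.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.Map;

public class NGramCosineSketch {

  /**
   * Counts every word n-gram of size minN..maxN in the text.
   * Assumption: lower-casing plus whitespace splitting stands in for a real tokenizer.
   */
  public static Map<String, Integer> termFrequencies(String text, int minN, int maxN) {
    String trimmed = text.toLowerCase().trim();
    String[] tokens = trimmed.isEmpty() ? new String[0] : trimmed.split("\\s+");
    Map<String, Integer> tf = new HashMap<>();
    for (int n = minN; n <= maxN; n++) {
      // Slide a window of n tokens across the text.
      for (int i = 0; i + n <= tokens.length; i++) {
        String gram = String.join(" ", Arrays.copyOfRange(tokens, i, i + n));
        tf.merge(gram, 1, Integer::sum);
      }
    }
    return tf;
  }

  /** Euclidean norm of a term-frequency vector. */
  public static double norm(Map<String, Integer> v) {
    double sum = 0.0;
    for (int count : v.values()) {
      sum += (double) count * count;
    }
    return Math.sqrt(sum);
  }

  /** Cosine of the angle between two term-frequency vectors; 0.0 if either is empty. */
  public static double cosine(Map<String, Integer> a, Map<String, Integer> b) {
    double dot = 0.0;
    for (Map.Entry<String, Integer> e : a.entrySet()) {
      Integer other = b.get(e.getKey());
      if (other != null) {
        dot += (double) e.getValue() * other;
      }
    }
    double na = norm(a);
    double nb = norm(b);
    return (na == 0.0 || nb == 0.0) ? 0.0 : dot / (na * nb);
  }

  /** Scores a page against a gold-standard text using n-grams of size minN..maxN. */
  public static double score(String page, String goldStandard, int minN, int maxN) {
    return cosine(termFrequencies(page, minN, maxN),
                  termFrequencies(goldStandard, minN, maxN));
  }

  public static void main(String[] args) {
    String gold = "open source web crawler";
    String page = "crawler web source open";
    // Same words in a different order: unigrams match, bigrams do not.
    System.out.println("unigram score: " + score(page, gold, 1, 1));
    System.out.println("bigram score:  " + score(page, gold, 2, 2));
  }
}
```

A larger window makes the score sensitive to word order and phrasing, while a window of 1 only compares word counts; that trade-off is the flexibility the issue refers to.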



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
