nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Markus Jelsma (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (NUTCH-941) Search returns blank page, when there is more than one SOLR server configured
Date Fri, 01 Apr 2011 14:35:07 GMT

     [ https://issues.apache.org/jira/browse/NUTCH-941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Markus Jelsma updated NUTCH-941:
--------------------------------


Bulk close of legacy issues:
http://www.lucidimagination.com/search/document/2738eeb014805854/clean_up_open_legacy_issues_in_jira

> Search returns blank page, when there is more than one SOLR server configured 
> ------------------------------------------------------------------------------
>
>                 Key: NUTCH-941
>                 URL: https://issues.apache.org/jira/browse/NUTCH-941
>             Project: Nutch
>          Issue Type: Bug
>          Components: searcher
>    Affects Versions: 1.2
>            Reporter: Charan Malemarpuram
>         Attachments: NUTCH-941.patch
>
>
> Search returns a blank page throwing the following exception in the log file.
> java.lang.RuntimeException: Missing hit details! Found: 7, expecting: 8
>         at org.apache.nutch.searcher.SolrSearchBean.getDetails(SolrSearchBean.java:175)
>         at org.apache.nutch.searcher.DistributedSearchBean$DetailTask.call(DistributedSearchBean.java:92)
> This happens, when there is more than one SOLR server configured for search.
> Root cause of this issue is the 
>  NutchBean dedup logic does a "contains" check on a Map of Hit objects . 
>  Hit objects do not have hashcode and equals implemented. It is matching by reference,

>  When NutchBean requests for more hits to process site based result grouping,  it gets
a new object every time from SOLR result and the whole logic breaks.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message