nutch-dev mailing list archives

From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (NUTCH-1771) Solrindex fails if a segment is corrupted or incomplete
Date Thu, 09 Apr 2015 22:51:13 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-1771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14488432#comment-14488432 ]

Hudson commented on NUTCH-1771:
-------------------------------

SUCCESS: Integrated in Nutch-trunk #3052 (See [https://builds.apache.org/job/Nutch-trunk/3052/])
NUTCH-1771 Indexer fails if a segment is corrupted or incomplete (snagel: http://svn.apache.org/viewvc/nutch/trunk/?view=rev&rev=1672507)
* /nutch/trunk/CHANGES.txt
* /nutch/trunk/conf/log4j.properties
* /nutch/trunk/src/java/org/apache/nutch/indexer/IndexingJob.java
* /nutch/trunk/src/java/org/apache/nutch/segment/SegmentChecker.java


> Solrindex fails if a segment is corrupted or incomplete
> -------------------------------------------------------
>
>                 Key: NUTCH-1771
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1771
>             Project: Nutch
>          Issue Type: Bug
>          Components: indexer
>    Affects Versions: 1.8, 1.10
>            Reporter: Diaa
>            Priority: Minor
>             Fix For: 1.10
>
>
> When using solrindex to index multiple segments via -dir segment,
> the indexing fails if one or more segments are corrupted/incomplete (generated but not
> fetched, for example).
> The failure is simply a java.io exception.
> Deleting the segment fixes the issue.
> The expected behavior should be one of the following:
> * skipping the segment and proceeding with others (while logging)
> * stopping the indexing and logging the failed segment
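
The first expected behavior (skip the segment, log it, and proceed with the others) could be sketched roughly as follows. This is not the code committed in SegmentChecker.java. The class name SegmentFilterSketch, the REQUIRED list, and the use of local java.nio paths instead of Hadoop's FileSystem are all assumptions made for illustration.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.logging.Logger;

/** Hypothetical helper: filters out segments that are not ready for indexing. */
public class SegmentFilterSketch {
    private static final Logger LOG =
        Logger.getLogger(SegmentFilterSketch.class.getName());

    // Assumption: a segment is indexable only if these subdirectories exist.
    // A segment that was generated but never fetched has only crawl_generate.
    static final String[] REQUIRED =
        {"crawl_fetch", "crawl_parse", "parse_data", "parse_text"};

    /** Returns true if the segment has every required subdirectory. */
    static boolean isIndexable(Path segment) {
        if (!Files.isDirectory(segment)) {
            LOG.warning("Skipping " + segment + ": not a directory");
            return false;
        }
        for (String name : REQUIRED) {
            if (!Files.isDirectory(segment.resolve(name))) {
                LOG.warning("Skipping segment " + segment + ": missing " + name);
                return false;
            }
        }
        return true;
    }

    /** Lists the indexable segments under segmentsDir, sorted by path. */
    static List<Path> indexableSegments(Path segmentsDir) throws IOException {
        List<Path> usable = new ArrayList<>();
        try (DirectoryStream<Path> children = Files.newDirectoryStream(segmentsDir)) {
            for (Path child : children) {
                if (isIndexable(child)) {
                    usable.add(child);
                }
            }
        }
        Collections.sort(usable);
        return usable;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: SegmentFilterSketch <segments-dir>");
            return;
        }
        for (Path segment : indexableSegments(Paths.get(args[0]))) {
            System.out.println(segment);
        }
    }
}
```

Only the paths that pass the check would then be handed to the indexing job. The second expected behavior (stop and name the failed segment) would instead throw an exception at the point where the warning is logged.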



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
