nutch-dev mailing list archives

From "mawanqiang (JIRA)"
Subject [jira] Created: (NUTCH-742) Checksum Error
Date Sat, 20 Jun 2009 09:18:07 GMT
Checksum Error 

                 Key: NUTCH-742
             Project: Nutch
          Issue Type: Bug
          Components: indexer
    Affects Versions: 1.0.0
         Environment: Linux, Ubuntu 8.04 64-bit; 10 datanodes, 4 GB of memory per node
            Reporter: mawanqiang

Nutch 1.0 fails with an error when creating an index from approximately 1 million records.
The error is:
java.lang.RuntimeException: problem advancing post rec#6758513
at org.apache.hadoop.mapred.Task$
at org.apache.hadoop.mapred.ReduceTask$ReduceValuesIterator.moveToNext(
at org.apache.hadoop.mapred.ReduceTask$
at org.apache.nutch.indexer.IndexerMapReduce.reduce(
at org.apache.nutch.indexer.IndexerMapReduce.reduce(
at org.apache.hadoop.mapred.Child.main(
Caused by: org.apache.hadoop.fs.ChecksumException: Checksum Error
at org.apache.hadoop.mapred.IFileInputStream.doRead(
at org.apache.hadoop.mapred.IFile$Reader.readData(
at org.apache.hadoop.mapred.IFile$Reader.rejigData(
at org.apache.hadoop.mapred.IFile$Reader.readNextBlock(
at org.apache.hadoop.mapred.IFile$
at org.apache.hadoop.mapred.Merger$
at org.apache.hadoop.mapred.Merger$MergeQueue.adjustPriorityQueue(
at org.apache.hadoop.mapred.Merger$
at org.apache.hadoop.mapred.Task$ValuesIterator.readNextKey(
at org.apache.hadoop.mapred.Task$
... 6 more
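
For context, a checksum error of this kind generally means that the bytes read back no longer match a checksum stored alongside them when they were written. The sketch below illustrates that idea with java.util.zip.CRC32. It is not Hadoop's IFileInputStream code: the class name ChecksumSketch, the methods writeWithChecksum and readAndVerify, and the 4-byte trailer layout are all assumptions made for illustration.

```java
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.zip.CRC32;

// Hypothetical illustration only; not taken from Hadoop or Nutch.
public class ChecksumSketch {

    // Returns the payload followed by a 4-byte CRC32 trailer (assumed layout).
    static byte[] writeWithChecksum(byte[] payload) {
        CRC32 crc = new CRC32();
        crc.update(payload, 0, payload.length);
        ByteBuffer buf = ByteBuffer.allocate(payload.length + 4);
        buf.put(payload);
        buf.putInt((int) crc.getValue());
        return buf.array();
    }

    // Recomputes the CRC32 over the payload and compares it with the trailer.
    static byte[] readAndVerify(byte[] stored) throws IOException {
        if (stored.length < 4) {
            throw new IOException("Record too short to hold a checksum");
        }
        int payloadLen = stored.length - 4;
        CRC32 crc = new CRC32();
        crc.update(stored, 0, payloadLen);
        int expected = ByteBuffer.wrap(stored, payloadLen, 4).getInt();
        if ((int) crc.getValue() != expected) {
            throw new IOException("Checksum Error");
        }
        return Arrays.copyOf(stored, payloadLen);
    }

    public static void main(String[] args) throws IOException {
        byte[] stored = writeWithChecksum("segment data".getBytes(StandardCharsets.UTF_8));
        readAndVerify(stored);   // intact data passes verification

        stored[3] ^= 0x01;       // simulate a single flipped bit on disk or in transit
        try {
            readAndVerify(stored);
            System.out.println("corruption went unnoticed");
        } catch (IOException e) {
            System.out.println("detected: " + e.getMessage());
        }
    }
}
```

In this sketch a single flipped bit is enough to fail verification, so an error like the one above may point to corrupted intermediate data (disk, memory, or network) rather than to the indexing logic itself. That is only one possible reading; the trace alone does not confirm it.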

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
