nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Markus Haense <mhae...@gmail.com>
Subject Re: NullPointerException mapred
Date Fri, 10 Apr 2009 12:56:42 GMT
It looks like when you write your own indexer plugin and forget to add  
the field url in a doc then you get this kind of error :/

Regards,
MyD


On Apr 10, 2009, at 8:11 PM, MyD wrote:

> Hi @ all,
>
> I am using the newest trunk source code. I get every time this error  
> msg:
>
> 2009-04-10 20:08:23,816 INFO  indexer.Indexer - Indexer: done
> 2009-04-10 20:08:23,817 INFO  indexer.DeleteDuplicates - Dedup:  
> starting
> 2009-04-10 20:08:23,818 INFO  indexer.DeleteDuplicates - Dedup:  
> adding indexes in: crawl.dirs/crawl.wikicfp.test/indexes
> 2009-04-10 20:08:23,828 WARN  mapred.JobClient - Use  
> GenericOptionsParser for parsing the arguments. Applications should  
> implement Tool for the
> same.
> 2009-04-10 20:08:24,987 WARN  mapred.LocalJobRunner - job_local_0014
> java.lang.NullPointerException
>        at org.apache.hadoop.io.Text.encode(Text.java:388)
>        at org.apache.hadoop.io.Text.set(Text.java:178)
>        at org.apache.nutch.indexer.DeleteDuplicates$InputFormat 
> $DDRecordReader.next(DeleteDuplicates.java:191)
>        at org.apache.nutch.indexer.DeleteDuplicates$InputFormat 
> $DDRecordReader.next(DeleteDuplicates.java:157)
>        at org.apache.hadoop.mapred.MapTask 
> $TrackedRecordReader.moveToNext(MapTask.java:192)
>        at org.apache.hadoop.mapred.MapTask 
> $TrackedRecordReader.next(MapTask.java:176)
>        at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:48)
>        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:342)
>        at org.apache.hadoop.mapred.LocalJobRunner 
> $Job.run(LocalJobRunner.java:138)
>
>
> Any idea? Thanks in advance.
>
> Regards,
> MyD


Mime
View raw message