nutch-dev mailing list archives

From "Roman P (JIRA)" <>
Subject [jira] [Created] (NUTCH-2028) java.lang.IllegalArgumentException: can't serialize class org.apache.avro.util.Utf8
Date Sun, 31 May 2015 16:34:17 GMT
Roman P created NUTCH-2028:

             Summary: java.lang.IllegalArgumentException: can't serialize class org.apache.avro.util.Utf8
                 Key: NUTCH-2028
             Project: Nutch
          Issue Type: Bug
          Components: fetcher
    Affects Versions: 2.3
         Environment: Mac OS X 10.10.3, Nutch 2.3
            Reporter: Roman P

I compiled Nutch 2.3 with MongoDB as the persistence backend and get the exception below
when fetching. Searching for similar errors online suggests that this issue was addressed
in Gora 0.6. I tried recompiling with Gora 0.6, but then I get a different exception; it
seems that Gora 0.6 is incompatible with Hadoop 1.2.0. I tried different versions of Hadoop
with no luck.

FetcherJob: starting at 2015-05-31 09:29:04
FetcherJob: batchId: all
FetcherJob: threads: 10
FetcherJob: parsing: false
FetcherJob: resuming: false
FetcherJob : timelimit set for : -1
java.lang.IllegalArgumentException: can't serialize class org.apache.avro.util.Utf8
	at org.bson.BasicBSONEncoder._putObjectField(
	at org.bson.BasicBSONEncoder.putObject(
	at org.bson.BasicBSONEncoder.putObject(
	at com.mongodb.DefaultDBEncoder.writeObject(
	at com.mongodb.OutMessage.putObject(
	at com.mongodb.OutMessage.writeQuery(
	at com.mongodb.OutMessage.query(
	at com.mongodb.DBCollectionImpl.find(
	at com.mongodb.DBCollectionImpl.find(
	at com.mongodb.DBCursor._check(
	at com.mongodb.DBCursor._hasNext(
	at com.mongodb.DBCursor.hasNext(
	at org.apache.gora.mongodb.query.MongoDBResult.nextInner(
	at org.apache.gora.mapreduce.GoraRecordReader.nextKeyValue(
	at org.apache.hadoop.mapred.MapTask$NewTrackingRecordReader.nextKeyValue(
	at org.apache.hadoop.mapreduce.MapContext.nextKeyValue(
	at org.apache.hadoop.mapred.MapTask.runNewMapper(
	at org.apache.hadoop.mapred.LocalJobRunner$Job$
	at java.util.concurrent.Executors$
	at java.util.concurrent.ThreadPoolExecutor.runWorker(
	at java.util.concurrent.ThreadPoolExecutor$
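
The trace above suggests that the failure happens while a MongoDB query object is being BSON-encoded and one of its values is an org.apache.avro.util.Utf8 instance rather than a java.lang.String. The following is only a minimal sketch of that suspected failure mode and of one possible workaround (converting CharSequence values to String before encoding). It does not use the real Avro, Gora, or MongoDB classes: FakeUtf8, encode, normalize, and the "field" key are hypothetical stand-ins invented for illustration, and this is not the actual Gora fix.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class Utf8QuerySketch {

    /** Hypothetical stand-in for org.apache.avro.util.Utf8: a CharSequence that is not a String. */
    static final class FakeUtf8 implements CharSequence {
        private final String value;
        FakeUtf8(String value) { this.value = value; }
        @Override public int length() { return value.length(); }
        @Override public char charAt(int index) { return value.charAt(index); }
        @Override public CharSequence subSequence(int start, int end) { return value.subSequence(start, end); }
        @Override public String toString() { return value; }
    }

    /** Hypothetical strict encoder: accepts only null and a few known types, rejects everything else. */
    static void encode(Map<String, Object> doc) {
        for (Map.Entry<String, Object> entry : doc.entrySet()) {
            Object v = entry.getValue();
            boolean known = v == null || v instanceof String || v instanceof Number || v instanceof Boolean;
            if (!known) {
                throw new IllegalArgumentException("can't serialize " + v.getClass());
            }
        }
    }

    /** Returns a copy of the document in which every CharSequence value is replaced by its String form. */
    static Map<String, Object> normalize(Map<String, Object> doc) {
        Map<String, Object> out = new LinkedHashMap<>();
        for (Map.Entry<String, Object> entry : doc.entrySet()) {
            Object v = entry.getValue();
            out.put(entry.getKey(), (v instanceof CharSequence) ? v.toString() : v);
        }
        return out;
    }

    public static void main(String[] args) {
        Map<String, Object> query = new LinkedHashMap<>();
        query.put("field", new FakeUtf8("all")); // "field" is an illustrative key, not a real Nutch column

        try {
            encode(query); // the raw query carries a non-String CharSequence
        } catch (IllegalArgumentException ex) {
            System.out.println("raw query rejected: " + ex.getMessage());
        }

        encode(normalize(query)); // the same query with String values passes the strict encoder
        System.out.println("normalized query accepted");
    }
}
```

If this reading of the trace is right, any value that reaches the encoder as a Utf8 would need a similar conversion on the Gora side; whether that matches the change made in Gora 0.6 is not confirmed here.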
