nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From faruk berksöz (JIRA) <j...@apache.org>
Subject [jira] Created: (NUTCH-899) java.sql.BatchUpdateException: Data truncation: Data too long for column 'content' at row 1
Date Tue, 07 Sep 2010 13:40:34 GMT
java.sql.BatchUpdateException: Data truncation: Data too long for column 'content' at row 1
-------------------------------------------------------------------------------------------

                 Key: NUTCH-899
                 URL: https://issues.apache.org/jira/browse/NUTCH-899
             Project: Nutch
          Issue Type: Bug
          Components: storage
    Affects Versions: 2.0
         Environment: ubuntu 10.04
JVM : 1.6.0_20
nutch 2.0 (trunk)
Mysql/HBase (0.20.6) / Hadoop(0.20.2) pseudo-distributed 
            Reporter: faruk berksöz
            Priority: Minor


wenn i try to fetch a web page (e.g. http://www.w3.org/Protocols/rfc2616/rfc2616-sec14.html
) with mysql storage definition,
I am seeing the following error in my hadoop logs. ,  (no error with hbase ) ;

java.io.IOException: java.sql.BatchUpdateException: Data truncation: Data too long for column
'content' at row 1
    at org.gora.sql.store.SqlStore.flush(SqlStore.java:316)
    at org.gora.sql.store.SqlStore.close(SqlStore.java:163)
    at org.gora.mapreduce.GoraOutputFormat$1.close(GoraOutputFormat.java:72)
    at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:567)
    at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:408)
    at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:216)

The type of the column 'content' is BLOB.
It may be important for the next developments of Gora.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message