nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Alexis (JIRA)" <j...@apache.org>
Subject [jira] Created: (NUTCH-965) Parsing takes up 100% CPU
Date Tue, 08 Feb 2011 17:38:57 GMT
Parsing takes up 100% CPU
-------------------------

                 Key: NUTCH-965
                 URL: https://issues.apache.org/jira/browse/NUTCH-965
             Project: Nutch
          Issue Type: Improvement
          Components: parser
            Reporter: Alexis


The issue you're likely to run into when parsing truncated FLV files is described here:
http://www.mail-archive.com/user@nutch.apache.org/msg01880.html

The parser library gets stuck in infinite loop as it encounters corrupted data due to for
example truncating big binary files at fetch time.

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message