nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Lutischán Ferenc (JIRA) <j...@apache.org>
Subject [jira] Created: (NUTCH-65) index-more plugin can't parse large set of modification-date
Date Fri, 01 Jul 2005 09:55:59 GMT
index-more plugin can't parse large set of  modification-date
-------------------------------------------------------------

         Key: NUTCH-65
         URL: http://issues.apache.org/jira/browse/NUTCH-65
     Project: Nutch
        Type: Bug
  Components: indexer  
 Environment: nutch 0.7, java 1.5, linux
    Reporter: Lutischán Ferenc


I found a problem in MoreIndexingFilter.java.
When I indexing segments, I get large list of error messages:
can't parse errorenous date: Wed, 10 Sep 2003 11:59:14 or
can't parse errorenous date: Wed, 10 Sep 2003 11:59:14GMT

I modifiing source code (I don't make a 'patch'):
Original (lines 137-138):
DateFormat df = new SimpleDateFormat("EEE MMM dd HH:mm:ss yyyy zzz");
Date d = df.parse(date);
New:
DateFormat df = new SimpleDateFormat("EEE, MMM dd HH:mm:ss yyyy", Locale.US);
Date d = df.parse(date.substring(0,25));

The modified code works fine.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


Mime
View raw message