nutch-dev mailing list archives

From "Roman Valls (JIRA)" <>
Subject [jira] Commented: (NUTCH-634) Patch - Nutch - Hadoop 0.17.1
Date Wed, 23 Jul 2008 11:55:34 GMT


Roman Valls commented on NUTCH-634:

Sure, it was my fault :/

Running ant clean && ant solved the problem; the crawl is now progressing as it should.

Thanks !

PS: I've also run the test suite, and there are failures after cleaning the environment:

hadoop@braintop:~/nutch$ ant test | grep -i failed
    [junit] Test org.apache.nutch.crawl.TestCrawlDbMerger FAILED
    [junit] Test org.apache.nutch.crawl.TestGenerator FAILED
    [junit] Test org.apache.nutch.crawl.TestInjector FAILED
    [junit] Test org.apache.nutch.crawl.TestLinkDbMerger FAILED
    [junit] Test org.apache.nutch.crawl.TestMapWritable FAILED
    [junit] Test org.apache.nutch.fetcher.TestFetcher FAILED
    [junit] Test org.apache.nutch.indexer.TestDeleteDuplicates FAILED
    [junit] Test org.apache.nutch.searcher.TestDistributedSearch FAILED

> Patch - Nutch - Hadoop 0.17.1
> -----------------------------
>                 Key: NUTCH-634
>                 URL:
>             Project: Nutch
>          Issue Type: Improvement
>    Affects Versions: 1.0.0
>            Reporter: Michael Gottesman
>            Assignee: Andrzej Bialecki 
>             Fix For: 1.0.0
>         Attachments: diff, hadoop-0.17.patch, hadoop-0.17.patch, hadoop-0.17.patch
> This is a patch so that Nutch can be used with Hadoop 0.17.0. The patch is located at
> The patch compiles and passes all current Nutch unit tests.
> I have tested that the crawler side of Nutch (i.e. inject, generate, fetch, parse, merge w/crawldb) definitely works, but have not tested the Lucene indexing part. It might work, but it might not.
> *NOTE* - the two main bugs that had to be overcome were not noticed by any of the unit tests. The bugs only came up during actual testing. The bugs were:
> 1. Changes to the Hadoop Iterator
> 2. Addition of Serialization to MapReduce Framework

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
