nutch-dev mailing list archives

From "Zhang JinYan (Commented) (JIRA)" <>
Subject [jira] [Commented] (NUTCH-1138) remove LogUtil from trunk and nutch gora
Date Tue, 01 Nov 2011 17:11:33 GMT


Zhang JinYan commented on NUTCH-1138:

Applied the patch to branch-1.4 and rebuilt with the command "ant clean build".
Configured the following websites to crawl:

The previous two sites are not available.
Ran the crawl with this command (platform: Windows):
sh.exe ./bin/nutch crawl seedurl -dir crawldev -solr http://localhost:8983/solr/

The crawl completed successfully. A query in the Solr admin returned:
<result name="response" numFound="320" start="0"></result>
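
For anyone repeating this check, the document count can be read out of the response programmatically. The following is only a minimal sketch: the helper name num_found is hypothetical, and it assumes the response is available as an XML string shaped like the one above.

```python
import xml.etree.ElementTree as ET


def num_found(xml_text):
    """Return the numFound attribute of a Solr XML <result name="response"> element."""
    root = ET.fromstring(xml_text)
    # The <result> element may be the root itself or nested inside a wrapper element.
    if root.tag == "result":
        result = root
    else:
        result = root.find(".//result[@name='response']")
    if result is None:
        raise ValueError("no <result name='response'> element found")
    return int(result.attrib["numFound"])


# Uses the fragment quoted in this comment.
print(num_found('<result name="response" numFound="320" start="0"></result>'))
```

With the fragment quoted above this prints 320.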

Checked hadoop.log and searched for the word "ERROR": found 3 results, all caused by:
Connection timed out: connect

Searched for the word "Exception" and found results like this:
2011-11-02 00:39:01,821 INFO  httpclient.HttpMethodDirector - I/O exception (org.apache.commons.httpclient.NoHttpResponseException)
caught when processing request: The server failed to respond
2011-11-02 00:39:01,821 INFO  httpclient.HttpMethodDirector - Retrying request

So there is no exception related to your patch in "hadoop.log".
The patch works fine with "branch-1.4" for me.
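
The log check described above could be scripted roughly as follows. This is a sketch, not part of the patch: the function names are hypothetical, the default path logs/hadoop.log is an assumption about where the log lives, and the match is a plain case-sensitive substring search for the same two words.

```python
from collections import Counter


def scan_log(lines):
    """Count lines containing the words "ERROR" and "Exception" (case-sensitive)."""
    counts = Counter()
    for line in lines:
        if "ERROR" in line:
            counts["ERROR"] += 1
        if "Exception" in line:
            counts["Exception"] += 1
    return counts


def scan_log_file(path="logs/hadoop.log"):
    """Scan a log file on disk; the default path is an assumption, adjust as needed."""
    with open(path, encoding="utf-8", errors="replace") as fh:
        return scan_log(fh)
```

A hit on "Exception" still needs to be read by hand: the INFO-level NoHttpResponseException lines quoted above would be counted, even though they are network retries and unrelated to the patch.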
> remove LogUtil from trunk and nutch gora
> ----------------------------------------
>                 Key: NUTCH-1138
>                 URL:
>             Project: Nutch
>          Issue Type: Improvement
>    Affects Versions: 1.4, nutchgora
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>            Priority: Minor
>             Fix For: nutchgora, 1.5
>         Attachments: Document1.txt, NUTCH-1138-trunk-20111023.patch
> This should move towards the removal of the LogUtil class from both codebases as per
> comments in NUTCH-1078.


