nutch-dev mailing list archives

From "Andrzej Bialecki (JIRA)" <>
Subject [jira] Resolved: (NUTCH-69) ignored
Date Fri, 08 Jul 2005 14:39:10 GMT
Andrzej Bialecki  resolved NUTCH-69:

    Resolution: Invalid

This behaviour is caused by improper configuration. When crawling fewer hosts than (fetcher
threads / threads per host), some threads will always be blocked. Solution: change the configuration
to use fewer threads, or more threads per host, or increase max.http.delay so that blocked
threads wait longer.
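
The arithmetic behind that condition can be sketched in a few lines of Python. This is only an illustration, not Nutch code: the function name and its parameters are hypothetical, and it assumes the per-host limit is the only thing keeping a thread idle.

```python
def blocked_threads(hosts: int, fetcher_threads: int, threads_per_host: int) -> int:
    """Return how many fetcher threads cannot be busy at the same time.

    Hypothetical helper for illustration; the names are not Nutch settings.
    """
    # At most threads_per_host threads may talk to one host simultaneously,
    # so the crawl can keep at most hosts * threads_per_host threads busy.
    usable = hosts * threads_per_host
    # Any threads beyond that number have to wait for a host to become free.
    return max(0, fetcher_threads - usable)


if __name__ == "__main__":
    # 3 hosts, 10 fetcher threads, 1 thread per host: 3 < 10 / 1, so threads block.
    print(blocked_threads(hosts=3, fetcher_threads=10, threads_per_host=1))
    # Raising threads per host to 4 gives 3 * 4 = 12 usable slots for 10 threads.
    print(blocked_threads(hosts=3, fetcher_threads=10, threads_per_host=4))
```

A result greater than zero corresponds to the case described above, where the number of hosts is smaller than (fetcher threads / threads per host).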

> ignored
> --------------------------------
>          Key: NUTCH-69
>          URL:
>      Project: Nutch
>         Type: Bug
>   Components: fetcher
>     Reporter: Matthias Jaekle

> Fetcher ignores 'maximum threads per host'.
> If you fetch fewer domains with multiple threads, some webservers feel attacked or cannot
> serve you any more.
> So you lose lots of existing pages in your segments.

This message is automatically generated by JIRA.
