nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "misc" <>
Subject Re: bug with generate performance
Date Fri, 07 Sep 2007 23:47:22 GMT


    I've made a bug, and included the extra required information 
( = -1, error seen with small topN around 100 and large 
topN around 1000000).

    I've since tried to run with a debugger, but the slowness went away 
(ugh).  I also know that dns lookups are not the problem as I ran with 
wireshark running and there were no dns lookups.


> Others have also reported a problem with generate performance. It
> seems we have a problem here but I can not reproduce this behaviour so
> I am not sure what causes it. Can you open a JIRA issue and enter your
> comments there? Also, how you are running generate will be very
> helpful (what is what is -topN argument, etc.)

View raw message