nutch-dev mailing list archives

From Taichi Ho <heyuehengtai...@gmail.com>
Subject Re: Redirection in nutch
Date Mon, 05 Oct 2015 06:36:08 GMT
Thanks for pointing that out. But it seems I can't get your patch to apply
directly with git apply. I ended up creating my own version of the patch.


On Sun, Oct 4, 2015 at 11:35 AM Sebastian Nagel <wastl.nagel@googlemail.com>
wrote:

> Hi,
>
> yes, this is a bug: it was fixed in the commit you mentioned
> but has since reappeared. Sorry,
> see https://issues.apache.org/jira/browse/NUTCH-2124,
> you'll also find a patch there. The fix will be
> included in 1.11 for sure.
>
> Thanks,
> Sebastian
>
> On 10/03/2015 09:22 AM, Taichi Ho wrote:
> > Hi, all.
> >
> > I want to handle redirection in Nutch and found this wiki page, which
> > hasn't been updated for years.
> > http://wiki.apache.org/nutch/RedirectHandling
> >
> > I tried setting http.redirect.max as described here:
> > http://stackoverflow.com/questions/17592948/nutch-redirection-handling-issue
> >
> > But the redirection doesn't work, and Nutch keeps fetching the same URL
> > until the max count is reached.
> >
> > Is this a bug?
> > It seems to have been fixed in 1.10, but it still doesn't work.
> > https://github.com/apache/nutch/commit/ed052df8822380ccfa89a9ffa1df324933669a59
> >
> > Thanks.
>
>
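
For readers following the thread, here is a minimal sketch of what bounded redirect following is expected to do, compared with the symptom reported above. It is not Nutch's fetcher code. The class RedirectSketch, the method resolve, and the in-memory redirect map are all hypothetical names made up for illustration, and the map stands in for real HTTP responses.

```java
import java.util.Map;

public class RedirectSketch {
    /**
     * Follows redirects starting at start, at most maxRedirects times.
     * The redirects map is a hypothetical stand-in for the network:
     * each key redirects to its value.
     * Returns the final URL, or null if the limit was exceeded.
     */
    public static String resolve(Map<String, String> redirects, String start, int maxRedirects) {
        String url = start;
        int followed = 0;
        while (redirects.containsKey(url)) {
            if (followed >= maxRedirects) {
                return null; // limit reached, give up
            }
            // Advance to the redirect target. Requesting the original URL
            // again at this point would produce the symptom described in
            // the thread: the same URL fetched until the limit is hit.
            url = redirects.get(url);
            followed++;
        }
        return url;
    }

    public static void main(String[] args) {
        Map<String, String> redirects = Map.of(
            "http://example.com/a", "http://example.com/b",
            "http://example.com/b", "http://example.com/c");
        System.out.println(resolve(redirects, "http://example.com/a", 3));
        System.out.println(resolve(redirects, "http://example.com/a", 1));
    }
}
```

The important step is that the loop variable is updated to the redirect target on every iteration, so the limit only comes into play for chains that are actually longer than maxRedirects.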
