nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Taichi Ho <heyuehengtai...@gmail.com>
Subject Re: [MASSMAIL]Re: Fetch failed : java.lang.NullPointerException
Date Sat, 03 Oct 2015 04:01:57 GMT
HI, Roannel,

I have the following enabled:
protocol-selenium|protocol-interactiveselenium|protocol-http|urlfilter-regex|parse-(html|tika)|index-(basic|anchor)|indexer-solr|scoring-opic|urlnormalizer-(pass|regex|basic)

I now think it is because of my network environment. It hasn't happened
ever since.

Thank you.

On Thu, Oct 1, 2015 at 6:21 AM Roannel Fern�ndez Hern�ndez <roannel@uci.cu>
wrote:

> Hi Taichi:
>
> Which plugins you have enabled in nutch-site.xml?
>
> ------------------------------
> *De: *"Taichi Ho" <heyuehengtaichi@gmail.com>
> *Para: *dev@nutch.apache.org
> *Enviados: *Miércoles, 30 de Septiembre 2015 16:57:39
> *Asunto: *[MASSMAIL]Re: Fetch failed : java.lang.NullPointerException
>
>
> Hi, I have the same problem. The following is part of my log:
> http://pastebin.com/JjkJ1qe6
>
> It seems there is a read time out. But I paste the url in the browser and
> it works fine.
>
> Any ideas what could be causing this problem?
>
> Thanks.
>
> On Mon, Sep 28, 2015 at 7:46 AM Michael Joyce <joyce@apache.org> wrote:
>
>> I don't see any null pointer exceptions coming up in your log. Do you
>> have any more info or perhaps I'm missing something?
>>
>>
>> -- Jimmy
>>
>> On Sun, Sep 27, 2015 at 3:04 PM, mithun <mithun626497@gmail.com> wrote:
>>
>>> Hi All
>>>
>>> While crawling my seed list, I bumped into this Null Pointer Exception
>>> for few urls. What could be the problem.
>>>
>>> Please find paste.bin link of my hadoop.log file
>>>
>>> http://pastebin.com/SyyybtEx
>>>
>>>
>>> Thanks
>>> Mithun
>>>
>>
>>

Mime
View raw message