nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Emmanuel Joke (JIRA)" <>
Subject [jira] Commented: (NUTCH-531) Pages with no ContentType cause a Null Pointer exception
Date Fri, 04 Jan 2008 07:57:34 GMT


Emmanuel Joke commented on NUTCH-531:

It looks like this issue has been solved with the integration of the framwerok Tika. 
I guess we could close it. Isn't it? 

> Pages with no ContentType cause a Null Pointer exception
> --------------------------------------------------------
>                 Key: NUTCH-531
>                 URL:
>             Project: Nutch
>          Issue Type: Bug
>          Components: fetcher
>    Affects Versions: 0.9.0, 1.0.0
>            Reporter: Carl Cerecke
>         Attachments: NUTCH-531-draft.patch
> Some pages cause a null pointer exception because the contentType is missing (e.g.
> The solution that I used was to change line 165 (trunk) of to:
> Text.writeString(out, contentType != null?contentType:"");
> rfc2616 states this should be application/octet-stream if we don't know the content type,
and can't figure it out.
> But perhaps the problem is in getContentType() at line 281 (trunk). I don't yet know
enough of how it is connected together to determine where the best place for fixing this bug

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message