nutch-dev mailing list archives

From "Chirag Chaman" <...@filangy.com>
Subject RE: [jira] Commented: (NUTCH-66) Cookies are not being read properly
Date Tue, 05 Jul 2005 20:38:34 GMT
Andrzej,

This does NOT work.
It still complains when it sees a domain name without a leading period.
 


-----Original Message-----
From: Andrzej Bialecki (JIRA) [mailto:jira@apache.org] 
Sent: Monday, July 04, 2005 12:57 PM
To: nutch-dev@incubator.apache.org
Subject: [jira] Commented: (NUTCH-66) Cookies are not being read properly

    [ http://issues.apache.org/jira/browse/NUTCH-66?page=comments#action_12315027 ]

Andrzej Bialecki  commented on NUTCH-66:
----------------------------------------

If you are using protocol-httpclient, add the following lines to Http.java,
around line 395:

    params.setParameter("http.protocol.cookie-policy",
        CookiePolicy.BROWSER_COMPATIBILITY);
    params.setBooleanParameter("http.protocol.single-cookie-header", true);

Please report if this helps.

> Cookies are not being read properly
> -----------------------------------
>
>          Key: NUTCH-66
>          URL: http://issues.apache.org/jira/browse/NUTCH-66
>      Project: Nutch
>         Type: Improvement
>   Components: fetcher
>     Reporter: CC Chaman
>     Priority: Minor

>
> Cookies that do not begin with a period are not being accepted. For
> example "cnn.com" instead of the RFC ".cnn.com". But A LOT of sites seem to
> not know the standard. It would be nice if the plugin accepted those cookies
> as well.
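
For illustration only, the lenient behaviour requested above could be sketched
roughly as follows. This is a minimal standalone sketch using only the JDK. It
is not the code in protocol-httpclient or Commons HttpClient, and the class and
method names (CookieDomains, normalize, strictMatch, lenientMatch) are
hypothetical. It assumes that "accepting" such a cookie simply means treating a
Domain attribute of "cnn.com" as if it had been sent as ".cnn.com".

```java
import java.util.Locale;

/** Hypothetical helper for illustration; not part of Nutch or Commons HttpClient. */
final class CookieDomains {

    /** Adds the leading period when the Domain attribute was sent without one. */
    static String normalize(String cookieDomain) {
        String d = cookieDomain.trim().toLowerCase(Locale.ROOT);
        return d.startsWith(".") ? d : "." + d;
    }

    /** Strict check: any Domain attribute without a leading period is rejected. */
    static boolean strictMatch(String host, String cookieDomain) {
        String d = cookieDomain.trim().toLowerCase(Locale.ROOT);
        if (!d.startsWith(".")) {
            return false;
        }
        return tailMatch(host, d);
    }

    /** Lenient check: normalizes first, so "cnn.com" is handled like ".cnn.com". */
    static boolean lenientMatch(String host, String cookieDomain) {
        return tailMatch(host, normalize(cookieDomain));
    }

    /** True if host is the bare domain itself or ends with the dotted domain. */
    private static boolean tailMatch(String host, String dottedDomain) {
        String h = host.trim().toLowerCase(Locale.ROOT);
        return h.equals(dottedDomain.substring(1)) || h.endsWith(dottedDomain);
    }

    public static void main(String[] args) {
        // Same host and cookie domain, checked under both policies.
        System.out.println("strict:  " + strictMatch("www.cnn.com", "cnn.com"));
        System.out.println("lenient: " + lenientMatch("www.cnn.com", "cnn.com"));
    }
}
```

Because the comparison is always made against the dotted form, a host such as
"evilcnn.com" still does not match a cookie domain of "cnn.com" in the lenient
case. Empty or malformed Domain values are not handled in this sketch.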

--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira



