nutch-dev mailing list archives

From "Steve Yao (JIRA)" <>
Subject [jira] [Created] (NUTCH-2280) HTTP Post form authentication CookiePolicy configuration
Date Wed, 15 Jun 2016 11:41:09 GMT
Steve Yao created NUTCH-2280:

             Summary: HTTP Post form authentication CookiePolicy configuration
                 Key: NUTCH-2280
             Project: Nutch
          Issue Type: New Feature
          Components: protocol
    Affects Versions: 1.11
            Reporter: Steve Yao
            Priority: Minor

The protocol-httpclient plugin supports HTTP form authentication: it posts the form values
to the configured login URL and stores the session cookie for subsequent content retrieval.
It currently uses the httpclient default CookiePolicy setting. This default rejects cookies
whose domain starts with ".", for example domain="", although most web browsers accept
this kind of domain value.
I suggest adding a configurable option in conf/httpclient-auth.xml:
{code:xml}<credentials authMethod="formMethod" ...>
    <policy>DEFAULT | BROWSER_COMPATIBILITY | NETSCAPE | RFC_2109 | RFC_2965</policy>
    ...
</credentials>{code}
The httpclient could then use this cookie policy value.
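
As a rough illustration of how the configured value might be read, here is a minimal sketch. The enum name CookiePolicyOption and its parse method are hypothetical and are not part of Nutch or httpclient. Falling back to DEFAULT when the element is missing is also an assumption, chosen so that existing configurations would keep their current behaviour.

```java
import java.util.Locale;

/**
 * Hypothetical helper (not part of Nutch): maps the text of a <policy>
 * element to one of the policy names listed in the proposal.
 */
enum CookiePolicyOption {
    DEFAULT, BROWSER_COMPATIBILITY, NETSCAPE, RFC_2109, RFC_2965;

    /**
     * Parses the configured value, ignoring case and surrounding whitespace.
     * Assumption: a missing or blank value falls back to DEFAULT.
     */
    static CookiePolicyOption parse(String configured) {
        if (configured == null || configured.trim().isEmpty()) {
            return DEFAULT;
        }
        String name = configured.trim().toUpperCase(Locale.ROOT);
        try {
            return valueOf(name);
        } catch (IllegalArgumentException e) {
            // Report the offending value instead of the bare enum error.
            throw new IllegalArgumentException(
                "Unknown cookie policy '" + configured + "'", e);
        }
    }
}
```

Assuming the plugin stays on Commons HttpClient 3.x, the parsed option could then be translated to the matching CookiePolicy constant and applied to the login request with method.getParams().setCookiePolicy(...).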

I am working on a patch for this feature, but before I implement the configuration format
change, I would like to hear any other suggestions or comments.

This message was sent by Atlassian JIRA