nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Lewis John McGibbney (JIRA)" <j...@apache.org>
Subject [jira] [Assigned] (NUTCH-2273) Selenium and InteractiveSelenium Do Not Support HTTPS
Date Mon, 13 Jun 2016 19:30:30 GMT

     [ https://issues.apache.org/jira/browse/NUTCH-2273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Lewis John McGibbney reassigned NUTCH-2273:
-------------------------------------------

    Assignee: Lewis John McGibbney

> Selenium and InteractiveSelenium Do Not Support HTTPS
> -----------------------------------------------------
>
>                 Key: NUTCH-2273
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2273
>             Project: Nutch
>          Issue Type: Bug
>          Components: plugin
>    Affects Versions: 1.11
>            Reporter: Brian Zhao
>            Assignee: Lewis John McGibbney
>
> Both Selenium and InteractiveSelenium plugins do not have the https protocol specified
in their plugin.xml, and will not fetch https links.
> To fix for the Selenium plugin you should add: 
>   
>       <implementation id="org.apache.nutch.protocol.selenium.Http"
>                       class="org.apache.nutch.protocol.selenium.Http">
>          <parameter name="protocolName" value="https"/>
>       </implementation>
> to Selenium's plugin.xml (as a child element of the "extension" element)
> An implementation already exists in protocol-http HttpResponse.java, and I've merged
it into selenium's HttpResponse.java here: http://pastebin.com/ZAPfwee4
> This should probably be similarly done for the InteractiveSelenium plugin.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message