nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (NUTCH-2062) Add Plugin for interacting with Selenium WebDriver
Date Sun, 02 Aug 2015 23:55:05 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-2062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14651295#comment-14651295
] 

Hudson commented on NUTCH-2062:
-------------------------------

SUCCESS: Integrated in Nutch-trunk #3237 (See [https://builds.apache.org/job/Nutch-trunk/3237/])
Changes for NUTCH-2062 (mattmann: http://svn.apache.org/viewvc/nutch/trunk/?view=rev&rev=1693838)
* /nutch/trunk/CHANGES.txt
Fix for NUTCH-2062: Add Plugin for interacting with Selenium WebDriver contributed by Michael
Joyce <mltjoyce@gmail.com> this closes #46 (mattmann: http://svn.apache.org/viewvc/nutch/trunk/?view=rev&rev=1693837)
* /nutch/trunk/build.xml
* /nutch/trunk/conf/nutch-default.xml
* /nutch/trunk/src/plugin/build.xml
* /nutch/trunk/src/plugin/lib-selenium/src/java/org/apache/nutch/protocol/selenium/HttpWebClient.java
* /nutch/trunk/src/plugin/protocol-interactiveselenium
* /nutch/trunk/src/plugin/protocol-interactiveselenium/README.md
* /nutch/trunk/src/plugin/protocol-interactiveselenium/build-ivy.xml
* /nutch/trunk/src/plugin/protocol-interactiveselenium/build.xml
* /nutch/trunk/src/plugin/protocol-interactiveselenium/ivy.xml
* /nutch/trunk/src/plugin/protocol-interactiveselenium/plugin.xml
* /nutch/trunk/src/plugin/protocol-interactiveselenium/src
* /nutch/trunk/src/plugin/protocol-interactiveselenium/src/java
* /nutch/trunk/src/plugin/protocol-interactiveselenium/src/java/org
* /nutch/trunk/src/plugin/protocol-interactiveselenium/src/java/org/apache
* /nutch/trunk/src/plugin/protocol-interactiveselenium/src/java/org/apache/nutch
* /nutch/trunk/src/plugin/protocol-interactiveselenium/src/java/org/apache/nutch/protocol
* /nutch/trunk/src/plugin/protocol-interactiveselenium/src/java/org/apache/nutch/protocol/interactiveselenium
* /nutch/trunk/src/plugin/protocol-interactiveselenium/src/java/org/apache/nutch/protocol/interactiveselenium/Http.java
* /nutch/trunk/src/plugin/protocol-interactiveselenium/src/java/org/apache/nutch/protocol/interactiveselenium/HttpResponse.java
* /nutch/trunk/src/plugin/protocol-interactiveselenium/src/java/org/apache/nutch/protocol/interactiveselenium/handlers
* /nutch/trunk/src/plugin/protocol-interactiveselenium/src/java/org/apache/nutch/protocol/interactiveselenium/handlers/DefaultHandler.java
* /nutch/trunk/src/plugin/protocol-interactiveselenium/src/java/org/apache/nutch/protocol/interactiveselenium/handlers/InteractiveSeleniumHandler.java
* /nutch/trunk/src/plugin/protocol-interactiveselenium/src/java/org/apache/nutch/protocol/interactiveselenium/package.html


> Add Plugin for interacting with Selenium WebDriver
> --------------------------------------------------
>
>                 Key: NUTCH-2062
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2062
>             Project: Nutch
>          Issue Type: Improvement
>          Components: plugin
>    Affects Versions: 1.10
>            Reporter: Michael Joyce
>            Assignee: Chris A. Mattmann
>              Labels: memex
>             Fix For: 1.11
>
>         Attachments: NUTCH-2062v2.patch
>
>
> The protocol-selenium plugin is great for pulling webpages that dynamically load content.
However, I've run into use cases where I need to actively interact with a page in Selenium
before it becomes useful. For instance, I may need to paginate through a table to get all
results that I'm interested in. This plugin will handle that use case.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message