nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chris A. Mattmann (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (NUTCH-2062) Add Plugin for interacting with Selenium WebDriver
Date Sun, 02 Aug 2015 23:10:05 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-2062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14651266#comment-14651266
] 

Chris A. Mattmann commented on NUTCH-2062:
------------------------------------------

Thanks [~mjoyce]! All committed:

{noformat}
[chipotle:~/tmp/nutch-trunk] mattmann% svn commit -m "Fix for NUTCH-2062: Add Plugin for interacting
with Selenium WebDriver contributed by Michael Joyce <mltjoyce@gmail.com> this closes
#46"
Sending        build.xml
Sending        conf/nutch-default.xml
Sending        src/plugin/build.xml
Sending        src/plugin/lib-selenium/src/java/org/apache/nutch/protocol/selenium/HttpWebClient.java
Adding         src/plugin/protocol-interactiveselenium
Adding         src/plugin/protocol-interactiveselenium/README.md
Adding         src/plugin/protocol-interactiveselenium/build-ivy.xml
Adding         src/plugin/protocol-interactiveselenium/build.xml
Adding         src/plugin/protocol-interactiveselenium/ivy.xml
Adding         src/plugin/protocol-interactiveselenium/plugin.xml
Adding         src/plugin/protocol-interactiveselenium/src
Adding         src/plugin/protocol-interactiveselenium/src/java
Adding         src/plugin/protocol-interactiveselenium/src/java/org
Adding         src/plugin/protocol-interactiveselenium/src/java/org/apache
Adding         src/plugin/protocol-interactiveselenium/src/java/org/apache/nutch
Adding         src/plugin/protocol-interactiveselenium/src/java/org/apache/nutch/protocol
Adding         src/plugin/protocol-interactiveselenium/src/java/org/apache/nutch/protocol/interactiveselenium
Adding         src/plugin/protocol-interactiveselenium/src/java/org/apache/nutch/protocol/interactiveselenium/Http.java
Adding         src/plugin/protocol-interactiveselenium/src/java/org/apache/nutch/protocol/interactiveselenium/HttpResponse.java
Adding         src/plugin/protocol-interactiveselenium/src/java/org/apache/nutch/protocol/interactiveselenium/handlers
Adding         src/plugin/protocol-interactiveselenium/src/java/org/apache/nutch/protocol/interactiveselenium/handlers/DefaultHandler.java
Adding         src/plugin/protocol-interactiveselenium/src/java/org/apache/nutch/protocol/interactiveselenium/handlers/InteractiveSeleniumHandler.java
Adding         src/plugin/protocol-interactiveselenium/src/java/org/apache/nutch/protocol/interactiveselenium/package.html
Transmitting file data ..............
Committed revision 1693837.
[chipotle:~/tmp/nutch-trunk] mattmann% 
{noformat}


> Add Plugin for interacting with Selenium WebDriver
> --------------------------------------------------
>
>                 Key: NUTCH-2062
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2062
>             Project: Nutch
>          Issue Type: Improvement
>          Components: plugin
>    Affects Versions: 1.10
>            Reporter: Michael Joyce
>            Assignee: Chris A. Mattmann
>              Labels: memex
>             Fix For: 1.11
>
>         Attachments: NUTCH-2062v2.patch
>
>
> The protocol-selenium plugin is great for pulling webpages that dynamically load content.
However, I've run into use cases where I need to actively interact with a page in Selenium
before it becomes useful. For instance, I may need to paginate through a table to get all
results that I'm interested in. This plugin will handle that use case.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message