nutch-dev mailing list archives

From "Asitang Mishra (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (NUTCH-2110) Create the capability to provide seeds in the form of "url+xpath(including option to enter seach terms).selenium"
Date Mon, 28 Sep 2015 19:35:04 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14933845#comment-14933845 ]

Asitang Mishra commented on NUTCH-2110:
---------------------------------------

The question, I think, is whether to keep everything under one single URL in the end (which
is how it works in practice) or under some newly concocted URL. I am not sure whether, in the
end, one needs to split all this data into separate parts or not. This needs more thought.
Meanwhile, I created two more sub-tasks that do more specific things by passing standardized
key-value pairs to the injector. Let us focus on those for now, and then we can come back
to this issue, which is a little abstract.
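
To illustrate the key-value idea, here is a minimal sketch in Python (not Nutch code) of
how a seed line carrying metadata could be split into a URL and its key-value pairs. It
assumes the tab-separated "url<TAB>key=value" seed layout that the injector accepts. The
keys selenium.xpath and selenium.search and the helper parse_seed_line are hypothetical
names chosen only for this example; the sub-tasks may settle on different ones.

```python
def parse_seed_line(line):
    """Split one seed line into (url, metadata dict).

    Assumes fields are tab-separated: the URL first, then key=value pairs.
    """
    fields = [f.strip() for f in line.rstrip("\n").split("\t")]
    url, metadata = fields[0], {}
    for field in fields[1:]:
        # Split on the first '=' only, so values such as XPath
        # expressions may themselves contain '='.
        key, sep, value = field.partition("=")
        if not sep:
            continue  # ignore empty or malformed fields without '='
        metadata[key.strip()] = value.strip()
    return url, metadata


# Hypothetical seed: the metadata keys below are assumptions, not Nutch settings.
seed = (
    "http://example.com/search"
    "\tselenium.xpath=//input[@name='q']"
    "\tselenium.search=nutch"
)
url, meta = parse_seed_line(seed)
```

Keeping the XPath and search terms in metadata rather than in the URL itself would leave
the stored URL unchanged, which bears on the single-URL question above.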

> Create the capability to provide seeds in the form of "url+xpath(including option to enter seach terms).selenium"
> ------------------------------------------------------------------------------------------------------------------
>
>                 Key: NUTCH-2110
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2110
>             Project: Nutch
>          Issue Type: Sub-task
>          Components: fetcher
>    Affects Versions: 1.10
>            Reporter: Asitang Mishra
>              Labels: memex
>
> Create the capability to provide seeds in the form of "url+xpath(including option to
> enter search terms).selenium", to be used by Selenium protocols/plugins as the URLs/flow
> to reach a specific AJAX-based page, or to save the state of a Selenium operation for the
> next fetching round.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
