nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Albinscode <albinsc...@gmail.com>
Subject Re: [jira] [Updated] (NUTCH-1644) Should have a parser that uses xpath
Date Sat, 01 Nov 2014 20:48:50 GMT
Hello everybody,

If some more efforts are to be done on NUTCH-1740, I'll be glad to
help. I developed this plugin because I was amongst people that didn't
want to create new plugins just for few metadata extraction matters ;)

2014-11-01 19:47 GMT+01:00 Lewis John McGibbney (JIRA) <jira@apache.org>:
>
>      [ https://issues.apache.org/jira/browse/NUTCH-1644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
>
> Lewis John McGibbney updated NUTCH-1644:
> ----------------------------------------
>     Fix Version/s:     (was: 2.3)
>                    2.4
>
>> Should have a parser that uses xpath
>> ------------------------------------
>>
>>                 Key: NUTCH-1644
>>                 URL: https://issues.apache.org/jira/browse/NUTCH-1644
>>             Project: Nutch
>>          Issue Type: New Feature
>>          Components: parser
>>    Affects Versions: 2.2.1
>>            Reporter: cihad g├╝zel
>>            Assignee: Lewis John McGibbney
>>              Labels: parser, xpath
>>             Fix For: 2.4
>>
>>         Attachments: NUTCH-1644.patch
>>
>>
>> May want to parse some url via xpath. May be blog or news web sites. Should be a
plugin using xpath parse.
>
>
>
> --
> This message was sent by Atlassian JIRA
> (v6.3.4#6332)

Mime
View raw message