nutch-dev mailing list archives

From "Emmanuel Joke (JIRA)" <j...@apache.org>
Subject [jira] Updated: (NUTCH-529) NodeWalker.skipChildren doesn't work for more than 1 child.
Date Fri, 21 Sep 2007 16:05:50 GMT

     [ https://issues.apache.org/jira/browse/NUTCH-529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Emmanuel Joke updated NUTCH-529:
--------------------------------

    Attachment:     (was: TestNodeWalker.java)

> NodeWalker.skipChildren doesn't work for more than 1 child.
> -----------------------------------------------------------
>
>                 Key: NUTCH-529
>                 URL: https://issues.apache.org/jira/browse/NUTCH-529
>             Project: Nutch
>          Issue Type: Bug
>            Reporter: Emmanuel Joke
>             Fix For: 1.0.0
>
>         Attachments: NUTCH-529.patch, TestNodeWalker.java
>
>
> I used NodeWalker to parse an HTML page and to skip elements such as "SELECT" together with their children.
> I noticed that it did not skip the "OPTION" elements that are children of the parent SELECT element.
> The child is skipped when SELECT has only one child element, but when SELECT has 8 child elements they are not skipped.
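
The behaviour the report expects can be illustrated with a minimal sketch. This is not the Nutch NodeWalker source and not the attached NUTCH-529.patch; the names SkipChildrenSketch, SimpleWalker, and visitedElements are hypothetical, and the sketch assumes a stack-based walker over a standard org.w3c.dom tree. Here skipChildren removes every pending node whose parent is the node just returned, so it should behave the same for one child or for eight.

```java
import java.io.StringReader;
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

class SkipChildrenSketch {

  /** Hypothetical stack-based walker; an illustration, not Nutch's NodeWalker. */
  static class SimpleWalker {
    private final Deque<Node> pending = new ArrayDeque<>();
    private Node current;

    SimpleWalker(Node root) {
      pending.push(root);
    }

    boolean hasNext() {
      return !pending.isEmpty();
    }

    Node nextNode() {
      current = pending.pop();
      NodeList children = current.getChildNodes();
      // Push children in reverse so the first child is visited first.
      for (int i = children.getLength() - 1; i >= 0; i--) {
        pending.push(children.item(i));
      }
      return current;
    }

    /** Drops all pending children of the node last returned by nextNode(). */
    void skipChildren() {
      while (current != null
          && !pending.isEmpty()
          && current.isSameNode(pending.peek().getParentNode())) {
        pending.pop();
      }
    }
  }

  /** Walks the document and returns element names, skipping children of "select". */
  static List<String> visitedElements(String xml) {
    Node root;
    try {
      root = DocumentBuilderFactory.newInstance().newDocumentBuilder()
          .parse(new InputSource(new StringReader(xml))).getDocumentElement();
    } catch (Exception e) {
      throw new IllegalStateException("could not parse input", e);
    }
    SimpleWalker walker = new SimpleWalker(root);
    List<String> names = new ArrayList<>();
    while (walker.hasNext()) {
      Node node = walker.nextNode();
      if (node.getNodeType() != Node.ELEMENT_NODE) {
        continue;
      }
      names.add(node.getNodeName());
      if ("select".equalsIgnoreCase(node.getNodeName())) {
        walker.skipChildren();
      }
    }
    return names;
  }

  public static void main(String[] args) {
    // Build a SELECT with 8 OPTION children, as in the report.
    StringBuilder xml = new StringBuilder("<form><select>");
    for (int i = 0; i < 8; i++) {
      xml.append("<option/>");
    }
    xml.append("</select><input/></form>");
    System.out.println(visitedElements(xml.toString())); // prints [form, select, input]
  }
}
```

Checking the parent of each pending node, rather than popping a single entry, is one way to make the skip independent of how many children the element has.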

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

