nutch-dev mailing list archives

From "Lewis John McGibbney (JIRA)" <j...@apache.org>
Subject [jira] [Created] (NUTCH-1395) Show batchId when skipping within ParserJob
Date Thu, 14 Jun 2012 12:29:42 GMT
Lewis John McGibbney created NUTCH-1395:
-------------------------------------------

             Summary: Show batchId when skipping within ParserJob
                 Key: NUTCH-1395
                 URL: https://issues.apache.org/jira/browse/NUTCH-1395
             Project: Nutch
          Issue Type: Bug
          Components: crawldb, parser
    Affects Versions: nutchgora
            Reporter: Lewis John McGibbney
            Priority: Minor
             Fix For: 2.1


Although the ParserJob CLI has been tidied up, its logging still falls short: when a URL is
skipped, the log only reports 'different batch id' without showing the batch ID itself.
{code}
Parsing http://www.trancearoundtheworld.com/tatw/399
Parsing http://www.trancearoundtheworld.com/index.php
Skipping http://www.aboveandbeyond.nu/music; different batch id
Parsing http://www.trancearoundtheworld.com/tatw/425
Parsing http://www.trancearoundtheworld.com/tatw/398
Parsing https://twitter.com/tatw
Parsing http://www.trancearoundtheworld.com/tatw/401
{code}

I would like to see the batch ID included in the message:
{code}
Skipping http://www.aboveandbeyond.nu/music; different batch id ($batchId)
{code}
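
As a rough illustration of the requested format, the message could be assembled along these lines. This is only a sketch: the class name, the helper skipMessage, and the sample batch ID value are hypothetical and are not taken from the Nutch code base. It also assumes the value shown is the batch ID recorded on the skipped page; the issue itself does not say which ID should be printed.

```java
public class SkipMessageSketch {

    // Hypothetical helper (not part of Nutch): builds the skip message
    // in the format requested above, with the batch ID appended in parentheses.
    static String skipMessage(String url, String batchId) {
        return "Skipping " + url + "; different batch id (" + batchId + ")";
    }

    public static void main(String[] args) {
        // The batch ID below is a made-up placeholder value for illustration.
        String url = "http://www.aboveandbeyond.nu/music";
        String batchId = "example-batch-id";
        System.out.println(skipMessage(url, batchId));
    }
}
```

In the job itself the resulting string would presumably be passed to the existing logger call rather than printed to standard output.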


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
