nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Markus Jelsma (JIRA)" <j...@apache.org>
Subject [jira] [Closed] (NUTCH-1107) Log slow parse entries
Date Mon, 18 Jan 2016 20:48:40 GMT

     [ https://issues.apache.org/jira/browse/NUTCH-1107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Markus Jelsma closed NUTCH-1107.
--------------------------------
    Resolution: Won't Fix

> Log slow parse entries
> ----------------------
>
>                 Key: NUTCH-1107
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1107
>             Project: Nutch
>          Issue Type: Improvement
>          Components: parser
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>            Priority: Trivial
>
> Parse mapper and outputformat should have a facility to log (configurable) slow entries.
This is useful for debugging slow parses. Logging parser keys only is not good enough, especially
in a distributed environment.
> Sometimes the actual parse (mapper) is very slow and sometimes the normalization and
filtering of an entry's outlinks is slow.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message