nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Julien Nioche (JIRA)" <j...@apache.org>
Subject [jira] Assigned: (NUTCH-900) Confusion in nutch-default between http.content.limit and file.content.limit
Date Wed, 08 Sep 2010 11:02:33 GMT

     [ https://issues.apache.org/jira/browse/NUTCH-900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Julien Nioche reassigned NUTCH-900:
-----------------------------------

    Assignee: Julien Nioche

> Confusion in nutch-default between http.content.limit and file.content.limit
> ----------------------------------------------------------------------------
>
>                 Key: NUTCH-900
>                 URL: https://issues.apache.org/jira/browse/NUTCH-900
>             Project: Nutch
>          Issue Type: Improvement
>    Affects Versions: 1.2, 2.0
>            Reporter: Markus Jelsma
>            Assignee: Julien Nioche
>            Priority: Trivial
>             Fix For: 1.2, 2.0
>
>         Attachments: NUTCH-900.MarkusJelsma.100908.patch.txt
>
>
> The http.content.limit and file.content.limit settings can be confusing and have fooled
at least several users. The description element for these settings should be changed to reflect
the difference between them so users won't be fooled that easy.
> See also: http://lucene.472066.n3.nabble.com/ERROR-tika-TikaParser-org-apache-pdfbox-io-PushBackInputStream-td964353.html
for a discussion.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message