nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Markus Jelsma (JIRA)" <j...@apache.org>
Subject [jira] Created: (NUTCH-900) Confusion in nutch-default between http.content.limit and file.content.limit
Date Wed, 08 Sep 2010 10:06:34 GMT
Confusion in nutch-default between http.content.limit and file.content.limit
----------------------------------------------------------------------------

                 Key: NUTCH-900
                 URL: https://issues.apache.org/jira/browse/NUTCH-900
             Project: Nutch
          Issue Type: Improvement
    Affects Versions: 1.2
            Reporter: Markus Jelsma
            Priority: Trivial
             Fix For: 1.2


The http.content.limit and file.content.limit settings can be confusing and have fooled at
least several users. The description element for these settings should be changed to reflect
the difference between them so users won't be fooled that easy.
See also: http://lucene.472066.n3.nabble.com/ERROR-tika-TikaParser-org-apache-pdfbox-io-PushBackInputStream-td964353.html
for a discussion.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message