Confusion in nutch-default between http.content.limit and file.content.limit
----------------------------------------------------------------------------
Key: NUTCH-900
URL: https://issues.apache.org/jira/browse/NUTCH-900
Project: Nutch
Issue Type: Improvement
Affects Versions: 1.2
Reporter: Markus Jelsma
Priority: Trivial
Fix For: 1.2
The http.content.limit and file.content.limit settings are easy to confuse and have already
misled several users. The description element for each setting in nutch-default should be
changed to make the difference between them explicit, so that users are less likely to set
the wrong one.
See also: http://lucene.472066.n3.nabble.com/ERROR-tika-TikaParser-org-apache-pdfbox-io-PushBackInputStream-td964353.html
for a discussion.
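
To illustrate the distinction, here is a rough sketch of how the two properties might look
when overridden in nutch-site.xml. It assumes, as I understand the current behaviour, that
http.content.limit applies only to content fetched over http, that file.content.limit
applies only to file: URLs, and that a negative value disables truncation. The values and
the description wording are illustrative suggestions, not the text that was committed.

```xml
<?xml version="1.0"?>
<!-- Hypothetical nutch-site.xml override; values are examples only. -->
<configuration>

  <property>
    <name>http.content.limit</name>
    <!-- Assumed semantics: limit in bytes, negative means no truncation. -->
    <value>-1</value>
    <description>Suggested wording: the length limit, in bytes, for content
    downloaded using the http protocol. This setting has no effect on content
    read through the file protocol; see file.content.limit for that.
    </description>
  </property>

  <property>
    <name>file.content.limit</name>
    <!-- Example value only: 64 KB. -->
    <value>65536</value>
    <description>Suggested wording: the length limit, in bytes, for content
    read using the file protocol. This setting has no effect on content
    downloaded over http; see http.content.limit for that.
    </description>
  </property>

</configuration>
```

With settings like these, a large PDF fetched over http would not be truncated before
parsing, while changing file.content.limit alone would leave http fetches unaffected.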
--
This message is automatically generated by JIRA.