From: "Julien Nioche (JIRA)"
To: dev@nutch.apache.org
Date: Wed, 8 Sep 2010 07:02:33 -0400 (EDT)
Subject: [jira] Assigned: (NUTCH-900) Confusion in nutch-default between http.content.limit and file.content.limit

     [ https://issues.apache.org/jira/browse/NUTCH-900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Julien Nioche reassigned
NUTCH-900:
-----------------------------------

    Assignee: Julien Nioche

> Confusion in nutch-default between http.content.limit and file.content.limit
> ----------------------------------------------------------------------------
>
>                 Key: NUTCH-900
>                 URL: https://issues.apache.org/jira/browse/NUTCH-900
>             Project: Nutch
>          Issue Type: Improvement
>    Affects Versions: 1.2, 2.0
>            Reporter: Markus Jelsma
>            Assignee: Julien Nioche
>            Priority: Trivial
>             Fix For: 1.2, 2.0
>
>         Attachments: NUTCH-900.MarkusJelsma.100908.patch.txt
>
>
> The http.content.limit and file.content.limit settings are easy to confuse and have misled several users. The description element for each of these settings should be changed to spell out the difference between them, so that users are not misled so easily.
> See also: http://lucene.472066.n3.nabble.com/ERROR-tika-TikaParser-org-apache-pdfbox-io-PushBackInputStream-td964353.html for a discussion.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.