From nutch-dev-return-3292-apmail-lucene-nutch-dev-archive=lucene.apache.org@lucene.apache.org Fri Dec 30 22:21:41 2005 Return-Path: Delivered-To: apmail-lucene-nutch-dev-archive@www.apache.org Received: (qmail 72915 invoked from network); 30 Dec 2005 22:21:41 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 30 Dec 2005 22:21:41 -0000 Received: (qmail 45256 invoked by uid 500); 30 Dec 2005 22:21:41 -0000 Delivered-To: apmail-lucene-nutch-dev-archive@lucene.apache.org Received: (qmail 44899 invoked by uid 500); 30 Dec 2005 22:21:39 -0000 Mailing-List: contact nutch-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: nutch-dev@lucene.apache.org Delivered-To: mailing list nutch-dev@lucene.apache.org Received: (qmail 44888 invoked by uid 99); 30 Dec 2005 22:21:39 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 30 Dec 2005 14:21:39 -0800 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (asf.osuosl.org: domain of canovaj@gmail.com designates 64.233.162.204 as permitted sender) Received: from [64.233.162.204] (HELO zproxy.gmail.com) (64.233.162.204) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 30 Dec 2005 14:21:36 -0800 Received: by zproxy.gmail.com with SMTP id i11so1842204nzh for ; Fri, 30 Dec 2005 14:21:16 -0800 (PST) DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; b=kFJI4EvBW0vwwTLW+ESFJttuELj7R8LOct/Ks7sWcdu7CmOeDE6vIV6m/3novmP3R7XLTNQWabCr84TjxFQXK0UGxL11lwNKxrUhjNHSLxwTulOYzkXQs5Ub8c32BQwSVt+HJAVxhfmT2gRc/9Z3Q4lrPU9x5aPT5BKDYvW0CJw= Received: by 10.65.204.4 with SMTP id g4mr2559377qbq; Fri, 30 Dec 2005 14:21:15 -0800 (PST) Received: by 10.65.204.2 with HTTP; Fri, 30 Dec 2005 14:21:15 -0800 (PST) Message-ID: <70055a110512301421s601cf960m1e6324ab1ec985ab@mail.gmail.com> Date: Fri, 30 Dec 2005 14:21:15 -0800 From: AJ Chen To: nutch-dev@lucene.apache.org Subject: Re: severe error in fetch In-Reply-To: <43B080B9.40403@getopt.org> MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_19308_31271974.1135981275858" References: <43AF1F67.204@gmail.com> <220A6FCB-0287-4B64-A80B-BBF96F1579E2@media-style.com> <70055a110512251513r5b8b0b21xc53c954fb502022d@mail.gmail.com> <43B080B9.40403@getopt.org> X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N ------=_Part_19308_31271974.1135981275858 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Content-Disposition: inline This problem is recurring. It happens when fetching https://www.kodak.com:0/something. I guess the port number 0 is the cause of the problem because there is no problem fetching https://www.kodak.com/anything. see log entries: 051230 105257 fetching https://www.kodak.com:0/eknec/PageQuerier.jhtml?pq-path=3D2/782/2608/2610/4= 074/7058&pq-locale=3Den_US&_loopback=3D1 051230 105305 SEVERE Host connection pool not found, hostConfig=3DHostConfiguration[host=3Dhttps://www.kodak.com] java.lang.RuntimeException: SEVERE error logged. Exiting fetcher. Is it right that some specific port numbers can cause connection pool problem in httpclient? If yes, I can filter out url containing these troubl= e ports before httpclient is fixed. Thanks, AJ On 12/26/05, Andrzej Bialecki wrote: > > AJ Chen wrote: > > >Stefan, > >Here is the trace in my log. My SSFetcher (for site-specific fetch) is > the > >same as nutch Fetcher except that the URLFilters it uses has additional > >filter based on domain names. Line 363 is > > throw new RuntimeException("SEVERE error logged. Exiting > >fetcher."); > > > > > >051224 075950 SEVERE Host connection pool not found, > >hostConfig=3DHostConfiguration[host=3Dhttps://www.kodak.com] > > > > > > This error comes from the httpclient library (you won't get a better > stacktrace, you need to redefine the java.util.logging properties to get > more info). I'm in the process of upgrading to the latest release, but > it's trivial, you can try it yourself. Hopefully this should solve the > issue. > > -- > Best regards, > Andrzej Bialecki <>< > ___. ___ ___ ___ _ _ __________________________________ > [__ || __|__/|__||\/| Information Retrieval, Semantic Web > ___|||__|| \| || | Embedded Unix, System Integration > http://www.sigram.com Contact: info at sigram dot com > > > ------=_Part_19308_31271974.1135981275858--