nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Feng \(Michael\) Ji" <fji...@yahoo.com>
Subject Re: [Nutch-dev] Re: a silly question
Date Sat, 16 Jul 2005 21:06:48 GMT
That works finally,

Thank you so much,

Michael,

--- Piotr Kosiorowski <pkosiorowski@gmail.com> wrote:

> Hello,
> 
> I understood you have all your segments in
> /home/fji/SE/nutch-nightly/crawl.test/
> but according to log file you sent nutch is looking
> in:
> /home/fji/SE/tomcat4/segments
> Please copy your segment directory from 
> /home/fji/SE/nutch-nightly/crawl.test/
> to
> /home/fji/SE/tomcat4/
> and restart tomcat.
> Regards
> Piotr
> 
> 
> Feng (Michael) Ji wrote:
> > thanks all the suggestions,
> > 
> > I caught the catalina.out as following:
> > 
> > "Jul 16, 2005 3:09:27 PM
> > org.apache.coyote.http11.Http11Protocol init
> > INFO: Initializing Coyote HTTP/1.1 on http-8888
> > Starting service Tomcat-Standalone
> > Apache Tomcat/4.1.31
> > Jul 16, 2005 3:09:29 PM
> > org.apache.struts.util.PropertyMessageResources
> <init>
> > INFO: Initializing,
> > config='org.apache.struts.util.LocalStrings',
> > returnNull=true
> > Jul 16, 2005 3:09:29 PM
> > org.apache.struts.util.PropertyMessageResources
> <init>
> > INFO: Initializing,
> > config='org.apache.struts.action.ActionResources',
> > returnNull=true
> > Jul 16, 2005 3:09:29 PM
> > org.apache.struts.util.PropertyMessageResources
> <init>
> > INFO: Initializing,
> >
>
config='org.apache.webapp.admin.ApplicationResources',
> > returnNull=true
> > Jul 16, 2005 3:09:35 PM
> > org.apache.coyote.http11.Http11Protocol start
> > INFO: Starting Coyote HTTP/1.1 on http-8888
> > Jul 16, 2005 3:09:35 PM
> > org.apache.jk.common.ChannelSocket init
> > INFO: JK2: ajp13 listening on /0.0.0.0:8009
> > Jul 16, 2005 3:09:35 PM
> org.apache.jk.server.JkMain
> > start
> > INFO: Jk running ID=0 time=0/81 
> > config=/home/fji/SE/tomcat4/conf/jk2.properties
> > 050716 150937 parsing
> >
>
file:/home/fji/SE/tomcat4/webapps/ROOT/WEB-INF/classes/nutch-default.xml
> > 050716 150937 parsing
> >
>
file:/home/fji/SE/tomcat4/webapps/ROOT/WEB-INF/classes/nutch-site.xml
> > 050716 150937 Plugins: looking in:
> >
>
/home/fji/SE/tomcat4/webapps/ROOT/WEB-INF/classes/plugins
> > 050716 150937 not including:
> >
>
/home/fji/SE/tomcat4/webapps/ROOT/WEB-INF/classes/plugins/clustering-carrot2
> > 050716 150937 not including:
> >
>
/home/fji/SE/tomcat4/webapps/ROOT/WEB-INF/classes/plugins/creativecommons
> > 050716 150937 parsing:
> >
>
/home/fji/SE/tomcat4/webapps/ROOT/WEB-INF/classes/plugins/index-basic/plugin.xml
> > 050716 150937 impl:
> > point=org.apache.nutch.indexer.IndexingFilter
> >
>
class=org.apache.nutch.indexer.basic.BasicIndexingFilter
> > 050716 150937 not including:
> >
>
/home/fji/SE/tomcat4/webapps/ROOT/WEB-INF/classes/plugins/index-more
> > 050716 150937 not including:
> >
>
/home/fji/SE/tomcat4/webapps/ROOT/WEB-INF/classes/plugins/language-identifier
> > 050716 150937 not including:
> >
>
/home/fji/SE/tomcat4/webapps/ROOT/WEB-INF/classes/plugins/ontology
> > 050716 150937 not including:
> >
>
/home/fji/SE/tomcat4/webapps/ROOT/WEB-INF/classes/plugins/parse-ext
> > 050716 150937 parsing:
> >
>
/home/fji/SE/tomcat4/webapps/ROOT/WEB-INF/classes/plugins/parse-html/plugin.xml
> > 050716 150937 impl:
> > point=org.apache.nutch.parse.Parser
> > class=org.apache.nutch.parse.html.HtmlParser
> > 050716 150937 parsing:
> >
>
/home/fji/SE/tomcat4/webapps/ROOT/WEB-INF/classes/plugins/parse-js/plugin.xml
> > 050716 150937 impl:
> > point=org.apache.nutch.parse.Parser
> > class=org.apache.nutch.parse.js.JSParseFilter
> > 050716 150937 impl:
> > point=org.apache.nutch.parse.HtmlParseFilter
> > class=org.apache.nutch.parse.js.JSParseFilter
> > 050716 150937 not including:
> >
>
/home/fji/SE/tomcat4/webapps/ROOT/WEB-INF/classes/plugins/parse-msword
> > 050716 150937 not including:
> >
>
/home/fji/SE/tomcat4/webapps/ROOT/WEB-INF/classes/plugins/parse-pdf
> > 050716 150937 parsing:
> >
>
/home/fji/SE/tomcat4/webapps/ROOT/WEB-INF/classes/plugins/parse-text/plugin.xml
> > 050716 150937 impl:
> > point=org.apache.nutch.parse.Parser
> > class=org.apache.nutch.parse.text.TextParser
> > 050716 150937 not including:
> >
>
/home/fji/SE/tomcat4/webapps/ROOT/WEB-INF/classes/plugins/protocol-file
> > 050716 150937 not including:
> >
>
/home/fji/SE/tomcat4/webapps/ROOT/WEB-INF/classes/plugins/protocol-ftp
> > 050716 150937 not including:
> >
>
/home/fji/SE/tomcat4/webapps/ROOT/WEB-INF/classes/plugins/protocol-http
> > 050716 150937 parsing:
> >
>
/home/fji/SE/tomcat4/webapps/ROOT/WEB-INF/classes/plugins/protocol-httpclient/plugin.xml
> > 050716 150937 impl:
> > point=org.apache.nutch.protocol.Protocol
> > class=org.apache.nutch.protocol.httpclient.Http
> > 050716 150937 impl:
> > point=org.apache.nutch.protocol.Protocol
> > class=org.apache.nutch.protocol.httpclient.Http
> > 050716 150937 parsing:
> >
>
/home/fji/SE/tomcat4/webapps/ROOT/WEB-INF/classes/plugins/query-basic/plugin.xml
> > 050716 150937 impl:
> > point=org.apache.nutch.searcher.QueryFilter
> >
>
class=org.apache.nutch.searcher.basic.BasicQueryFilter
> > 050716 150937 not including:
> >
>
/home/fji/SE/tomcat4/webapps/ROOT/WEB-INF/classes/plugins/query-more
> > 050716 150937 parsing:
> >
>
/home/fji/SE/tomcat4/webapps/ROOT/WEB-INF/classes/plugins/query-site/plugin.xml
> > 050716 150937 impl:
> > point=org.apache.nutch.searcher.QueryFilter
> >
> class=org.apache.nutch.searcher.site.SiteQueryFilter
> > 050716 150937 parsing:
> >
>
/home/fji/SE/tomcat4/webapps/ROOT/WEB-INF/classes/plugins/query-url/plugin.xml
> > 050716 150937 impl:
> > point=org.apache.nutch.searcher.QueryFilter
> > class=org.apache.nutch.searcher.url.URLQueryFilter
> > 050716 150937 not including:
> >
>
/home/fji/SE/tomcat4/webapps/ROOT/WEB-INF/classes/plugins/urlfilter-prefix
> > 050716 150937 parsing:
> >
>
/home/fji/SE/tomcat4/webapps/ROOT/WEB-INF/classes/plugins/urlfilter-regex/plugin.xml
> > 050716 150937 impl:
> > point=org.apache.nutch.net.URLFilter
> > class=org.apache.nutch.net.RegexURLFilter
> > 050716 150937 11 creating new bean
> > 050716 150937 11 opening segment indexes in
> > /home/fji/SE/tomcat4/segments
> > "
> > 
> > I didn't see any complain, or I miss something
> > important?
> > 
> > The browser gives me the same error message as
> before.
> > 
> > Michael,
> > 
> > --- yoursoft <yoursoft@freemail.hu> wrote:
> > 
> > 
> >>I think please check your log/catalina.out for
> more
> >>details of error.
> >>
> >>Howie Wang wrotte:
> >>
> >>
> >>>>hi howie:
> >>>>
> >>>>For search.dir value of both nutch-default.xml
> >>
> >>and
> >>
> >>>>nutch-site.xml;
> >>>
> >>>
> >
>
<value>/home/fji/SE/nutch-nightly/crawl.test/</value>
> 
=== message truncated ===


__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 

Mime
View raw message