nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Dr. Nadine Hochstotter (JIRA)" <j...@apache.org>
Subject [jira] Commented: (NUTCH-694) Distributed Search Server fails
Date Thu, 19 Feb 2009 17:32:02 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12675053#action_12675053
] 

Dr. Nadine Hochstotter commented on NUTCH-694:
----------------------------------------------

Hi Sami,
I tried my best, but still it doesn't work, I updated all, but whenever I have a search query
which would return results I get a blank page and even if I try this in terminal, there are
no results.
But now I have a new exception. Any ideas? Is this because we do not run Nutch in PseudoDistrubutedMode.
But it worked so fine before. I have no idea.
Thank you for any help.

Client log:
2009-02-19 18:15:33,819 INFO  ipc.RPC - Return: true
2009-02-19 18:15:43,819 INFO  ipc.RPC - Call: ping()
2009-02-19 18:15:43,820 INFO  ipc.RPC - Return: true
2009-02-19 18:15:46,426 INFO  ipc.RPC - Call: getProtocolVersion(org.apache.nutch.searcher.RPCS...
2009-02-19 18:15:46,426 INFO  ipc.RPC - Return: 1
2009-02-19 18:15:46,462 INFO  ipc.RPC - Call: getProtocolVersion(org.apache.nutch.searcher.RPCS...
2009-02-19 18:15:46,462 INFO  ipc.RPC - Return: 1
2009-02-19 18:15:46,462 INFO  ipc.RPC - Call: ping()
2009-02-19 18:15:46,463 INFO  ipc.RPC - Return: true
2009-02-19 18:15:46,481 INFO  ipc.RPC - Call: getSegmentNames()
2009-02-19 18:15:46,481 INFO  ipc.RPC - Return: null
2009-02-19 18:15:46,886 INFO  ipc.RPC - Call: search("DEFAULT der", 10, null, null, false)
2009-02-19 18:15:46,896 INFO  ipc.RPC - Return: org.apache.nutch.searcher.Hits@786c1a82
2009-02-19 18:15:53,819 INFO  ipc.RPC - Call: ping()
2009-02-19 18:15:53,820 INFO  ipc.RPC - Return: true
2009-02-19 18:16:03,816 INFO  ipc.RPC - Call: ping()
2009-02-19 18:16:03,816 INFO  ipc.RPC - Return: true
2009-02-19 18:16:13,820 INFO  ipc.RPC - Call: ping()
2009-02-19 18:16:13,820 INFO  ipc.RPC - Return: true
2009-02-19 18:16:23,816 INFO  ipc.RPC - Call: ping()
2009-02-19 18:16:23,816 INFO  ipc.RPC - Return: true

And Tomcat:
INFO: Reloading this Context has started
Feb 19, 2009 6:14:22 PM org.apache.catalina.core.StandardContext addApplicationListener
INFO: The listener "org.apache.nutch.searcher.NutchBean$NutchBeanConstructor" is already configured
for this context. The duplicate definition has been ignored.
2009-02-19 18:14:22,818 INFO  NutchBean - creating new bean
Feb 19, 2009 6:14:26 PM org.apache.catalina.loader.WebappClassLoader loadClass
INFO: Illegal access: this web application instance has been stopped already.  Could not load
org.apache.log4j.spi.NOPLoggerRepository.  The eventual following stack trace is caused by
an error thrown for debugging purposes as well as to attempt to terminate the thread which
caused the illegal access, and has no functional impact.
java.lang.IllegalStateException
	at org.apache.catalina.loader.WebappClassLoader.loadClass(WebappClassLoader.java:1246)
	at org.apache.catalina.loader.WebappClassLoader.loadClass(WebappClassLoader.java:1206)
	at java.lang.ClassLoader.loadClassInternal(ClassLoader.java:320)
	at org.apache.log4j.LogManager.getLoggerRepository(LogManager.java:175)
	at org.apache.log4j.LogManager.getLogger(LogManager.java:199)
	at org.apache.log4j.Logger.getLogger(Logger.java:105)
	at org.apache.commons.logging.impl.Log4JLogger.getLogger(Log4JLogger.java:229)
	at org.apache.commons.logging.impl.Log4JLogger.isDebugEnabled(Log4JLogger.java:239)
	at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:211)
	at $Proxy1.ping(Unknown Source)
	at org.apache.nutch.searcher.DistributedSearchBean$PingWorker.run(DistributedSearchBean.java:94)
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441)
	at java.util.concurrent.FutureTask$Sync.innerRunAndReset(FutureTask.java:317)
	at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:150)
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$101(ScheduledThreadPoolExecutor.java:98)
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.runPeriodic(ScheduledThreadPoolExecutor.java:181)
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:205)
	at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
	at java.lang.Thread.run(Thread.java:619)
Exception in thread "IPC Client (47) connection to pp1.hetzner.itroot.de/78.46.86.99:13001
from an unknown user" java.lang.NoClassDefFoundError: org/apache/log4j/spi/NOPLoggerRepository
	at org.apache.log4j.LogManager.getLoggerRepository(LogManager.java:175)
	at org.apache.log4j.LogManager.getLogger(LogManager.java:199)
	at org.apache.log4j.Logger.getLogger(Logger.java:105)
	at org.apache.commons.logging.impl.Log4JLogger.getLogger(Log4JLogger.java:229)
	at org.apache.commons.logging.impl.Log4JLogger.isDebugEnabled(Log4JLogger.java:239)
	at org.apache.hadoop.ipc.Client$Connection.close(Client.java:560)
	at org.apache.hadoop.ipc.Client$Connection.run(Client.java:441)



> Distributed Search Server fails
> -------------------------------
>
>                 Key: NUTCH-694
>                 URL: https://issues.apache.org/jira/browse/NUTCH-694
>             Project: Nutch
>          Issue Type: Bug
>          Components: searcher
>    Affects Versions: 1.0.0
>         Environment: Single Server with one Nutch instance in DistributedSearchServerMode,
not in PseudoDistirubutedMode
>            Reporter: Dr. Nadine Hochstotter
>            Priority: Blocker
>             Fix For: 1.0.0
>
>         Attachments: NUTCH-694.patch
>
>
> I run Nutch on a single server, I have two crawl directories, that's why I use Nutch
 in distributed search server mode as described in the hadoop manual.
> But since I have a new Trunk Version (04.02.2009) it fails. Local search on one index
works fine. But distributed search throws following exception:
> In catalina.out (server)
> 2009-02-18 17:08:14,906 ERROR NutchBean - org.apache.hadoop.ipc.RemoteException: java.io.IOException:
Unknown Protocol classname:org.apache.nutch.searcher.RPCSegmentBean
>        at org.apache.nutch.searcher.NutchBean.getProtocolVersion(NutchBean.java:403)
>        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>        at java.lang.reflect.Method.invoke(Method.java:597)
>        at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:452)
>        at org.apache.hadoop.ipc.Server$Handler.run(Server.java:892)
>        at org.apache.hadoop.ipc.Client.call(Client.java:696)
>        at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:216)
>        at $Proxy4.getProtocolVersion(Unknown Source)
>        at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:319)
>        at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:306)
>        at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:343)
>        at org.apache.nutch.searcher.DistributedSegmentBean.<init>(DistributedSegmentBean.java:103)
>        at org.apache.nutch.searcher.NutchBean.<init>(NutchBean.java:111)
>        at org.apache.nutch.searcher.NutchBean.<init>(NutchBean.java:80)
>        at org.apache.nutch.searcher.NutchBean$NutchBeanConstructor.contextInitialized(NutchBean.java:422)
>        at org.apache.catalina.core.StandardContext.listenerStart(StandardContext.java:3843)
>        at org.apache.catalina.core.StandardContext.start(StandardContext.java:4350)
>        at org.apache.catalina.core.StandardContext.reload(StandardContext.java:3099)
>        at org.apache.catalina.manager.ManagerServlet.reload(ManagerServlet.java:913)
>        at org.apache.catalina.manager.HTMLManagerServlet.reload(HTMLManagerServlet.java:536)
>        at org.apache.catalina.manager.HTMLManagerServlet.doGet(HTMLManagerServlet.java:114)
>        at javax.servlet.http.HttpServlet.service(HttpServlet.java:690)
>        at javax.servlet.http.HttpServlet.service(HttpServlet.java:803)
>        at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:290)
>        at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:206)
>        at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:233)
>        at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:175)
>        at org.apache.catalina.authenticator.AuthenticatorBase.invoke(AuthenticatorBase.java:525)
>        at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:128)
>        at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:102)
>        at org.apache.catalina.valves.RequestFilterValve.process(RequestFilterValve.java:269)
>        at org.apache.catalina.valves.RemoteAddrValve.invoke(RemoteAddrValve.java:81)
>        at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:109)
>        at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:286)
>        at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:844)
>        at org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.process(Http11Protocol.java:583)
>        at org.apache.tomcat.util.net.JIoEndpoint$Worker.run(JIoEndpoint.java:447)
>        at java.lang.Thread.run(Thread.java:619)
> And in Hadoop.log:
> 2009-02-18 17:07:52,847 INFO  ipc.Server - IPC Server handler 48 on 13001: starting
> 2009-02-18 17:07:52,847 INFO  ipc.Server - IPC Server handler 49 on 13001: starting
> 2009-02-18 17:07:52,847 INFO  ipc.Server - IPC Server handler 40 on 13001: starting
> 2009-02-18 17:08:14,675 INFO  ipc.RPC - Call: getProtocolVersion(org.apache.nutch.searcher.RPCS...
> 2009-02-18 17:08:14,857 INFO  ipc.RPC - Return: 1
> 2009-02-18 17:08:14,878 INFO  ipc.RPC - Call: getProtocolVersion(org.apache.nutch.searcher.RPCS...
> 2009-02-18 17:08:14,879 INFO  ipc.Server - IPC Server handler 0 on 13001, call getProtocolVersion(org.apache.nutch.searcher.RPCSegmentBean,
1) from 78.46.86.99:40851: error: java.io.IOException: Unknown Protocol classname:org.apache.nutch.searcher.RPCSegmentBean
> java.io.IOException: Unknown Protocol classname:org.apache.nutch.searcher.RPCSegmentBean
>        at org.apache.nutch.searcher.NutchBean.getProtocolVersion(NutchBean.java:403)
>        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>        at java.lang.reflect.Method.invoke(Method.java:597)
>        at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:452)
>        at org.apache.hadoop.ipc.Server$Handler.run(Server.java:892)
> 2009-02-18 17:08:14,879 INFO  ipc.RPC - Call: ping()
> 2009-02-18 17:08:14,938 INFO  ipc.RPC - Return: true
> 2009-02-18 17:08:24,876 INFO  ipc.RPC - Call: ping()
> We do not run Nutch in PseudoDistributedMode. We only use the distributed search mode.
With Nutch-0.9 this was working properly.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message