lucene-solr-user mailing list archives

From Markus Jelsma <markus.jel...@openindex.io>
Subject RE: Apache Nutch 1.5.1 + Apache Solr 4.0
Date Thu, 08 Nov 2012 15:00:47 GMT
Hm, I copied the schema from Nutch's trunk verbatim and only had to change the stemmer. It
seems that, for some reason, you have a float with an extra point dangling around somewhere
(the trace ends in Float.parseFloat with "multiple points"). Can you check?
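
A rough sketch of how one might look for the stray float, not a definitive diagnosis. The
trace fails in Config.getFloat, called from IndexSchema.readSchema; the sketch assumes the
value being read there is the version attribute on the root <schema> element, which nothing
in this thread confirms. It also flags the old EnglishPorterFilterFactory mentioned earlier.
The function name check_schema and the regular expression are illustrative only, and the
regular expression is deliberately stricter than Java's own float grammar.

```python
import re
import xml.etree.ElementTree as ET

# Plain decimal with at most one point. Stricter than Float.parseFloat,
# which also accepts exponents and suffixes; good enough to spot "x.y.z".
PLAIN_FLOAT = re.compile(r"^[+-]?(\d+\.?\d*|\.\d+)$")


def check_schema(root):
    """Return a list of likely problems in a parsed schema.xml root element."""
    problems = []

    # Assumption: the float Solr fails to parse is the root "version" attribute.
    version = root.get("version")
    if version is not None and not PLAIN_FLOAT.match(version.strip()):
        problems.append("schema version %r is not a plain float" % version)

    # The old stemmer factory named in this thread; suggest the replacement.
    for elem in root.iter("filter"):
        cls = elem.get("class", "")
        if cls.endswith("EnglishPorterFilterFactory"):
            problems.append(
                "filter class %s could be changed to solr.PorterStemFilterFactory" % cls
            )
    return problems
```

It could be run against the copied file with something like
check_schema(ET.parse("schema.xml").getroot()), printing whatever the list contains.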
 
-----Original message-----
> From:Antony Steiner <ant.steiner@gmail.com>
> Sent: Thu 08-Nov-2012 15:54
> To: Markus Jelsma <markus.jelsma@openindex.io>; solr-user@lucene.apache.org
> Subject: Re: Apache Nutch 1.5.1 + Apache Solr 4.0
> 
> Hi,
> 
> I just saw there is a schema-solr4.xml and a schema.xml in the nutch conf
> directory. But with both schemas I get the same errors when starting up
> solr.
> Here's the stack trace:
> 
> Nov 8, 2012 3:32:14 PM org.apache.solr.core.SolrConfig <init>
> INFO: Loaded SolrConfig: solrconfig.xml
> Nov 8, 2012 3:32:14 PM org.apache.solr.schema.IndexSchema readSchema
> INFO: Reading Solr Schema
> Nov 8, 2012 3:32:14 PM org.apache.solr.schema.IndexSchema readSchema
> INFO: Schema name=nutch
> Nov 8, 2012 3:32:14 PM org.apache.solr.core.CoreContainer create
> SEVERE: Unable to create core: collection1
> org.apache.solr.common.SolrException: Schema Parsing Failed: multiple points
>         at
> org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:571)
>         at org.apache.solr.schema.IndexSchema.<init>(IndexSchema.java:113)
>         at org.apache.solr.core.CoreContainer.create(CoreContainer.java:846)
>         at org.apache.solr.core.CoreContainer.load(CoreContainer.java:534)
>         at org.apache.solr.core.CoreContainer.load(CoreContainer.java:356)
>         at
> org.apache.solr.core.CoreContainer$Initializer.initialize(CoreContainer.java:308)
>         at
> org.apache.solr.servlet.SolrDispatchFilter.init(SolrDispatchFilter.java:107)
>         at
> org.eclipse.jetty.servlet.FilterHolder.doStart(FilterHolder.java:114)
>         at
> org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59)
>         at
> org.eclipse.jetty.servlet.ServletHandler.initialize(ServletHandler.java:754)
>         at
> org.eclipse.jetty.servlet.ServletContextHandler.startContext(ServletContextHandler.java:258)
>         at
> org.eclipse.jetty.webapp.WebAppContext.startContext(WebAppContext.java:1221)
>         at
> org.eclipse.jetty.server.handler.ContextHandler.doStart(ContextHandler.java:699)
>         at
> org.eclipse.jetty.webapp.WebAppContext.doStart(WebAppContext.java:454)
>         at
> org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59)
>         at
> org.eclipse.jetty.deploy.bindings.StandardStarter.processBinding(StandardStarter.java:36)
>         at
> org.eclipse.jetty.deploy.AppLifeCycle.runBindings(AppLifeCycle.java:183)
>         at
> org.eclipse.jetty.deploy.DeploymentManager.requestAppGoal(DeploymentManager.java:491)
>         at
> org.eclipse.jetty.deploy.DeploymentManager.addApp(DeploymentManager.java:138)
>         at
> org.eclipse.jetty.deploy.providers.ScanningAppProvider.fileAdded(ScanningAppProvider.java:142)
>         at
> org.eclipse.jetty.deploy.providers.ScanningAppProvider$1.fileAdded(ScanningAppProvider.java:53)
>         at org.eclipse.jetty.util.Scanner.reportAddition(Scanner.java:604)
>         at
> org.eclipse.jetty.util.Scanner.reportDifferences(Scanner.java:535)
>         at org.eclipse.jetty.util.Scanner.scan(Scanner.java:398)
>         at org.eclipse.jetty.util.Scanner.doStart(Scanner.java:332)
>         at
> org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59)
>         at
> org.eclipse.jetty.deploy.providers.ScanningAppProvider.doStart(ScanningAppProvider.java:118)
>         at
> org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59)
>         at
> org.eclipse.jetty.deploy.DeploymentManager.startAppProvider(DeploymentManager.java:552)
>         at
> org.eclipse.jetty.deploy.DeploymentManager.doStart(DeploymentManager.java:227)
>         at
> org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59)
>         at
> org.eclipse.jetty.util.component.AggregateLifeCycle.doStart(AggregateLifeCycle.java:63)
>         at
> org.eclipse.jetty.server.handler.AbstractHandler.doStart(AbstractHandler.java:53)
>         at
> org.eclipse.jetty.server.handler.HandlerWrapper.doStart(HandlerWrapper.java:91)
>         at org.eclipse.jetty.server.Server.doStart(Server.java:263)
>         at
> org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59)
>         at
> org.eclipse.jetty.xml.XmlConfiguration$1.run(XmlConfiguration.java:1215)
>         at java.security.AccessController.doPrivileged(Native Method)
>         at
> org.eclipse.jetty.xml.XmlConfiguration.main(XmlConfiguration.java:1138)
>         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>         at
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>         at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>         at java.lang.reflect.Method.invoke(Method.java:597)
>         at org.eclipse.jetty.start.Main.invokeMain(Main.java:457)
>         at org.eclipse.jetty.start.Main.start(Main.java:602)
>         at org.eclipse.jetty.start.Main.main(Main.java:82)
> Caused by: java.lang.NumberFormatException: multiple points
>         at
> sun.misc.FloatingDecimal.readJavaFormatString(FloatingDecimal.java:1082)
>         at java.lang.Float.parseFloat(Float.java:422)
>         at org.apache.solr.core.Config.getFloat(Config.java:284)
>         at
> org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:358)
>         ... 45 more
> Nov 8, 2012 3:32:14 PM org.apache.solr.common.SolrException log
> SEVERE: null:org.apache.solr.common.SolrException: Schema Parsing Failed:
> multiple points
>         at
> org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:571)
>         at org.apache.solr.schema.IndexSchema.<init>(IndexSchema.java:113)
>         at org.apache.solr.core.CoreContainer.create(CoreContainer.java:846)
>         at org.apache.solr.core.CoreContainer.load(CoreContainer.java:534)
>         at org.apache.solr.core.CoreContainer.load(CoreContainer.java:356)
>         at
> org.apache.solr.core.CoreContainer$Initializer.initialize(CoreContainer.java:308)
>         at
> org.apache.solr.servlet.SolrDispatchFilter.init(SolrDispatchFilter.java:107)
>         at
> org.eclipse.jetty.servlet.FilterHolder.doStart(FilterHolder.java:114)
>         at
> org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59)
>         at
> org.eclipse.jetty.servlet.ServletHandler.initialize(ServletHandler.java:754)
>         at
> org.eclipse.jetty.servlet.ServletContextHandler.startContext(ServletContextHandler.java:258)
>         at
> org.eclipse.jetty.webapp.WebAppContext.startContext(WebAppContext.java:1221)
>         at
> org.eclipse.jetty.server.handler.ContextHandler.doStart(ContextHandler.java:699)
>         at
> org.eclipse.jetty.webapp.WebAppContext.doStart(WebAppContext.java:454)
>         at
> org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59)
>         at
> org.eclipse.jetty.deploy.bindings.StandardStarter.processBinding(StandardStarter.java:36)
>         at
> org.eclipse.jetty.deploy.AppLifeCycle.runBindings(AppLifeCycle.java:183)
>         at
> org.eclipse.jetty.deploy.DeploymentManager.requestAppGoal(DeploymentManager.java:491)
>         at
> org.eclipse.jetty.deploy.DeploymentManager.addApp(DeploymentManager.java:138)
>         at
> org.eclipse.jetty.deploy.providers.ScanningAppProvider.fileAdded(ScanningAppProvider.java:142)
>         at
> org.eclipse.jetty.deploy.providers.ScanningAppProvider$1.fileAdded(ScanningAppProvider.java:53)
>         at org.eclipse.jetty.util.Scanner.reportAddition(Scanner.java:604)
>         at
> org.eclipse.jetty.util.Scanner.reportDifferences(Scanner.java:535)
>         at org.eclipse.jetty.util.Scanner.scan(Scanner.java:398)
>         at org.eclipse.jetty.util.Scanner.doStart(Scanner.java:332)
>         at
> org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59)
>         at
> org.eclipse.jetty.deploy.providers.ScanningAppProvider.doStart(ScanningAppProvider.java:118)
>         at
> org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59)
>         at
> org.eclipse.jetty.deploy.DeploymentManager.startAppProvider(DeploymentManager.java:552)
>         at
> org.eclipse.jetty.deploy.DeploymentManager.doStart(DeploymentManager.java:227)
>         at
> org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59)
>         at
> org.eclipse.jetty.util.component.AggregateLifeCycle.doStart(AggregateLifeCycle.java:63)
>         at
> org.eclipse.jetty.server.handler.AbstractHandler.doStart(AbstractHandler.java:53)
>         at
> org.eclipse.jetty.server.handler.HandlerWrapper.doStart(HandlerWrapper.java:91)
>         at org.eclipse.jetty.server.Server.doStart(Server.java:263)
>         at
> org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59)
>         at
> org.eclipse.jetty.xml.XmlConfiguration$1.run(XmlConfiguration.java:1215)
>         at java.security.AccessController.doPrivileged(Native Method)
>         at
> org.eclipse.jetty.xml.XmlConfiguration.main(XmlConfiguration.java:1138)
>         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>         at
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>         at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>         at java.lang.reflect.Method.invoke(Method.java:597)
>         at org.eclipse.jetty.start.Main.invokeMain(Main.java:457)
>         at org.eclipse.jetty.start.Main.start(Main.java:602)
>         at org.eclipse.jetty.start.Main.main(Main.java:82)
> Caused by: java.lang.NumberFormatException: multiple points
>         at
> sun.misc.FloatingDecimal.readJavaFormatString(FloatingDecimal.java:1082)
>         at java.lang.Float.parseFloat(Float.java:422)
>         at org.apache.solr.core.Config.getFloat(Config.java:284)
>         at
> org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:358)
>         ... 45 more
> 
> Regards
> Antony
> 
> 
> 2012/11/8 Markus Jelsma <markus.jelsma@openindex.io>
> 
> > Hi - it fixes it here. Please post the full stack trace.
> >
> > -----Original message-----
> > > From:Antony Steiner <ant.steiner@gmail.com>
> > > Sent: Thu 08-Nov-2012 15:16
> > > To: solr-user@lucene.apache.org
> > > Subject: Re: Apache Nutch 1.5.1 + Apache Solr 4.0
> > >
> > > Hi,
> > >
> > > Thank you for your suggestion. No, it didn't change anything. Should I
> > > post the full stack trace?
> > >
> > > Regards
> > > Antony
> > >
> > >
> > > 2012/11/8 Markus Jelsma <markus.jelsma@openindex.io>
> > >
> > > > Hi,
> > > >
> > > > Your Nutch schema likely points to the old EnglishPorterFilter, which
> > > > doesn't exist anymore. You can change that occurrence to
> > > > PorterStemFilterFactory; that should fix the issue.
> > > >
> > > > -----Original message-----
> > > > > From:Antony Steiner <ant.steiner@gmail.com>
> > > > > Sent: Thu 08-Nov-2012 14:05
> > > > > To: solr-user@lucene.apache.org
> > > > > Subject: Apache Nutch 1.5.1 + Apache Solr 4.0
> > > > >
> > > > > > Hello, my name is Antony and I'm new to Apache Nutch and Solr.
> > > > > >
> > > > > > I want to crawl my website, so I downloaded Nutch to do this.
> > > > > > This works fine. But now I would like to integrate Nutch with Solr.
> > > > > > I'm running this on my Unix system.
> > > > > > I'm trying to follow this tutorial:
> > > > > > http://wiki.apache.org/nutch/NutchTutorial
> > > > > > But it won't work for me. Running Solr without Nutch is no problem. I can
> > > > > > post documents to Solr with post.jar. But what I want to do is post my
> > > > > > Nutch crawl to Solr.
> > > > > > Now if I copy the schema.xml from Nutch to the
> > > > > > apache-solr-4.0.0/example/solr/collection1/conf directory and restart
> > > > > > Solr (java -jar start.jar), I get the errors below, but Solr will start.
> > > > > > (Is this the correct directory to copy my schema to?)
> > > > >
> > > > > Nov 8, 2012 9:40:33 AM org.apache.solr.schema.IndexSchema readSchema
> > > > > INFO: Schema name=nutch
> > > > > Nov 8, 2012 9:40:33 AM org.apache.solr.core.CoreContainer create
> > > > > SEVERE: Unable to create core: collection1
> > > > > > org.apache.solr.common.SolrException: Schema Parsing Failed: multiple points
> > > > > >         at org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:571)
> > > > > >         at org.apache.solr.schema.IndexSchema.<init>(IndexSchema.java:113)
> > > > > > ...
> > > > >
> > > > > Nov 8, 2012 9:40:33 AM org.apache.solr.common.SolrException log
> > > > > > SEVERE: null:org.apache.solr.common.SolrException: Schema Parsing Failed: multiple points
> > > > > >         at org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:571)
> > > > > >         at org.apache.solr.schema.IndexSchema.<init>(IndexSchema.java:113)
> > > > > >         at org.apache.solr.core.CoreContainer.create(CoreContainer.java:846)
> > > > > > ...
> > > > >
> > > > > > Now if I don't copy the schema and push my Nutch crawl to Solr, I get
> > > > > > the following error:
> > > > >
> > > > > SolrIndexer: starting at 2012-11-08 10:49:02
> > > > > Indexing 5 documents
> > > > > java.io.IOException: Job failed!
> > > > > SolrDeleteDuplicates: starting at 2012-11-08 10:49:47
> > > > > SolrDeleteDuplicates: Solr url: http://photon:8983/solr/
> > > > >
> > > > > And this is taken from the logging:
> > > > > > org.apache.solr.common.SolrException: ERROR: [doc=http://e-docs/infrastructure/cpuload_monitor.html] unknown field 'host'
> > > > >
> > > > > What should I do or what am I missing?
> > > > >
> > > > > I hope you can help me
> > > > > Best Regards
> > > > > Antony
> > > > >
> > > >
> > >
> >
> 
