nutch-dev mailing list archives

From "James Tan (JIRA)" <j...@apache.org>
Subject [jira] Commented: (NUTCH-442) Integrate Solr/Nutch
Date Tue, 29 Jul 2008 12:44:31 GMT

    [ https://issues.apache.org/jira/browse/NUTCH-442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12617777#action_12617777 ]

James Tan commented on NUTCH-442:
---------------------------------

I am facing the same issue that Vladimir Garvardt reported above; the output is below. I checked
out the latest Nutch version (Revision 680683) from http://svn.apache.org/repos/asf/lucene/nutch/trunk/
and applied only patch442_v6.patch. Do I need to apply any of the earlier patches to the
latest Nutch version (Revision 680683)? Can anybody please advise on this? Thanks in advance!

.....
Indexer: starting
Indexer: crawldb: crawl.test/crawldb
Indexer: linkdb: crawl.test/linkdb
Indexer: solrUrl: http://localhost:8983/solr/
Indexer: adding segment: file:/nutch-solr/nutch-trunk/crawl.test/segments/20080729183600
Exception in thread "main" java.io.IOException: Job failed!
        at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:894)
        at org.apache.nutch.indexer.Indexer.index(Indexer.java:319)
        at org.apache.nutch.crawl.Crawl.main(Crawl.java:148)

> Integrate Solr/Nutch
> --------------------
>
>                 Key: NUTCH-442
>                 URL: https://issues.apache.org/jira/browse/NUTCH-442
>             Project: Nutch
>          Issue Type: New Feature
>         Environment: Ubuntu linux
>            Reporter: rubdabadub
>         Attachments: Crawl.patch, Indexer.patch, NUTCH-442_v4.patch, NUTCH-442_v5.patch,
> NUTCH-442_v6.patch.txt, NUTCH_442_v3.patch, RFC_multiple_search_backends.patch, schema.xml
>
>
> Hi:
> I tried out Sami's patch for Solr/Nutch integration, which can be found here (http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html),
> and I can confirm it worked :-) That led me to request the following:
> I would be very grateful if this could be included in Nutch 0.9, as I am trying
> to eliminate my Python-based crawler, which posts documents to Solr. As I am in a corporate
> environment, I can't install the trunk version in production, so I am asking for this
> to be included in the 0.9 release. I hope my wish will be granted.
> I look forward to getting some feedback.
> Thank you.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

