nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Lewis John McGibbney (JIRA)" <>
Subject [jira] [Updated] (NUTCH-1568) port pluggable indexing architecture to 2.x
Date Mon, 13 Jan 2014 18:44:58 GMT


Lewis John McGibbney updated NUTCH-1568:

    Attachment: NUTCH-1568-v4.patch

Patch for 2.x HEAD. [~talat], changes from last patch include

 * reverts changes to org.apache.nutch.indexer.elastic package. We should deal with this exclusively
within its own issue. Simply deleting the classes for the time being is not a good idea IMHO.
 * Upgrades all Solr dependencies to 4.6.0 (most recent) including the schema.xml, which now
acts as the ONLY schema. Consequently we remove schema-solr4.xml
 * Begins to sort out possibly classloading issues regarding the dependencies included in
the new indexer-solr plugin.xml file. If you take a look in to the generated runtime/local/plugin/indexer-solr/
directory, you will see duplicate versions of several libraries... this is not good and a
waste of resources.

I am having one problem even running this patch with Solr 4.4.0, namely that when I invoke

./bin/nutch index -all -D solr.server.url=http://localhost:8983/solr/ I get the default logging
message e.g.

java.lang.Exception: java.lang.RuntimeException: Missing SOLR URL. Should be set via -D solr.server.url
	solr.server.url : URL of the SOLR instance (mandatory)
	solr.commit.size : buffer size when sending to SOLR (default 1000)
	solr.mapping.file : name of the mapping file for fields (default solrindex-mapping.xml)
	solr.auth : use authentication (default false)
	solr.auth.username : use authentication (default false)
	solr.auth : username for authentication
	solr.auth.password : password for authentication 

> port pluggable indexing architecture to 2.x
> -------------------------------------------
>                 Key: NUTCH-1568
>                 URL:
>             Project: Nutch
>          Issue Type: Improvement
>          Components: indexer
>    Affects Versions: 2.2
>            Reporter: Lewis John McGibbney
>             Fix For: 2.3
>         Attachments: NUTCH-1568-v2.patch, NUTCH-1568-v3.path, NUTCH-1568-v4.patch, NUTCH-1568.patch
> I would like to port the work done by Julien on NUTCH-1047 to 2.x. This issue should
track that. It would be nice to do the upgrade in NUTCH-1486 before we do the upgrade so that
people can get using with solr 4.x ASAP.

This message was sent by Atlassian JIRA

View raw message