nutch-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Michael Joyce (JIRA)" <j...@apache.org>
Subject [jira] [Created] (NUTCH-1987) Make bin/crawl indexer agnostic
Date Wed, 15 Apr 2015 15:45:58 GMT
Michael Joyce created NUTCH-1987:
------------------------------------

             Summary: Make bin/crawl indexer agnostic
                 Key: NUTCH-1987
                 URL: https://issues.apache.org/jira/browse/NUTCH-1987
             Project: Nutch
          Issue Type: Improvement
    Affects Versions: 1.9
            Reporter: Michael Joyce
             Fix For: 1.10


The crawl script makes it a bit challenging to use an indexer that isn't Solr. For instance,
when I want to use the indexer-elastic plugin I still need to call the crawler script with
a fake Solr URL otherwise it will skip the indexing step all together.

{code}
bin/crawl urls/ crawl/ "http://fakeurl.com:9200" 1
{code}

It would be nice to keep configuration for the Solr indexer in the conf files (to mirror the
elastic search indexer conf and others) and to make the indexing parameter simply toggle whether
indexing does or doesn't occur instead of also trying to configure the indexer at the same
time.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message