lucene-dev mailing list archives

From "Mike Drob (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (SOLR-7734) MapReduce Indexer can error when using collection
Date Mon, 29 Jun 2015 20:09:04 GMT

     [ https://issues.apache.org/jira/browse/SOLR-7734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Mike Drob updated SOLR-7734:
----------------------------
    Attachment: SOLR-7734.patch

Attached a patch that adds a new (embedded) solrconfig.xml file to the map-reduce contrib
module. This config will be loaded in lieu of the one found in ZK for the collection. The
embedded config features a minimal operational footprint, disabling most request handlers,
update chains, and soft commits. It can be overridden by explicitly specifying a --solr-home-dir
argument on the command line when launching the job.
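
As a rough illustration of the precedence described above (an explicit --solr-home-dir wins, otherwise the embedded config is used rather than the one in ZK), here is a minimal Java sketch. The class, enum, and method names are hypothetical and are not taken from the patch; the actual implementation in SOLR-7734.patch may be structured quite differently.

```java
/** Hypothetical sketch of the config-selection precedence; not code from the patch. */
public class ConfigSourceChooser {

    /** Where the indexer's solrconfig.xml would be taken from. */
    public enum ConfigSource { EXPLICIT_SOLR_HOME, EMBEDDED_DEFAULT }

    /**
     * Picks the config source.
     *
     * @param solrHomeDir value passed via --solr-home-dir, or null when the flag is absent
     * @return EXPLICIT_SOLR_HOME when a non-blank directory was given, else EMBEDDED_DEFAULT
     */
    public static ConfigSource choose(String solrHomeDir) {
        boolean explicit = solrHomeDir != null && !solrHomeDir.trim().isEmpty();
        // Note that the collection's config in ZK is never selected in this sketch.
        return explicit ? ConfigSource.EXPLICIT_SOLR_HOME : ConfigSource.EMBEDDED_DEFAULT;
    }

    public static void main(String[] args) {
        // Treat the first program argument as the (optional) --solr-home-dir value.
        String solrHomeDir = args.length > 0 ? args[0] : null;
        System.out.println(choose(solrHomeDir));
    }
}
```

The point of the sketch is only that the ZK-hosted solrconfig.xml drops out of the decision entirely; a user who needs the cluster's handlers or update chains would have to supply them through their own Solr home directory.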

> MapReduce Indexer can error when using collection
> -------------------------------------------------
>
>                 Key: SOLR-7734
>                 URL: https://issues.apache.org/jira/browse/SOLR-7734
>             Project: Solr
>          Issue Type: Bug
>          Components: contrib - MapReduce
>    Affects Versions: 5.2.1
>            Reporter: Mike Drob
>             Fix For: 5.3, Trunk
>
>         Attachments: SOLR-7734.patch
>
>
> When running the MapReduceIndexerTool, it will usually pull a {{solrconfig.xml}} from
> ZK for the collection that it is running against. This can be problematic for several reasons:
> * Performance: The configuration in ZK will likely have several query handlers, and lots
> of other components that don't make sense in an indexing-only use of EmbeddedSolrServer (ESS).
> * Classpath Resources: If the Solr services are using some kind of additional service
> (such as Sentry for auth) then the indexer will not have access to the necessary configurations
> without the user jumping through several hoops.
> * Distinct Configuration Needs: Enabling Soft Commits on the ESS doesn't make sense.
> There's other configurations that
> * Update Chain Behaviours: I'm under the impression that UpdateChains may behave differently
> in ESS than a SolrCloud cluster. Is it safe to depend on consistent behaviour here?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org

