lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hendrik Haddorp (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SOLR-10092) HDFS: AutoAddReplica fails
Date Thu, 23 Feb 2017 07:46:44 GMT

    [ https://issues.apache.org/jira/browse/SOLR-10092?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15880065#comment-15880065
] 

Hendrik Haddorp commented on SOLR-10092:
----------------------------------------

For a setup using a local filesystem I did not see this code to be triggered at all. But I
was just trying to reproduce this on an unpatched installation and for some reason it looks
like it worked now as well. So am going to recheck again. From what I saw in the code it looked
like the code required the shard id/name to be set, which is also what the exception said,
but the OverseerAutoReplicaFailoverThread is not doing that.

Regarding the instance dir. I'm seeing this in the logs:
2017-02-23 06:43:13.968 INFO  (qtp1224347463-12) [c:test.test s:shard1 r:core_node3 x:test.test_shard1_replica1]
o.a.s.c.SolrCore [[test.test_shard1_replica1] ] Opening new SolrCore at [/var/opt/solr/test.test_shard1_replica1],
dataDir=[hdfs://my-hdfs-namenode:8000/solr/test.test/core_node3/data/]
So even for HDFS there is local information. The folder only contains a core.properties file.
Seems to contain everything required to determine the replica. Not sure why this is not taken
from ZooKeeper though.

> HDFS: AutoAddReplica fails
> --------------------------
>
>                 Key: SOLR-10092
>                 URL: https://issues.apache.org/jira/browse/SOLR-10092
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: hdfs
>    Affects Versions: 6.3
>            Reporter: Hendrik Haddorp
>         Attachments: SOLR-10092.patch
>
>
> OverseerAutoReplicaFailoverThread fails to create replacement core with this exception:
> o.a.s.c.OverseerAutoReplicaFailoverThread Exception trying to create new replica on http://...:9000/solr:org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException:
Error from server at http://...:9000/solr: Error CREATEing SolrCore 'test2.collection-09_shard1_replica1':
Unable to create core [test2.collection-09_shard1_replica1] Caused by: No shard id for CoreDescriptor[name=test2.collection-09_shard1_replica1;instanceDir=/var/opt/solr/test2.collection-09_shard1_replica1]
>     at org.apache.solr.client.solrj.impl.HttpSolrClient.executeMethod(HttpSolrClient.java:593)
>     at org.apache.solr.client.solrj.impl.HttpSolrClient.request(HttpSolrClient.java:262)
>     at org.apache.solr.client.solrj.impl.HttpSolrClient.request(HttpSolrClient.java:251)
>     at org.apache.solr.client.solrj.SolrClient.request(SolrClient.java:1219)
>     at org.apache.solr.cloud.OverseerAutoReplicaFailoverThread.createSolrCore(OverseerAutoReplicaFailoverThread.java:456)
>     at org.apache.solr.cloud.OverseerAutoReplicaFailoverThread.lambda$addReplica$0(OverseerAutoReplicaFailoverThread.java:251)
>     at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>     at org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor.lambda$execute$0(ExecutorUtil.java:229)
>     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
>     at java.lang.Thread.run(Thread.java:745) 
> also see this mail thread about the issue: https://lists.apache.org/thread.html/%3CCAA70BoWyzbvQuJTyzaG4Kx1tj0Djgcm+MV=x_HoAc1e6CSE7ww@mail.gmail.com%3E



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message