lucene-dev mailing list archives

From "Varun Thacker (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (SOLR-12866) Reproducing TestLocalFSCloudBackupRestore and TestHdfsCloudBackupRestore failures
Date Tue, 30 Oct 2018 23:24:00 GMT

    [ https://issues.apache.org/jira/browse/SOLR-12866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16669415#comment-16669415 ]

Varun Thacker edited comment on SOLR-12866 at 10/30/18 11:23 PM:
-----------------------------------------------------------------

Here is a patch which tries to write a mock test of replica assignment under the
following conditions -
 # We have created a collection with createNodeSet=EMPTY, so clusterstate has already been
created but no replicas are present (testEmptyCollection.json)
 # Create one replica for each of the 3 shards of the collection on a 2 node cluster

I want to test whether the assignment engine places all of them on only one node. This is what
I am seeing in RestoreCmd, as posted in my earlier comment, so I want to see where we are going
wrong.
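
As a rough illustration of the condition being tested, here is a minimal, self-contained sketch that counts replicas per node for a shard-to-node placement and flags a lopsided result. It does not use any Solr classes: the class name ReplicaSpreadCheck, its methods, and the node names are hypothetical, and the ceil(totalReplicas / numNodes) bound is an assumption chosen to match the "Expected num replicas : 2" figure in the failing assertion, not necessarily what the test code computes.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper, not part of Solr: checks how evenly replicas are spread.
class ReplicaSpreadCheck {

    /** Counts how many replicas each node hosts, given a shard -> node placement. */
    static Map<String, Integer> replicasPerNode(Map<String, String> shardToNode) {
        Map<String, Integer> counts = new HashMap<>();
        for (String node : shardToNode.values()) {
            counts.merge(node, 1, Integer::sum);
        }
        return counts;
    }

    /** Assumed bound for an even spread: ceil(totalReplicas / numNodes). */
    static int expectedMaxPerNode(int totalReplicas, int numNodes) {
        return (totalReplicas + numNodes - 1) / numNodes;
    }

    /** Returns true if no node hosts more replicas than the assumed bound. */
    static boolean isBalanced(Map<String, String> shardToNode, int numNodes) {
        int limit = expectedMaxPerNode(shardToNode.size(), numNodes);
        for (int count : replicasPerNode(shardToNode).values()) {
            if (count > limit) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        // The placement described above: all 3 shards end up on the same node.
        Map<String, String> placement = new HashMap<>();
        placement.put("shard2", "node1");
        placement.put("shard1_0", "node1");
        placement.put("shard1_1", "node1");

        System.out.println("replicas per node: " + replicasPerNode(placement));
        System.out.println("balanced on 2 nodes: " + isBalanced(placement, 2));
    }
}
```

With 3 replicas on a 2 node cluster the assumed bound is 2, so a placement that puts all three on one node is reported as unbalanced, which is the situation the mock test is meant to reproduce.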

 

Unfortunately, as of now I'm stuck getting the mock to work. When I run the test I get
this error:
{code:java}
16:06:45.053 [TEST-TestPolicy.testPolicyForEmptyCollection-seed#[33EAB712F76AB404]] ERROR
org.apache.solr.client.solrj.cloud.autoscaling.Policy - Exception! prefs = [{
  "minimize":"cores",
  "precision":1}, {"maximize":"freedisk"}], recent r1 = node2, r2 = node1, matrix = 2


...
Caused by: org.apache.solr.common.SolrException
    at org.apache.solr.client.solrj.cloud.autoscaling.Policy.setApproxValuesAndSortNodes(Policy.java:314)
    at org.apache.solr.client.solrj.cloud.autoscaling.Policy$Session.applyRules(Policy.java:606)
...{code}
 The test doesn't like the fact that "replicaInfo" is empty, so the preferences
sort algorithm runs into a NullPointerException and throws a RuntimeException.



> Reproducing TestLocalFSCloudBackupRestore and TestHdfsCloudBackupRestore failures
> ---------------------------------------------------------------------------------
>
>                 Key: SOLR-12866
>                 URL: https://issues.apache.org/jira/browse/SOLR-12866
>             Project: Solr
>          Issue Type: Task
>      Security Level: Public(Default Security Level. Issues are Public) 
>            Reporter: Steve Rowe
>            Assignee: Varun Thacker
>            Priority: Major
>         Attachments: SOLR-12866.patch
>
>
> From [https://builds.apache.org/job/Lucene-Solr-BadApples-Tests-7.x/185/], both tests failed 10/10 iterations for me on branch_7x with the seed:
> {noformat}
> Checking out Revision 37fdcb02d87ec44293ec4942c75a3cb709c45418 (refs/remotes/origin/branch_7x)
> [...]
>    [junit4]   2> NOTE: reproduce with: ant test  -Dtestcase=TestLocalFSCloudBackupRestore -Dtests.method=test -Dtests.seed=3CD4284489C09DB4 -Dtests.multiplier=2 -Dtests.slow=true -Dtests.badapples=true -Dtests.locale=mk-MK -Dtests.timezone=Pacific/Kiritimati -Dtests.asserts=true -Dtests.file.encoding=US-ASCII
>    [junit4] FAILURE 10.8s J2 | TestLocalFSCloudBackupRestore.test <<<
>    [junit4]    > Throwable #1: java.lang.AssertionError: Node 127.0.0.1:43864_solr has 3 replicas. Expected num replicas : 2. state: 
>    [junit4]    > DocCollection(backuprestore_restored//collections/backuprestore_restored/state.json/9)={
>    [junit4]    >   "pullReplicas":0,
>    [junit4]    >   "replicationFactor":1,
>    [junit4]    >   "shards":{
>    [junit4]    >     "shard2":{
>    [junit4]    >       "range":"0-7fffffff",
>    [junit4]    >       "state":"active",
>    [junit4]    >       "replicas":{"core_node62":{
>    [junit4]    >           "core":"backuprestore_restored_shard2_replica_n61",
>    [junit4]    >           "base_url":"https://127.0.0.1:43864/solr",
>    [junit4]    >           "node_name":"127.0.0.1:43864_solr",
>    [junit4]    >           "state":"active",
>    [junit4]    >           "type":"NRT",
>    [junit4]    >           "force_set_state":"false",
>    [junit4]    >           "leader":"true"}},
>    [junit4]    >       "stateTimestamp":"1539459703266853250"},
>    [junit4]    >     "shard1_1":{
>    [junit4]    >       "range":"c0000000-ffffffff",
>    [junit4]    >       "state":"active",
>    [junit4]    >       "replicas":{"core_node64":{
>    [junit4]    >           "core":"backuprestore_restored_shard1_1_replica_n63",
>    [junit4]    >           "base_url":"https://127.0.0.1:43864/solr",
>    [junit4]    >           "node_name":"127.0.0.1:43864_solr",
>    [junit4]    >           "state":"active",
>    [junit4]    >           "type":"NRT",
>    [junit4]    >           "force_set_state":"false",
>    [junit4]    >           "leader":"true"}},
>    [junit4]    >       "stateTimestamp":"1539459703266887720"},
>    [junit4]    >     "shard1_0":{
>    [junit4]    >       "range":"80000000-bfffffff",
>    [junit4]    >       "state":"active",
>    [junit4]    >       "replicas":{"core_node66":{
>    [junit4]    >           "core":"backuprestore_restored_shard1_0_replica_n65",
>    [junit4]    >           "base_url":"https://127.0.0.1:43864/solr",
>    [junit4]    >           "node_name":"127.0.0.1:43864_solr",
>    [junit4]    >           "state":"active",
>    [junit4]    >           "type":"NRT",
>    [junit4]    >           "force_set_state":"false",
>    [junit4]    >           "leader":"true"}},
>    [junit4]    >       "stateTimestamp":"1539459703266910800"}},
>    [junit4]    >   "router":{
>    [junit4]    >     "name":"compositeId",
>    [junit4]    >     "field":"shard_s"},
>    [junit4]    >   "maxShardsPerNode":"-1",
>    [junit4]    >   "autoAddReplicas":"false",
>    [junit4]    >   "nrtReplicas":1,
>    [junit4]    >   "tlogReplicas":0}
>    [junit4]    > 	at __randomizedtesting.SeedInfo.seed([3CD4284489C09DB4:B480179E273CF04C]:0)
>    [junit4]    > 	at org.apache.solr.cloud.api.collections.AbstractCloudBackupRestoreTestCase.lambda$testBackupAndRestore$1(AbstractCloudBackupRestoreTestCase.java:339)
>    [junit4]    > 	at java.util.HashMap.forEach(HashMap.java:1289)
>    [junit4]    > 	at org.apache.solr.cloud.api.collections.AbstractCloudBackupRestoreTestCase.testBackupAndRestore(AbstractCloudBackupRestoreTestCase.java:338)
>    [junit4]    > 	at org.apache.solr.cloud.api.collections.AbstractCloudBackupRestoreTestCase.test(AbstractCloudBackupRestoreTestCase.java:144)
>    [junit4]    > 	at org.apache.solr.cloud.api.collections.TestLocalFSCloudBackupRestore.test(TestLocalFSCloudBackupRestore.java:64)
>    [junit4]    > 	at java.lang.Thread.run(Thread.java:748)
> {noformat}
> {noformat}
>    [junit4]   2> NOTE: reproduce with: ant test  -Dtestcase=TestHdfsCloudBackupRestore -Dtests.method=test -Dtests.seed=3CD4284489C09DB4 -Dtests.multiplier=2 -Dtests.slow=true -Dtests.badapples=true -Dtests.locale=bg -Dtests.timezone=Africa/Khartoum -Dtests.asserts=true -Dtests.file.encoding=US-ASCII
>    [junit4] FAILURE 13.3s J0 | TestHdfsCloudBackupRestore.test <<<
>    [junit4]    > Throwable #1: java.lang.AssertionError: Node 127.0.0.1:38450_solr has 3 replicas. Expected num replicas : 2. state: 
>    [junit4]    > DocCollection(hdfsbackuprestore_restored//collections/hdfsbackuprestore_restored/state.json/10)={
>    [junit4]    >   "pullReplicas":0,
>    [junit4]    >   "replicationFactor":1,
>    [junit4]    >   "shards":{
>    [junit4]    >     "shard2":{
>    [junit4]    >       "range":"0-7fffffff",
>    [junit4]    >       "state":"active",
>    [junit4]    >       "replicas":{"core_node62":{
>    [junit4]    >           "core":"hdfsbackuprestore_restored_shard2_replica_n61",
>    [junit4]    >           "base_url":"https://127.0.0.1:38450/solr",
>    [junit4]    >           "node_name":"127.0.0.1:38450_solr",
>    [junit4]    >           "state":"active",
>    [junit4]    >           "type":"NRT",
>    [junit4]    >           "force_set_state":"false",
>    [junit4]    >           "leader":"true"}},
>    [junit4]    >       "stateTimestamp":"1539459705812441110"},
>    [junit4]    >     "shard1_1":{
>    [junit4]    >       "range":"c0000000-ffffffff",
>    [junit4]    >       "state":"active",
>    [junit4]    >       "replicas":{"core_node64":{
>    [junit4]    >           "core":"hdfsbackuprestore_restored_shard1_1_replica_n63",
>    [junit4]    >           "base_url":"https://127.0.0.1:38450/solr",
>    [junit4]    >           "node_name":"127.0.0.1:38450_solr",
>    [junit4]    >           "state":"active",
>    [junit4]    >           "type":"NRT",
>    [junit4]    >           "force_set_state":"false",
>    [junit4]    >           "leader":"true"}},
>    [junit4]    >       "stateTimestamp":"1539459705812477955"},
>    [junit4]    >     "shard1_0":{
>    [junit4]    >       "range":"80000000-bfffffff",
>    [junit4]    >       "state":"active",
>    [junit4]    >       "replicas":{"core_node66":{
>    [junit4]    >           "core":"hdfsbackuprestore_restored_shard1_0_replica_n65",
>    [junit4]    >           "base_url":"https://127.0.0.1:38450/solr",
>    [junit4]    >           "node_name":"127.0.0.1:38450_solr",
>    [junit4]    >           "state":"active",
>    [junit4]    >           "type":"NRT",
>    [junit4]    >           "force_set_state":"false",
>    [junit4]    >           "leader":"true"}},
>    [junit4]    >       "stateTimestamp":"1539459705812506250"}},
>    [junit4]    >   "router":{
>    [junit4]    >     "name":"compositeId",
>    [junit4]    >     "field":"shard_s"},
>    [junit4]    >   "maxShardsPerNode":"-1",
>    [junit4]    >   "autoAddReplicas":"false",
>    [junit4]    >   "nrtReplicas":1,
>    [junit4]    >   "tlogReplicas":0}
>    [junit4]    > 	at __randomizedtesting.SeedInfo.seed([3CD4284489C09DB4:B480179E273CF04C]:0)
>    [junit4]    > 	at org.apache.solr.cloud.api.collections.AbstractCloudBackupRestoreTestCase.lambda$testBackupAndRestore$1(AbstractCloudBackupRestoreTestCase.java:339)
>    [junit4]    > 	at java.util.HashMap.forEach(HashMap.java:1289)
>    [junit4]    > 	at org.apache.solr.cloud.api.collections.AbstractCloudBackupRestoreTestCase.testBackupAndRestore(AbstractCloudBackupRestoreTestCase.java:338)
>    [junit4]    > 	at org.apache.solr.cloud.api.collections.AbstractCloudBackupRestoreTestCase.test(AbstractCloudBackupRestoreTestCase.java:144)
>    [junit4]    > 	at org.apache.solr.cloud.api.collections.TestHdfsCloudBackupRestore.test(TestHdfsCloudBackupRestore.java:213)
>    [junit4]    > 	at java.lang.Thread.run(Thread.java:748)
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
