lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Amrit Sarkar (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SOLR-11278) CdcrBootstrapTest failing in branch_6_6
Date Fri, 01 Sep 2017 17:33:00 GMT

    [ https://issues.apache.org/jira/browse/SOLR-11278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150879#comment-16150879
] 

Amrit Sarkar commented on SOLR-11278:
-------------------------------------

Another example, why two simultaneous threads are getting created to invoke BOOTSTRAP?

{code}
  [beaster]   2> 38858 INFO  (updateExecutor-39-thread-1-processing-n:127.0.0.1:42155_solr
x:cdcr-target_shard1_replica_n1 s:shard1 c:cdcr-target r:core_node2) [n:127.0.0.1:42155_solr
c:cdcr-target s:shard1 r:core_node2 x:cdcr-target_shard1_replica_n1] o.a.s.h.CdcrRequestHandler
what' the lock this time :: true :: thread :: org.apache.solr.handler.CdcrRequestHandler@64d1ccf3
  [beaster]   2> 38858 INFO  (qtp1415866434-168) [n:127.0.0.1:42155_solr c:cdcr-target
s:shard1 r:core_node2 x:cdcr-target_shard1_replica_n1] o.a.s.c.S.Request [cdcr-target_shard1_replica_n1]
 webapp=/solr path=/cdcr params={qt=/cdcr&action=BOOTSTRAP_STATUS&wt=javabin&version=2}
status=0 QTime=0
  [beaster]   2> 38859 WARN  (cdcr-bootstrap-status-66-thread-1-processing-n:127.0.0.1:36193_solr
x:cdcr-source_shard1_replica_n1 s:shard1 c:cdcr-source r:core_node2) [n:127.0.0.1:36193_solr
c:cdcr-source s:shard1 r:core_node2 x:cdcr-source_shard1_replica_n1] o.a.s.h.CdcrReplicatorManager
Bootstrap process was not found on target collection: cdcr-target shard: shard1, leader: http://127.0.0.1:42155/solr/cdcr-target_shard1_replica_n1/
  [beaster]   2> 38860 INFO  (cdcr-bootstrap-status-66-thread-1-processing-n:127.0.0.1:36193_solr
x:cdcr-source_shard1_replica_n1 s:shard1 c:cdcr-source r:core_node2) [n:127.0.0.1:36193_solr
c:cdcr-source s:shard1 r:core_node2 x:cdcr-source_shard1_replica_n1] o.a.s.h.CdcrReplicatorManager
Attempting to bootstrap target collection: cdcr-target shard: shard1 leader: http://127.0.0.1:42155/solr/cdcr-target_shard1_replica_n1/
  [beaster]   2> 38860 INFO  (qtp1415866434-173) [n:127.0.0.1:42155_solr c:cdcr-target
s:shard1 r:core_node2 x:cdcr-target_shard1_replica_n1] o.a.s.h.CdcrRequestHandler Boostrap
is issued now. Request :: {action=BOOTSTRAP&qt=/cdcr&masterUrl=http://127.0.0.1:36193/solr/cdcr-source_shard1_replica_n1/&wt=javabin&version=2}
: collection : cdcr-target
  [beaster]   2> 38865 INFO  (recoveryExecutor-40-thread-1-processing-n:127.0.0.1:42155_solr
x:cdcr-target_shard1_replica_n1 s:shard1 c:cdcr-target r:core_node2) [n:127.0.0.1:42155_solr
c:cdcr-target s:shard1 r:core_node2 x:cdcr-target_shard1_replica_n1] o.a.s.u.UpdateLog Starting
to buffer updates. FSUpdateLog{state=ACTIVE, tlog=null}
  [beaster]   2> 38865 INFO  (qtp1415866434-173) [n:127.0.0.1:42155_solr c:cdcr-target
s:shard1 r:core_node2 x:cdcr-target_shard1_replica_n1] o.a.s.c.S.Request [cdcr-target_shard1_replica_n1]
 webapp=/solr path=/cdcr params={qt=/cdcr&masterUrl=http://127.0.0.1:36193/solr/cdcr-source_shard1_replica_n1/&action=BOOTSTRAP&wt=javabin&version=2}
status=0 QTime=4
  [beaster]   2> 38865 INFO  (updateExecutor-39-thread-2-processing-n:127.0.0.1:42155_solr
x:cdcr-target_shard1_replica_n1 s:shard1 c:cdcr-target r:core_node2) [n:127.0.0.1:42155_solr
c:cdcr-target s:shard1 r:core_node2 x:cdcr-target_shard1_replica_n1] o.a.s.h.CdcrRequestHandler
what' the lock this time :: false :: thread :: org.apache.solr.handler.CdcrRequestHandler@64d1ccf3
{code}

the code is like this:

{code}
private void handleBootstrapAction(SolrQueryRequest req, SolrQueryResponse rsp) throws IOException,
SolrServerException {
...................
...................
    Runnable runnable = () -> {
      Lock recoveryLock = req.getCore().getSolrCoreState().getRecoveryLock();
      boolean locked = recoveryLock.tryLock();
      log.info("what' the lock this time :: " + locked + " :: thread :: " + this);
      SolrCoreState coreState = core.getSolrCoreState();
{code}

> CdcrBootstrapTest failing in branch_6_6
> ---------------------------------------
>
>                 Key: SOLR-11278
>                 URL: https://issues.apache.org/jira/browse/SOLR-11278
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: CDCR
>            Reporter: Amrit Sarkar
>            Assignee: Varun Thacker
>         Attachments: SOLR-11278-cancel-bootstrap-on-stop.patch, SOLR-11278.patch, test_results
>
>
> I ran beast for 10 rounds:
> ant beast -Dtestcase=CdcrBootstrapTest -Dtests.multiplier=2 -Dtests.slow=true -Dtests.locale=vi
-Dtests.timezone=Asia/Yekaterinburg -Dtests.asserts=true -Dtests.file.encoding=US-ASCII -Dbeast.iters=10
> and seeing following failure:
> {code}
>   [beaster] [01:37:16.282] FAILURE  153s | CdcrBootstrapTest.testBootstrapWithSourceCluster
<<<
>   [beaster]    > Throwable #1: java.lang.AssertionError: Document mismatch on target
after sync expected:<2000> but was:<1000>
> {code}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message