lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Amrit Sarkar (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (SOLR-11467) CdcrBootstrapTest::testBootstrapWithContinousIndexingOnSourceCluster Failure
Date Wed, 11 Oct 2017 15:20:00 GMT

     [ https://issues.apache.org/jira/browse/SOLR-11467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Amrit Sarkar updated SOLR-11467:
--------------------------------
    Attachment: SOLR-11467-debug-code.log

The primary candidate of our culprit is whether the CORRECT collection-checkpoint has been
returned to source cluster of the target. I have put extensive logging at multiple places
*in the uploaded patch* to note versions of the updates we are sending through. I believe
if it fails again, we will be able to get fair amount of idea where the problem lies. 

Bootstrap is clear of any bugs, as of now.

> CdcrBootstrapTest::testBootstrapWithContinousIndexingOnSourceCluster Failure
> ----------------------------------------------------------------------------
>
>                 Key: SOLR-11467
>                 URL: https://issues.apache.org/jira/browse/SOLR-11467
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: CDCR
>    Affects Versions: master (8.0)
>            Reporter: Amrit Sarkar
>         Attachments: SOLR-11467-debug-code.log
>
>
> CdcrBootstrapTest is still failing in master and other branches with:
> {code}
> [junit4] FAILURE  130s J1 | CdcrBootstrapTest.testBootstrapWithContinousIndexingOnSourceCluster
<<<
>    [junit4]    > Throwable #1: java.lang.AssertionError: Document mismatch on target
after sync expected:<2000> but was:<1901>
>    [junit4]    > 	at __randomizedtesting.SeedInfo.seed([41753A7BCCA7C778:953071222BF17483]:0)
>    [junit4]    > 	at org.apache.solr.cloud.CdcrBootstrapTest.testBootstrapWithContinousIndexingOnSourceCluster(CdcrBootstrapTest.java:309)
>    [junit4]    > 	at java.lang.Thread.run(Thread.java:748)
> {code}
> ref: https://jenkins.thetaphi.de/job/Lucene-Solr-7.0-Linux/423/
> From one of the failed Solr jenkins log:
>    [junit4]   2> 1143166 INFO  (cdcr-replicator-4421-thread-1-processing-n:127.0.0.1:62832_solr
x:cdcr-source_shard1_replica_n1 s:shard1 c:cdcr-source r:core_node2) [n:127.0.0.1:62832_solr
c:cdcr-source s:shard1 r:core_node2 x:cdcr-source_shard1_replica_n1] o.a.s.h.CdcrReplicator
Forwarded 991 updates to target cdcr-target
>    [junit4]   2> 1144176 INFO  (cdcr-replicator-4421-thread-1-processing-n:127.0.0.1:62832_solr
x:cdcr-source_shard1_replica_n1 s:shard1 c:cdcr-source r:core_node2) [n:127.0.0.1:62832_solr
c:cdcr-source s:shard1 r:core_node2 x:cdcr-source_shard1_replica_n1] o.a.s.h.CdcrReplicator
Forwarded 909 updates to target cdcr-target
>    [junit4]   2> 1145118 INFO  (cdcr-replicator-4421-thread-1-processing-n:127.0.0.1:62832_solr
x:cdcr-source_shard1_replica_n1 s:shard1 c:cdcr-source r:core_node2) [n:127.0.0.1:62832_solr
c:cdcr-source s:shard1 r:core_node2 x:cdcr-source_shard1_replica_n1] o.a.s.h.CdcrReplicator
Forwarded 0 updates to target cdcr-target
> Total 1900 updates were sent, instead of 2000. Ideally the bootstrap process is responsible
for 1000, and normal cdc replication is responsble for 1000. On looking closely, the bootstrap
is completed successfully. We are 100% sure here, bootstrap worked w/o any issue. And still
1900 updates were sent via replicator, instead of 1000. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message