lucene-dev mailing list archives

From "Hoss Man (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (SOLR-8914) inexplicable "no servers hosting shard: shard2" using MiniSolrCloudCluster
Date Tue, 29 Mar 2016 02:13:25 GMT

     [ https://issues.apache.org/jira/browse/SOLR-8914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hoss Man updated SOLR-8914:
---------------------------
    Attachment: live_nodes_mentions.log.txt

Attaching another interesting view: live_nodes_mentions.log.txt...

This is every log entry from ZkStateReader mentioning "live nodes" (not just "live nodes size",
so "Updated live nodes from ZooKeeper..." msgs also show up), distilled down to only the timestamp,
port# (or TEST if it's from the main testing thread via MiniSolrCloudCluster or waitForRecoveries),
and log message.

Generated via...
{code}
grep -i "live nodes" jenkins.thetaphi.de_Lucene-Solr-6.x-Solaris_32.log.txt |
  perl -ple 's/.*2> (\d+).*?(\[[^\]]*? \]) o.a.s.c.c.ZkStateReader(.*)/$1 $2 $3/; s/\[n:127.0.0.1:(\d+)_solr\s*?\]/port $1/; s/\[\s+\]/     TEST /' > live_nodes_mentions.log.txt
{code}
... use {{grep -v "port 63099"}} if you want to focus only on the nodes that host a piece
of the collection.
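
As a rough illustration of that {{grep -v}} step, here is a minimal sketch run against a tiny stand-in file. The three sample lines, their timestamps, and port 63102 are made up for the example (only port 63099 and the "Updated live nodes from ZooKeeper..." text come from the description above); against the real attachment you would point grep at live_nodes_mentions.log.txt instead.

```shell
# Hypothetical sample in the distilled "timestamp, port#/TEST, message" layout.
# These lines are invented for illustration; they are not from the real log.
sample=$(mktemp)
cat > "$sample" <<'EOF'
1001 port 63099 Updated live nodes from ZooKeeper...
1002 port 63102 Updated live nodes from ZooKeeper...
1003      TEST  Updated live nodes from ZooKeeper...
EOF

# -v inverts the match: keep every line that does NOT mention port 63099.
filtered=$(grep -v "port 63099" "$sample")
printf '%s\n' "$filtered"

rm -f "$sample"
```

The entries from the other port and from the TEST thread survive, so the remaining view covers only the nodes of interest plus the main testing thread.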


> inexplicable "no servers hosting shard: shard2" using MiniSolrCloudCluster
> --------------------------------------------------------------------------
>
>                 Key: SOLR-8914
>                 URL: https://issues.apache.org/jira/browse/SOLR-8914
>             Project: Solr
>          Issue Type: Bug
>            Reporter: Hoss Man
>         Attachments: jenkins.thetaphi.de_Lucene-Solr-6.x-Solaris_32.log.txt, live_nodes_mentions.log.txt
>
>
> Jenkins encountered a failure in TestTolerantUpdateProcessorCloud over the weekend...
> {noformat}
> http://jenkins.thetaphi.de/job/Lucene-Solr-6.x-Solaris/32/consoleText
> Checking out Revision c46d7686643e7503304cb35dfe546bce9c6684e7 (refs/remotes/origin/branch_6x)
> Using Java: 64bit/jdk1.8.0 -XX:+UseCompressedOops -XX:+UseG1GC
> {noformat}
> The failure happened during the static setup of the test, when a MiniSolrCloudCluster
> and several clients are initialized -- before any code related to TolerantUpdateProcessor
> is ever used.
> I can't reproduce this, or really make sense of what I'm (not) seeing here in the logs,
> so I'm filing this jira with my analysis in the hopes that someone else can help make sense
> of it.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


