qpid-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF subversion and git services (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (QPID-6209) Ensure node discoverer returns a value for every node even if node is partitioned
Date Tue, 04 Nov 2014 15:33:34 GMT

    [ https://issues.apache.org/jira/browse/QPID-6209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14196229#comment-14196229
] 

ASF subversion and git services commented on QPID-6209:
-------------------------------------------------------

Commit 1636610 from [~k-wall] in branch 'qpid/trunk'
[ https://svn.apache.org/r1636610 ]

QPID-6209: [Java Broker] Ensure node discoverer returns a value for every node even if target
node is partitioned

> Ensure node discoverer returns a value for every node even if node is partitioned
> ---------------------------------------------------------------------------------
>
>                 Key: QPID-6209
>                 URL: https://issues.apache.org/jira/browse/QPID-6209
>             Project: Qpid
>          Issue Type: Bug
>          Components: Java Broker
>    Affects Versions: 0.30
>            Reporter: Keith Wall
>              Labels: HA
>             Fix For: 0.31
>
>
> The BDB HA node discoverer is used by the BDBHAVHN to determine the state of the other
nodes in the group.  The information is presented on the UI so that the Operator has a complete
picture of the group.
> There was an issue found during the HA acceptance test that deliberately forms a network
partition around the a master node.  It was seen that the state of the group when observed
from the other nodes was incorrect:  the group appeared to have two masters - both the newly
elected master and the original master. 
> The underlying issue was the monitoring itself.  JE DbPing hangs indefinitely whilst
trying to connect to the target node.  If the packets are lost (as is the case during our
test which uses iptables DROP), then it hangs indefinitely. This behaviour meant the the node
discoverer (ReplicatedEnvironmentFacade.RemoteNodeStateLearner#discoverNodeStates) returned
no map entry for the partitioned node and thus caused the stale value to linger on the UI.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@qpid.apache.org
For additional commands, e-mail: dev-help@qpid.apache.org


Mime
View raw message