lucene-dev mailing list archives

From "Erick Erickson (JIRA)" <j...@apache.org>
Subject [jira] [Resolved] (SOLR-11427) DELETEREPLICA with onlyIfDown specified should succeed if the host node is not present in the live_nodes Znode
Date Mon, 25 Dec 2017 22:16:02 GMT

     [ https://issues.apache.org/jira/browse/SOLR-11427?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Erick Erickson resolved SOLR-11427.
-----------------------------------
    Resolution: Won't Fix

Thinking about this after letting it sit for a while, this is a really bad idea.

Nodes delete just fine if you don't specify onlyIfDown=true. The idea of saying "DELETEREPLICA
will respect the onlyIfDown flag even if the node is marked as active if the host happens
to be absent" is:

1> confusing. Hey, I said onlyIfDown!

2> dangerous. Small programming or scripting errors and onlyIfDown isn't the safety valve
it was intended to be

3> unnecessary. In the scenario outlined, where the node died horribly and is never
coming back, just leave off the onlyIfDown bit and the node will go away.

In short, I see no utility here, plus opportunities for unpleasant things to happen, so
it's not worth the work.
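
For reference, the two forms of the request discussed above might be built like this. This
is only a sketch: the helper name delete_replica_url, the base URL, and the collection,
shard and replica names are made up for illustration; the action and parameter names are
the ones the Collections API uses for DELETEREPLICA.

```python
from urllib.parse import urlencode


def delete_replica_url(base_url, collection, shard, replica, only_if_down=False):
    """Build a Collections API DELETEREPLICA request URL (hypothetical helper)."""
    params = {
        "action": "DELETEREPLICA",
        "collection": collection,
        "shard": shard,
        "replica": replica,
    }
    # onlyIfDown is the safety valve: only add it when the caller really
    # wants the delete refused for a replica that is not marked "down".
    if only_if_down:
        params["onlyIfDown"] = "true"
    return f"{base_url}/admin/collections?{urlencode(params)}"


# Illustrative values only; none of these names come from the issue.
BASE = "http://localhost:8983/solr"
guarded = delete_replica_url(BASE, "c1", "shard1", "core_node3", only_if_down=True)
unguarded = delete_replica_url(BASE, "c1", "shard1", "core_node3")
```

The second form, without onlyIfDown, is the one to use for a replica whose host is gone
for good.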


> DELETEREPLICA with onlyIfDown specified should succeed if the host node is not present in the live_nodes Znode
> --------------------------------------------------------------------------------------------------------------
>
>                 Key: SOLR-11427
>                 URL: https://issues.apache.org/jira/browse/SOLR-11427
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>            Reporter: Erick Erickson
>            Assignee: Erick Erickson
>
> The title says it pretty much, so opening up for discussion:
> Here's the problem. Let's say a node is killed via {{kill -9}}. The state.json file still
> says it's "active", but the node is gone from live_nodes. If the node in question never comes
> back, the replica's state doesn't necessarily get switched to "down", so specifying onlyIfDown
> fails with a "node is active" message. This is all documented more thoroughly in SOLR-9361.
> The question is whether it's sufficient and/or safe to succeed in deleting the replica
> from state.json if the state is "active" _and_ the node is NOT present in live_nodes.
> I'm assigning to myself, but others should feel free to take it.
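
The condition described in the issue can be modelled roughly as below. This is a
simplified sketch, not Solr's code: both function names and the sample node names are
hypothetical, and the replica state and live_nodes are passed in as plain values rather
than read from ZooKeeper.

```python
def only_if_down_allows_delete(replica_state):
    """Model of the onlyIfDown guard: only a replica marked "down" may be deleted."""
    return replica_state == "down"


def is_stale_active(replica_state, node_name, live_nodes):
    """The case from the issue: state.json says "active" but the node is not live."""
    return replica_state == "active" and node_name not in live_nodes


# Illustrative values: node2 was killed with kill -9 and never came back.
live_nodes = {"node1:8983_solr"}
state = "active"  # state.json was never updated
node = "node2:8983_solr"

blocked = not only_if_down_allows_delete(state)      # onlyIfDown refuses the delete
stale = is_stale_active(state, node, live_nodes)     # yet the host is gone
```

The proposal was to let the delete through when the second check is true. It was
resolved as Won't Fix because that would weaken the onlyIfDown guard.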



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


