hbase-user mailing list archives

From Stack <st...@duboce.net>
Subject Re: Query Regarding Design Strategy behind Abortable.
Date Fri, 25 Mar 2011 15:31:23 GMT
On Fri, Mar 25, 2011 at 1:56 AM, Mohit <mohitsikri@huawei.com> wrote:
> Why not reconnect to ZooKeeper (at least try once, and then abort if
> unsuccessful) and reset the trackers/watchers, instead of aborting/killing
> HMaster/HRegionServers, just as is done in one of the implementations of
> Abortable, namely HConnectionImplementation in HConnectionManager?

Hello Mohit:

The ZooKeeper client is doing what you describe, sort of.  On session
timeout, it does a reconnect to the ensemble to ask if its session has
indeed expired.  If it has, then it'll log session expired.

The regionserver will kill itself on loss of session because it's
likely that the data it was hosting has already been taken over by
another regionserver.
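
To make the distinction concrete, here is a minimal sketch of the
policy described above: a dropped connection is left to the client to
repair, and only a confirmed session expiry triggers an abort.  It is a
toy model, not HBase or ZooKeeper code.  The SessionEvent enum only
borrows the names of ZooKeeper's KeeperState values, and the Abortable
interface, ToyRegionServer and SessionWatcher are hypothetical,
simplified stand-ins invented for illustration.

```java
public class SessionExpirySketch {

    /** Toy session states; names borrowed from ZooKeeper's KeeperState values. */
    enum SessionEvent { DISCONNECTED, SYNC_CONNECTED, EXPIRED }

    /** Hypothetical, simplified stand-in for an abortable server (not HBase's interface). */
    interface Abortable {
        void abort(String why);
        boolean isAborted();
    }

    /** Toy server that only records whether, and why, it was aborted. */
    static class ToyRegionServer implements Abortable {
        private boolean aborted = false;
        private String reason = null;

        @Override public void abort(String why) { aborted = true; reason = why; }
        @Override public boolean isAborted() { return aborted; }
        String reason() { return reason; }
    }

    /** Reacts to session events: tolerate disconnects, abort on expiry. */
    static class SessionWatcher {
        private final Abortable server;

        SessionWatcher(Abortable server) { this.server = server; }

        void process(SessionEvent event) {
            switch (event) {
                case DISCONNECTED:
                    // Connection lost, but the session may still be alive:
                    // the client is assumed to reconnect on its own.
                    break;
                case SYNC_CONNECTED:
                    // Reconnected within the session timeout: nothing to do.
                    break;
                case EXPIRED:
                    // The ensemble confirmed the session is gone; another
                    // server may already host our data, so give up.
                    server.abort("ZooKeeper session expired");
                    break;
            }
        }
    }

    public static void main(String[] args) {
        ToyRegionServer rs = new ToyRegionServer();
        SessionWatcher watcher = new SessionWatcher(rs);

        watcher.process(SessionEvent.DISCONNECTED);
        watcher.process(SessionEvent.SYNC_CONNECTED);
        System.out.println("aborted after reconnect: " + rs.isAborted());

        watcher.process(SessionEvent.EXPIRED);
        System.out.println("aborted after expiry: " + rs.isAborted());
    }
}
```

The point of the sketch is that there is nothing left to retry once
expiry is reported: the client has already reconnected in order to learn
that the session is gone.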

The retry you refer to, IIRC, is something different -- it's before
session setup?  Please cite it if you'd like me to explain.

Do you think the session timed out because of a long GC pause?  If
you are on 0.90.1, there may be some things you can do.  See

