hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Yair Even-Zohar" <ya...@revenuescience.com>
Subject RE: Question on Availability
Date Fri, 07 Nov 2008 17:37:55 GMT
How can we shut down the regieonservers cleanly if the master is down?


-----Original Message-----
From: Jim Kellerman (POWERSET) [mailto:Jim.Kellerman@microsoft.com] 
Sent: Friday, November 07, 2008 8:54 AM
To: hbase-user@hadoop.apache.org
Subject: RE: Question on Availability

Jean-Daniel is correct. Although HDFS has a "warm backup" for
the namenode, it requires physical intervention to start it
up if the primary namenode fails.

With respect to HBase, moving the master is not a big deal.
Shutting down the region servers (cleanly: no kill -9),
pushing a new config which indicates where the new master
will live and restarting the cluster should help get over
a failed master. Eventually we will get rid of the master
as a single point of failure when we integrate Zookeeper,
which will allow us to run a multi-master configuration.

Jim Kellerman, Powerset (Live Search, Microsoft Corporation)

> -----Original Message-----
> From: jdcryans@gmail.com [mailto:jdcryans@gmail.com] On Behalf Of
> Daniel Cryans
> Sent: Friday, November 07, 2008 6:53 AM
> To: hbase-user@hadoop.apache.org
> Subject: Re: Question on Availability
> Michael,
> The Namenode is also a SPOF.
> On keeping a separate cluster for failover, until we add higher
> availability, it depends on how much uptime you need to provide I
guess. I
> personally never saw a machine hosting a Master failing, so I'm not
> on
> how clean it can be regards the META, but I think that just closing
> region servers, changing the config for the master then restart the
> cluster
> with a new master would be nearly enough provided that the Namenode
> hosted on another machine. Maybe Jim or Stack can confirm?
> J-D
> On Fri, Nov 7, 2008 at 5:33 AM, Michael Dagaev
> <michael.dagaev@gmail.com>wrote:
> > Hi, all
> >
> >    I guess that Hbase master server is a single point of failure. Is
> > it correct ? Does Hbase (I mean the whole stack -- HDFS + Hbase)
> > any other single point of failure ?
> >
> >    If Hbase has a single point of failure we should arrange a backup
> > Hbase cluster to switch to it in case of failure. Does it make sense
> >
> > Thank you for your cooperation,
> > M.
> >

View raw message