hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Bill Graham <billgra...@gmail.com>
Subject Re: any static column name behavior in hbase? (ie. not storing column name per row)
Date Wed, 11 May 2011 21:51:48 GMT
HBase will always need to store the column name in each cell that uses it.
The only way to reduce the size taken by storing repeated column names
(besides using compression) is to instead store a small pointer to a lookup
table that holds the column name. Check out OpenTSDB, which does something
similar for efficiently storing name/value pairs repeatedly.

On Wed, May 11, 2011 at 2:44 PM, Hiller, Dean x66079 <
dean.hiller@broadridge.com> wrote:

> I like how I can have X columns in a row that varies from another row.  I
> am wondering if there is a way to have hbase have "static" column names(for
> lack of a better term) where the column names don't take up space for each
> row I add to my database.  It just would be nice to have a significantly
> smaller dataset since a lot of our columns are static and some are more
> fixed.
> Thanks,
> Dean
> This message and any attachments are intended only for the use of the
> addressee and
> may contain information that is privileged and confidential. If the reader
> of the
> message is not the intended recipient or an authorized representative of
> the
> intended recipient, you are hereby notified that any dissemination of this
> communication is strictly prohibited. If you have received this
> communication in
> error, please notify us immediately by e-mail and delete the message and
> any
> attachments from your system.

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message