hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jianshi Huang <jianshi.hu...@gmail.com>
Subject Re: Storing JSON in HBase value cell, which serialization format is most compact?
Date Thu, 13 Nov 2014 05:53:35 GMT
I thought FASTDIFF was only for rowkey and columns, great if it also works
in value cell.

And thanks for the bjson link!

Jianshi

On Thu, Nov 13, 2014 at 1:18 PM, Ted Yu <yuzhihong@gmail.com> wrote:

> There is FASTDIFF data block encoding.
>
> See also http://bjson.org/
>
> Cheers
>
> On Nov 12, 2014, at 9:08 PM, Jianshi Huang <jianshi.huang@gmail.com>
> wrote:
>
> > Hi,
> >
> > I'm currently saving JSON in pure String format in the value cell and
> > depends on HBase' block compression to reduce the overhead of JSON.
> >
> > I'm wondering if there's a more space efficient way to store JSON?
> > (there're lots of 0s and 1s, JSON String actually is an OK format)
> >
> > I want to keep the value as a Map since the schema of source data might
> > change over time.
> >
> > Also is there a DIFF based encoding for values? Since I'm storing
> > historical data (snapshot data) and changes between adjacent value cells
> > are relatively small.
> >
> >
> > Thanks,
> > --
> > Jianshi Huang
> >
> > LinkedIn: jianshi
> > Twitter: @jshuang
> > Github & Blog: http://huangjs.github.com/
>



-- 
Jianshi Huang

LinkedIn: jianshi
Twitter: @jshuang
Github & Blog: http://huangjs.github.com/

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message