hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jean-Daniel Cryans <jdcry...@apache.org>
Subject Re: Which LZO library to use?
Date Mon, 02 Aug 2010 18:01:23 GMT
I think that the person who wrote the header of that page meant that
the hadoop-gpl-compression project lacks fixes included in Kevin's
repo. AFAIK you can hit those if you use LZOed files as input for MR,
but I've been using the second one for more than a year without any
issue (in HBase).


On Mon, Aug 2, 2010 at 3:25 AM, Eran Kutner <eran@gigya.com> wrote:
> Hi,
> I want to enable LZO compression on my cluster but see there are a few
> alternatives and the wiki page itself is very confusing so it's not clear
> what is the right choice. I was looking at this page:
> http://wiki.apache.org/hadoop/UsingLzoCompression, at the top it recommends
> using Kevin Weil's version (which seems to be the same one released by
> Twitter) but warns it doesn't contain all fixes and lower in the article it
> refers to the original google code repository (
> http://code.google.com/p/hadoop-gpl-compression/).
> The thing the concerns me most is future compatibility, so whichever library
> I pick now I want to be certain my data compressed will still be readable
> when I I upgrade to the next major version of Hadoop and Hbase. It seems
> that only the Google code project has newer releases compatible with future
> version of Hadoop.
> So I'm looking for recommendations on which library to use.
> Thanks,
> Eran

View raw message