hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Stack <st...@duboce.net>
Subject Re: Which approach would be better
Date Sat, 04 Dec 2010 22:37:15 GMT
What do you mean by similar?

I'd think the speed would be the same doing inserts.  How many rows
and regions when you are done?  What size cluster?

How do you intend to query HBase?  Will you be requesting clumps of
'similars' or just getting an item at a time?


On Fri, Dec 3, 2010 at 4:28 PM, Peter Haidinyak <phaidinyak@local.com> wrote:
> Hi,
>  Which would be a better approach.
> 1.       Having  every entry into HBase use a unique Row Key
> 2.       Having similar entries into HBase use the same Row Key and then use versions
to extract the data.
> I have noticed that option 2 is much slower for putting data into HBase by a factor of
2.5 but would extracting the information be faster?
> Thanks
> -Pete

View raw message