From: Stack
To: user@hbase.apache.org
Subject: Re: Which approach would be better
Date: Sat, 4 Dec 2010 14:37:15 -0800

What do you mean by similar?

I'd think the speed would be the same doing inserts. How many rows and regions when you are done? What size cluster?

How do you intend to query HBase? Will you be requesting clumps of 'similars' or just getting an item at a time?

St.Ack

On Fri, Dec 3, 2010 at 4:28 PM, Peter Haidinyak wrote:
> Hi,
>  Which would be a better approach.
>
> 1. Having every entry into HBase use a unique Row Key
>
> 2. Having similar entries into HBase use the same Row Key and then use versions to extract the data.
>
> I have noticed that option 2 is much slower for putting data into HBase by a factor of 2.5 but would extracting the information be faster?
>
> Thanks
>
> -Pete
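
To make the two layouts in the quoted question concrete, here is a toy in-memory sketch in Python. It is not the HBase client API: `ToyTable`, `max_versions`, `scan_prefix`, and the `user42` keys are all made up for illustration. It assumes a cap on how many versions are retained per row, which loosely mirrors the configurable version limit on an HBase column family, and it says nothing about real insert or read speed.

```python
from collections import defaultdict


class ToyTable:
    """In-memory stand-in for a single column; NOT the HBase client API."""

    def __init__(self, max_versions=3):
        # Hypothetical cap on retained versions per row (an assumption of this sketch).
        self.max_versions = max_versions
        # row key -> {timestamp: value}
        self.rows = defaultdict(dict)

    def put(self, row_key, value, timestamp):
        versions = self.rows[row_key]
        # Writing the same row key at the same timestamp overwrites the old value.
        versions[timestamp] = value
        # Drop the oldest versions once the cap is exceeded.
        excess = len(versions) - self.max_versions
        for old_ts in sorted(versions)[:max(excess, 0)]:
            del versions[old_ts]

    def get(self, row_key):
        """Return all retained versions for one row, newest first."""
        versions = self.rows.get(row_key, {})
        return [versions[ts] for ts in sorted(versions, reverse=True)]

    def scan_prefix(self, prefix):
        """Return (row_key, newest value) pairs for keys starting with prefix, in key order."""
        result = []
        for row_key in sorted(self.rows):
            if row_key.startswith(prefix) and self.rows[row_key]:
                newest_ts = max(self.rows[row_key])
                result.append((row_key, self.rows[row_key][newest_ts]))
        return result


events = ["click-a", "click-b", "click-c", "click-d"]

# Option 1: every entry gets its own row key; 'similar' entries share a key prefix.
unique = ToyTable()
for i, event in enumerate(events):
    unique.put(f"user42-{i:04d}", event, timestamp=i)

# Option 2: similar entries share one row key; each entry is a new version.
shared = ToyTable(max_versions=3)
for i, event in enumerate(events):
    shared.put("user42", event, timestamp=i)

option1 = unique.scan_prefix("user42-")  # one row per entry, read back with a prefix scan
option2 = shared.get("user42")           # one row, entries read back as versions
```

In this sketch option 1 keeps all four entries as separate rows, while option 2 returns only the three newest because the oldest version falls off the cap. Whether a single multi-version get or a scan over adjacent rows suits you better depends on the query pattern asked about above: clumps of 'similars' versus one item at a time.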