Good idea, Ng. The UPSERT SELECT command doesn't use MR; it uses the HBase APIs. It'd be interesting to see which is fastest: the regular Phoenix APIs, our MR integration, or our Spark integration. I'm not 100% sure these integrations support UPSERT SELECT without some minor modifications. Another option would be to use the CSV bulk loader.


On Tuesday, August 18, 2015, Ns G <> wrote:

How about loading the data as a DataFrame or RDD, saving it to a new salted table, and dropping the earlier table? I feel Spark is much faster than MR. Just an idea, though.

On 18-Aug-2015 10:42 pm, "James Taylor" <> wrote:
You can use UPSERT SELECT from the old table to the new table and do this with a single statement.

Make sure you set your timeouts high if the table is big.
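
As a rough sketch of what the statements might look like: the small Python helper below only builds the SQL strings, it does not connect to a cluster. The table names, column definitions, and bucket count are hypothetical placeholders, and the helper name salted_copy_statements is made up for illustration. The CREATE TABLE ... SALT_BUCKETS and UPSERT INTO ... SELECT forms are standard Phoenix SQL.

```python
def salted_copy_statements(old_table, new_table, columns_ddl, salt_buckets):
    """Build the Phoenix SQL for copying old_table into a new salted table.

    All arguments are illustrative placeholders; columns_ddl must repeat the
    column and primary-key definitions of the existing table.
    """
    # Salting is fixed at table creation, so the new table is created
    # with the SALT_BUCKETS option.
    create = (
        f"CREATE TABLE {new_table} ({columns_ddl}) "
        f"SALT_BUCKETS = {salt_buckets}"
    )
    # The single statement that rewrites every row into the salted table.
    copy = f"UPSERT INTO {new_table} SELECT * FROM {old_table}"
    return [create, copy]


# Hypothetical example: an EVENTS table copied into EVENTS_SALTED.
for stmt in salted_copy_statements(
    "EVENTS",
    "EVENTS_SALTED",
    "ID BIGINT NOT NULL PRIMARY KEY, PAYLOAD VARCHAR",
    16,
):
    print(stmt + ";")
```

The generated statements could then be run through sqlline or any Phoenix JDBC connection; once the copy is verified, the old table can be dropped.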


On Tue, Aug 18, 2015 at 9:40 AM, Sumanta Gh <> wrote:
Thanks for replying. We are ok with rewriting the whole data.


-----anil gupta <> wrote: -----
To: "" <>
From: anil gupta <>
Date: 08/18/2015 08:08PM
Subject: Re: Table salting

Adding salting to an existing table would mean changing the row key of every row in HBase. I don't think that's possible without rewriting the entire table with the new (salted) row key. Are you OK with rewriting the entire table with a new row key?
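
To illustrate why every row has to be rewritten, here is a minimal sketch of the idea behind salting: a one-byte prefix, derived from a hash of the row key modulo the number of buckets, is prepended to the key. The hash below is a simple stand-in chosen for illustration and is an assumption, not necessarily the function Phoenix uses internally; salt_row_key is a hypothetical name.

```python
def salt_row_key(row_key: bytes, salt_buckets: int) -> bytes:
    """Prepend a one-byte salt to row_key (illustrative only)."""
    # Stand-in polynomial hash over the key bytes; the real hash used by
    # Phoenix may differ, so do not use this to compute actual HBase keys.
    h = 0
    for b in row_key:
        h = (31 * h + b) & 0xFFFFFFFF
    # The salt is the hash reduced to the bucket range.
    salt = h % salt_buckets
    # The stored key is the salt byte followed by the original key, so the
    # physical row key of every existing row changes.
    return bytes([salt]) + row_key


original = b"user-0001"
salted = salt_row_key(original, 8)
print(len(original), len(salted))  # the salted key is one byte longer
```

Because the stored key gains a prefix that depends on both the key and the bucket count, rows written without salting sit under different keys than a salted table expects, which is why the data has to be copied rather than altered in place.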

On Tue, Aug 18, 2015 at 7:29 AM, Sumanta Gh <> wrote:
Is there a way to add salt buckets to an existing table with data?


Thanks & Regards,
Anil Gupta