kudu-user mailing list archives

From Sand Stone <sand.m.st...@gmail.com>
Subject Re: Partition and Split rows
Date Fri, 06 May 2016 23:51:04 GMT
Thanks. Will read.

Given that I am researching time series data, row locality is crucial :-)

On Fri, May 6, 2016 at 3:57 PM, Jean-Daniel Cryans <jdcryans@apache.org>
wrote:

> We do have non-covering range partitions coming in the next few months.
> Here's the design (in review):
> http://gerrit.cloudera.org:8080/#/c/2772/9/docs/design-docs/non-covering-range-partitions.md
>
> The "Background & Motivation" section should give you a good idea of why
> I'm mentioning this.
>
> Meanwhile, if you don't need row locality, using hash partitioning could
> be good enough.
>
> J-D
>
> On Fri, May 6, 2016 at 3:53 PM, Sand Stone <sand.m.stone@gmail.com> wrote:
>
>> Makes sense.
>>
>> Yeah it would be cool if users could specify/control the split rows after
>> the table is created. Now, I have to "think ahead" to pre-create the range
>> buckets.
>>
>> On Fri, May 6, 2016 at 3:49 PM, Jean-Daniel Cryans <jdcryans@apache.org>
>> wrote:
>>
>>> You will only get 1 tablet and no data distribution, which is bad.
>>>
>>> That's also how HBase works, but it will split regions as you insert
>>> data and eventually you'll get some data distribution even if it doesn't
>>> start in an ideal situation. Tablet splitting will come later for Kudu.
>>>
>>> J-D
>>>
>>> On Fri, May 6, 2016 at 3:42 PM, Sand Stone <sand.m.stone@gmail.com>
>>> wrote:
>>>
>>>> One more question: how does the range partition work if I don't
>>>> specify the split rows?
>>>>
>>>> Thanks!
>>>>
>>>> On Fri, May 6, 2016 at 3:37 PM, Sand Stone <sand.m.stone@gmail.com>
>>>> wrote:
>>>>
>>>>> Thanks, Misty. The "advanced" impala example helped.
>>>>>
>>>>> I was just reading the Java API (CreateTableOptions.java), and it's
>>>>> unclear how the range partition column names are associated with the
>>>>> partial row params in the addSplitRow API.
>>>>>
>>>>> On Fri, May 6, 2016 at 3:08 PM, Misty Stanley-Jones <
>>>>> mstanleyjones@cloudera.com> wrote:
>>>>>
>>>>>> Hi Sand,
>>>>>>
>>>>>> Please have a look at
>>>>>> http://getkudu.io/docs/kudu_impala_integration.html#partitioning_tables
>>>>>> and see if it is helpful to you.
>>>>>>
>>>>>> Thanks,
>>>>>> Misty
>>>>>>
>>>>>> On Fri, May 6, 2016 at 2:00 PM, Sand Stone <sand.m.stone@gmail.com>
>>>>>> wrote:
>>>>>>
>>>>>>> Hi, I am new to Kudu. I wonder how the split rows work. I know from
>>>>>>> some docs that this is currently for pre-creating the table. I am
>>>>>>> researching how to partition (hash+range) some time series test data.
>>>>>>>
>>>>>>> Is there an example, or notes somewhere I could read up on?
>>>>>>>
>>>>>>> Thanks much.
>>>>>>>
>>>>>>
>>>>>>
>>>>>
>>>>
>>>
>>
>
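
The partitioning behavior discussed in this thread can be illustrated with a
small conceptual model. This is only a sketch and does not use the Kudu client
API: the function names (range_partition, hash_bucket, tablet_for) and the
column names (host, ts) are hypothetical, CRC32 is a stand-in for whatever hash
Kudu uses internally, and it assumes each split row is the inclusive lower
bound of the partition that follows it. It shows that N split rows yield N + 1
range partitions (so no split rows means a single partition), and that with
hash+range partitioning a row's tablet is the pair (hash bucket, range
partition).

```python
import bisect
import zlib


def range_partition(key, split_rows):
    """Return the index of the range partition that holds `key`.

    `split_rows` must be sorted. N split rows give N + 1 partitions;
    an empty list gives exactly one partition (index 0).
    Assumption: a split row is the inclusive lower bound of the next partition.
    """
    return bisect.bisect_right(split_rows, key)


def hash_bucket(value, num_buckets):
    """Map a string to one of `num_buckets` buckets.

    CRC32 is a deterministic stand-in, not Kudu's actual hash function.
    """
    return zlib.crc32(value.encode("utf-8")) % num_buckets


def tablet_for(host, ts, split_rows, num_buckets):
    """Combine hash partitioning on `host` with range partitioning on `ts`."""
    return (hash_bucket(host, num_buckets), range_partition(ts, split_rows))


if __name__ == "__main__":
    splits = [1000, 2000]  # hypothetical timestamp split rows
    for ts in (500, 1000, 2500):
        print(ts, tablet_for("host-a", ts, splits, num_buckets=4))
```

In this model, rows with nearby timestamps from the same host land in the same
tablet (row locality), while different hosts spread across hash buckets. The
range buckets still have to be chosen up front, as noted earlier in the thread.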
