spark-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ryan Blue <rb...@netflix.com.INVALID>
Subject Re: Bucketing and catalyst
Date Thu, 02 May 2019 18:46:23 GMT
Andrew,

Here's an umbrella issue that is a good starting point for looking at the
project to add Hive bucketing support:
https://issues.apache.org/jira/browse/SPARK-19256

rb

On Thu, May 2, 2019 at 11:40 AM Long, Andrew <loandrew@amazon.com.invalid>
wrote:

> Hey Friends,
>
>
>
> How aware of bucketing is Catalyst? I’ve been trying to piece together how
> Catalyst knows that it can remove a sort and shuffle given that both tables
> are bucketed and sorted the same way. Is there any classes in particular I
> should look at?
>
>
>
> Cheers Andrew
>


-- 
Ryan Blue
Software Engineer
Netflix

Mime
View raw message