spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Rohit Verma <rohit.ve...@rokittech.com>
Subject Re: Spark join over sorted columns of dataset.
Date Fri, 03 Mar 2017 16:06:05 GMT
Sending it to dev’s.
Can you please help me providing some ideas for below.

Regards
Rohit
> On Feb 23, 2017, at 3:47 PM, Rohit Verma <rohit.verma@rokittech.com> wrote:
> 
> Hi
> 
> While joining two columns of different dataset, how to optimize join if both the columns
are pre sorted within the dataset.
> So that when spark do sort merge join the sorting phase can skipped.
> 
> Regards
> Rohit

Mime
View raw message