spark-user mailing list archives

From "kumar.rajat20del" <kumar.rajat20...@gmail.com>
Subject repartition in df vs partitionBy in df
Date Sat, 20 Apr 2019 18:48:46 GMT
Hi Spark Users,

repartition and partitionBy seem to be very similar on a DataFrame.
In which scenarios should we use each one?

As I understand it, repartition is a very expensive operation because it
requires a full shuffle. So when should we use repartition?

Thanks
Rajat



