spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Punit Naik <>
Subject repartitionAndSortWithinPartitions HELP
Date Thu, 14 Jul 2016 17:09:30 GMT
Hi guys

In my spark/scala code I am implementing secondary sort. I wanted to know,
when I call the "repartitionAndSortWithinPartitions" method, the whole
(entire) RDD will be sorted or only the individual partitions will be
If its the latter case, will applying a "sortByKey" after
"repartitionAndSortWithinPartitions" be faster now that the individual
partitions are sorted?

Thank You


Punit Naik

View raw message