spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From David Thomas <dt5434...@gmail.com>
Subject RDD and Partition
Date Tue, 28 Jan 2014 19:35:40 GMT
Lets say I have an RDD of Strings and there are 26 machines in the cluster.
How can I repartition the RDD in such a way that all strings starting with
A gets collected on machine1, B on machine2 and so on.

Mime
View raw message