spark-user mailing list archives

From _soumya_
Subject Re: Parallelize on spark context
Date Fri, 07 Nov 2014 21:55:48 GMT
Don't worry, you're not the only one to be bitten by this. A quick look at
the Javadoc shows that parallelize has an overload that also takes the
number of partitions:

JavaRDD<Integer> distData = sc.parallelize(data, 100);

-- Now the RDD is split into 100 partitions.
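
To give a rough feel for what "split into 100 partitions" means, here is a minimal plain-Java sketch that cuts a list into a requested number of contiguous, near-equal chunks. It is only an illustration and is not Spark's implementation: the class name PartitionSketch and the method slice are made up for this example, and it assumes the partitions are simple contiguous ranges of the input list.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative only: NOT Spark code. PartitionSketch and slice are
// hypothetical names used to show one way of cutting a list into
// numSlices contiguous, near-equal chunks.
class PartitionSketch {

    static <T> List<List<T>> slice(List<T> data, int numSlices) {
        if (numSlices < 1) {
            throw new IllegalArgumentException("numSlices must be >= 1");
        }
        List<List<T>> slices = new ArrayList<>(numSlices);
        long n = data.size();
        for (int i = 0; i < numSlices; i++) {
            // Integer arithmetic spreads any remainder across the slices,
            // so slice sizes differ by at most one element.
            int start = (int) ((i * n) / numSlices);
            int end = (int) (((i + 1) * n) / numSlices);
            slices.add(new ArrayList<>(data.subList(start, end)));
        }
        return slices;
    }

    public static void main(String[] args) {
        List<Integer> data = new ArrayList<>();
        for (int i = 1; i <= 1000; i++) {
            data.add(i);
        }
        List<List<Integer>> parts = slice(data, 100);
        // prints "100 partitions, first holds 10 elements"
        System.out.println(parts.size() + " partitions, first holds "
                + parts.get(0).size() + " elements");
    }
}
```

With 1000 elements and 100 slices, every chunk holds 10 elements. If the input has fewer elements than the requested number of slices, some chunks in this sketch simply come out empty.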
