spark-user mailing list archives

From Sharon Rapoport
Subject saving rdd to multiple files named by the key
Date Tue, 27 Jan 2015 02:14:30 GMT

I have an RDD of [k,v] pairs, which I got by combining many [k,v] records
by [k]. I want to save each [v] to a file named after its [k]. I could save
the RDD by partition, but that still doesn't let me choose the file names,
and leaves me stuck with foo/part-0000...
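
To make the goal concrete, here is a minimal plain-Python sketch (no Spark
involved) of the output layout I'm after. The function name save_by_key, the
one-value-per-line format, and the sample data are all made up for
illustration; what I'm looking for is the Spark equivalent of this.

```python
import os
import tempfile
from collections import defaultdict


def save_by_key(pairs, out_dir):
    """Write the values for each key to out_dir/<key>, one value per line.

    Hypothetical helper: it only illustrates the desired file layout and
    runs on a local list of pairs, not on an RDD.
    """
    # Combine all values that share a key.
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)

    os.makedirs(out_dir, exist_ok=True)
    written = []
    for key, values in grouped.items():
        # The file is named after the key, not part-0000...
        path = os.path.join(out_dir, str(key))
        with open(path, "w") as handle:
            handle.write("\n".join(str(v) for v in values) + "\n")
        written.append(path)
    return sorted(written)


if __name__ == "__main__":
    sample = [("a", 1), ("b", 2), ("a", 3)]  # made-up sample data
    with tempfile.TemporaryDirectory() as tmp:
        print([os.path.basename(p) for p in save_by_key(sample, tmp)])
```

So for keys "a" and "b" I would want exactly two files, named "a" and "b",
each holding only the values for that key.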

Any tips?

