spark-user mailing list archives

From ag007 <>
Subject Re: Parquet files are only 6-20MB in size?
Date Mon, 03 Nov 2014 09:16:19 GMT
Thanks Akhil,

Am I right in saying that repartition will spread the data randomly, so I
lose chronological order?

I really just want to convert the csv to parquet format in the same order it
came in. If I call repartition with 1, will the order be preserved?

