spark-user mailing list archives

From janardhan shetty <>
Subject ORC v/s Parquet for Spark 2.0
Date Tue, 26 Jul 2016 02:09:24 GMT
Just wondering about the advantages and disadvantages of converting data into ORC or Parquet.

The Spark documentation has numerous examples that use Parquet.

Are there any strong reasons to choose Parquet over the ORC file format?

Also: the current data compression is bzip2.
This seems biased.
