spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jestin Ma <jestinwith.a...@gmail.com>
Subject Using Kyro for DataFrames (Dataset<Row>)?
Date Sun, 07 Aug 2016 22:31:34 GMT
When using DataFrames (Dataset<Row>), there's no option for an Encoder.
Does that mean DataFrames (since it builds on top of an RDD) uses Java
serialization? Does using Kyro make sense as an optimization here?

If not, what's the difference between Java/Kyro serialization, Tungsten,
and Encoders?

Thank you!

Mime
View raw message