spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From innowireless TaeYun Kim <>
Subject Possibly a dumb question: differences between saveAsNewAPIHadoopFile and saveAsNewAPIHadoopDataset?
Date Mon, 22 Sep 2014 06:24:44 GMT


I'm confused with saveAsNewAPIHadoopFile and saveAsNewAPIHadoopDataset.

What's the difference between the two?

What's the individual use cases of the two APIs?

Could you describe the internal flows of the two APIs briefly?


I've used Spark several months, but I have no experience on MapReduce

(I've read a few book chapters on MapReduce, but actually not written code

So maybe this confusion comes from my lack of experience on MapReduce

(I hoped it won't necessary to have since I could use Spark.)




View raw message