spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ramkumar Chokkalingam <ramkumar...@gmail.com>
Subject Support for gz files ?
Date Mon, 21 Oct 2013 06:58:23 GMT
Hello group,

Am having .gz files as part of my input and when reading on the support for
gzip files, I stumbled upon this thread on StackOverflow
<http://stackoverflow.com/questions/16302385/gzip-support-in-spark/16309699#16309699>
which
says that Spark supports gz files. But a few days back I saw a mail thread
 here in the group pointing to this
link<https://www.inkling.com/read/hadoop-definitive-guide-tom-white-3rd/chapter-4/compression#8ca1fda1252b67145680b3a5e9d45b2a>
and
claiming that *Spark does not handle .gz files as they are not splittable*.


These two items seems to be ambiguous. Can anyone confirm on the real
scenario ? Thanks!

Regards,

Ramkumar Chokkalingam ,
University of Washington.
LinkedIn <http://www.linkedin.com/in/mynameisram>

Mime
View raw message