spark-user mailing list archives

From Madhu
Subject Re: count()-ing gz files gives incorrect header check
Date Wed, 21 May 2014 13:26:27 GMT
Can you identify a specific file that fails?
There might be a real bug here, but I have found gzip to be reliable.
Every time I have run into a "bad header" error with gzip, I had a non-gzip
file with the wrong extension for whatever reason.
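
One way to find such a file is to check the first two bytes of each .gz file, since a real gzip stream always starts with 0x1f 0x8b. The following is only a sketch, assuming the inputs are on a local filesystem (files on HDFS or S3 would have to be fetched or streamed first); the function names has_gzip_magic and find_mislabeled_gz are made up for illustration and are not part of Spark or any library.

```python
import os

# Every gzip stream begins with these two bytes (RFC 1952).
GZIP_MAGIC = b"\x1f\x8b"


def has_gzip_magic(path):
    """Return True if the file starts with the two-byte gzip magic number."""
    with open(path, "rb") as f:
        return f.read(2) == GZIP_MAGIC


def find_mislabeled_gz(directory):
    """Return sorted paths of *.gz files in `directory` that lack the gzip magic.

    Only the top level of `directory` is scanned (no recursion).
    """
    bad = []
    for name in sorted(os.listdir(directory)):
        path = os.path.join(directory, name)
        if name.endswith(".gz") and os.path.isfile(path) and not has_gzip_magic(path):
            bad.append(path)
    return bad
```

Calling find_mislabeled_gz on the input directory returns the suspect files. A file that passes this check can still be truncated or corrupt further in, so an empty result narrows the search but does not rule out a real bug.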
