spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Yogesh Vyas <informy...@gmail.com>
Subject read binary file in PySpark
Date Sun, 02 Apr 2017 06:46:11 GMT
Hi,

I am trying to read binary file in PySpark using API binaryRecords(path,
recordLength), but it is giving all values as ['\x00', '\x00', '\x00',
'\x00',....].

But when I am trying to read the same file using binaryFiles(0, it is
giving me correct rdd, but in form of key-value pair. The value is a string.

I wanted to get the byte array out of binary file. How to get it.??

Regards,
Yogesh

Mime
View raw message