Just out of curiosity, why can't you just generate the data randomly?

I have used that approach and it helps a lot, especially if you are just starting to use Spark.
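
To make that concrete, here is a minimal sketch of one way to generate a test file. It uses only the Python standard library; the function name `write_random_csv`, the column names, and the value ranges are made up for illustration, so adjust them to whatever shape of data you want to process.

```python
import csv
import random
import string


def write_random_csv(path, num_rows, seed=42):
    """Write num_rows of synthetic records to path and return the row count.

    The schema (id, category, amount, label) is hypothetical and only
    meant to give Spark a mix of integer, string, and float columns.
    """
    rng = random.Random(seed)  # seeded so the same file can be regenerated
    with open(path, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["id", "category", "amount", "label"])
        for i in range(num_rows):
            writer.writerow([
                i,                                        # sequential key
                rng.choice(["a", "b", "c", "d"]),         # low-cardinality column for group-bys
                round(rng.uniform(0.0, 1000.0), 2),       # numeric column for aggregations
                "".join(rng.choices(string.ascii_lowercase, k=8)),  # random 8-letter string
            ])
    return num_rows
```

Calling `write_random_csv("sample.csv", 1_000_000)` and raising the row count gets you to whatever file size you need, and the result can be loaded with `spark.read.csv("sample.csv", header=True, inferSchema=True)`. For several GBs it is probably faster to generate the rows inside Spark itself, for example starting from `spark.range(n)` and adding columns with `rand()` from `pyspark.sql.functions`, so the work is spread across the cluster.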

Gourav Sengupta

On Thu, Sep 28, 2017 at 5:04 PM, Gaurav1809 <gauravhpandya@gmail.com> wrote:
Hi All,

I have set up a multi-node Spark cluster and am now looking for a good volume of
data to test it and see how it performs while processing that data.
Can anyone provide pointers as to where I can get a few GBs of free sample data?

Thanks and regards,
