spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Prem Moola <prem.mo...@gandiva.tech>
Subject RE: Where can I get few GBs of sample data?
Date Thu, 28 Sep 2017 17:33:21 GMT
As mentioned earlier , just testing some random data for the sake of testing isn’t useful
and wouldn’t really yield any meaningful information, with that being said  here are some
free resources for getting 
Data 
www.quandl.com
www.data.gov

Thanks

Prem Moola (201.679.9071)

From: Jörn Franke
Sent: Thursday, September 28, 2017 1:26 PM
To: Gaurav1809
Cc: user@spark.apache.org
Subject: Re: Where can I get few GBs of sample data?

I think just any Dataset is not useful. The data should be close to the real data that you
want to process. Similarly, the processing should be the same as you plan.


> On 28. Sep 2017, at 18:04, Gaurav1809 <gauravhpandya@gmail.com> wrote:
> 
> Hi All,
> 
> I have setup multi node spark cluster and now looking for good volume of
> data to test and see how it works while processing the same.
> Can anyone provide pointers as to where can i get few GBs of free sample
> data?
> 
> Thanks and regards,
> Gaurav
> 
> 
> 
> --
> Sent from: http://apache-spark-user-list.1001560.n3.nabble.com/
> 
> ---------------------------------------------------------------------
> To unsubscribe e-mail: user-unsubscribe@spark.apache.org
> 

---------------------------------------------------------------------
To unsubscribe e-mail: user-unsubscribe@spark.apache.org



Mime
View raw message