spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mayur Rustagi <mayur.rust...@gmail.com>
Subject Re: distinct on huge dataset
Date Sat, 22 Mar 2014 14:16:53 GMT
Does it work on a smaller file?

Mayur Rustagi
Ph: +1 (760) 203 3257
http://www.sigmoidanalytics.com
@mayur_rustagi <https://twitter.com/mayur_rustagi>



On Sat, Mar 22, 2014 at 4:50 AM, Ryan Compton <compton.ryan@gmail.com>wrote:

> Does it work without .distinct() ?
>
> Possibly related issue I ran into:
>
> https://mail-archives.apache.org/mod_mbox/spark-user/201401.mbox/%3CCAMgYSQ-3YNwD=VEB1Ct9JRO_jetJ40RJ5Ce_8exGsrhm7jbVQA@mail.gmail.com%3E
>
> On Sat, Mar 22, 2014 at 12:45 AM, Kane <kane.isturm@gmail.com> wrote:
> > It's 0.9.0
> >
> >
> >
> > --
> > View this message in context:
> http://apache-spark-user-list.1001560.n3.nabble.com/distinct-on-huge-dataset-tp3025p3027.html
> > Sent from the Apache Spark User List mailing list archive at Nabble.com.
>

Mime
View raw message