spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Reynold Xin <r...@apache.org>
Subject Re: Forcing RDD computation with something else than count() ?
Date Wed, 22 Jan 2014 08:09:04 GMT
You can also do

rdd.foreach(a => Unit)

I actually suspect count is even cheaper than this.



On Tue, Jan 21, 2014 at 5:05 AM, Guillaume Pitel <guillaume.pitel@exensa.com
> wrote:

>  Thanks. So you mean that first() trigger the computation of the WHOLE
> RDD ? That does not sound right, I thought it was lazy.
>
> Guillaume
>
> Hi,
> You can call less expensive operations like first or  take to trigger the
> computation.
>
>
>
>
> --
>    [image: eXenSa]
>  *Guillaume PITEL, Président*
> +33(0)6 25 48 86 80
>
> eXenSa S.A.S. <http://www.exensa.com/>
>  41, rue Périer - 92120 Montrouge - FRANCE
> Tel +33(0)1 84 16 36 77 / Fax +33(0)9 72 28 37 05
>

Mime
View raw message