spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From lisendong <>
Subject Re: how to clean shuffle write each iteration
Date Tue, 03 Mar 2015 08:24:22 GMT
in  ALS, I guess all the iteration’s rdds are referenced by its next iteration’s rdd,
so all the shuffle data will not be deleted until the als job finished…

I guess checkpoint could solve my problem, do you know checkpoint?

> 在 2015年3月3日,下午4:18,nitin [via Apache Spark User List] <>
> Shuffle write will be cleaned if it is not referenced by any object directly/indirectly.
There is a garbage collector written inside spark which periodically checks for weak references
to RDDs/shuffle write/broadcast and deletes them. 
> If you reply to this email, your message will be added to the discussion below:
> To unsubscribe from how to clean shuffle write each iteration, click here <>.
> NAML <>

View this message in context:
Sent from the Apache Spark User List mailing list archive at
View raw message