spark-user mailing list archives

From dgoldenberg
Subject What is Spark's data retention policy?
Date Tue, 16 Jun 2015 23:10:30 GMT
What is Spark's data retention policy?

That is, for the jobs that are sent from the master to the worker nodes, how
long do they persist on those nodes? What about the RDD data: how is that
cleaned up? Are all RDDs cleaned up at GC time unless they've been
.persist()'ed or .cache()'ed?
