Re: how to clean shuffle write each iteration

2015-03-03 Thread lisendong
in ALS, I guess all the iteration’s rdds are referenced by its next iteration’s rdd, so all the shuffle data will not be deleted until the als job finished… I guess checkpoint could solve my problem, do you know checkpoint? > 在 2015年3月3日,下午4:18,nitin [via Apache Spark User List] > 写道: > > S

Re: how to clean shuffle write each iteration

2015-03-03 Thread nitin
Shuffle write will be cleaned if it is not referenced by any object directly/indirectly. There is a garbage collector written inside spark which periodically checks for weak references to RDDs/shuffle write/broadcast and deletes them. -- View this message in context: http://apache-spark-user-li

Re: how to clean shuffle write each iteration

2015-03-02 Thread Akhil Das
Can't find anything related to this from the Configurations page http://spark.apache.org/docs/1.2.0/configuration.html, You could probably open a JIRA issue regarding this. Thanks Best Regards On Tue, Mar 3, 2015 at 12:03 PM, lisendong wrote: > I 'm using spark als. > > I set the iteration numb