Those are different RDDs that DecisionTree persists, though. It's not redundant.

On Thu, Apr 23, 2015 at 11:12 AM, jimfcarroll <jimfcarr...@gmail.com> wrote:
> Hi Sean and Joe,
>
> I have another question.
>
> GradientBoostedTrees.run iterates over the RDD calling DecisionTree.run on
> each iteration with a new random sample from the input RDD. DecisionTree.run
> calls RandomForest.run. which also calls persist.
>
> One of these seems superfluous.
>
> Should I simply remove the persist call at the GradientBoostedTrees level?
>
> Thanks
> Jim
>
>
>
>
> --
> View this message in context: 
> http://apache-spark-developers-list.1001551.n3.nabble.com/GradientBoostTrees-leaks-a-persisted-RDD-tp11750p11762.html
> Sent from the Apache Spark Developers List mailing list archive at Nabble.com.
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: dev-unsubscr...@spark.apache.org
> For additional commands, e-mail: dev-h...@spark.apache.org
>

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@spark.apache.org
For additional commands, e-mail: dev-h...@spark.apache.org

Reply via email to