Github user markhamstra commented on a diff in the pull request:

    https://github.com/apache/spark/pull/126#discussion_r10590414
  
    --- Diff: core/src/main/scala/org/apache/spark/rdd/RDD.scala ---
    @@ -1025,6 +1025,14 @@ abstract class RDD[T: ClassTag](
         checkpointData.flatMap(_.getCheckpointFile)
       }
     
    +  def cleanup() {
    --- End diff --
    
    This seems like an easy thing to misuse regardless of who is doing the 
calling.  I mean, in addition to making rddA invalid, ...
    ```scala
    val rddA = rdd1.join(rdd2)
    val rddB = rdd3.join(rdd2)
    rddA.cleanup()
    ```
    ...is going to do bad things to rddB, right?  Something that forces 
responsibility for manually tracking dependencies isn't something we should be 
making easily accessible. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

Reply via email to