I'm trying to understand when I would want to checkpoint an RDD rather than
just persist to disk.

Every reference I can find to checkpoint related to Spark Streaming.  But
the method is defined in the core Spark library, not Streaming.

Does it exist solely for streaming, or are there circumstances unrelated to
streaming in which I might want to checkpoint...and if so, like what?

Thanks,
Diana

Reply via email to