Hi,
> impact of an executor dying after a localCheckpoint is taken.
My memory is a bit vague on this, but I'd not be surprised if this
localCheckpoint-ed RDD would be "broken" and any actions would simply throw
an exception like missing partitions or similar. There's no way back.
I wish myself t
Jacek,
Thanks for your response, I am still trying to understand the impact of an
executor dying after a localCheckpoint is taken.
Would the entire spark application fail in this case due to the broken
lineage? Or would the jobs associated with that executor need to be
re-computed from scratch?
T
Hi,
> My understanding is that .localCheckpoint() breaks the lineage of the RDD
True.
> and this requires that the entire RDD to be rebuild instead of being able
to recompute lost partitions.
In a sense, it's as if you saved the partitions to executors and re-read
them back as source data (for