Quoting from Spark Program guide: "For accumulator updates performed inside *actions only*, Spark guarantees that each task’s update to the accumulator will only be applied once, i.e. restarted tasks will not update the value. In transformations, users should be aware of that each task’s update may be applied more than once if tasks or job stages are re-executed."
Can anyone gives me a possible scenario of when accumulator might be updated more than once during transformation? Thanks. Regards, Wei