Quoting from Spark Program guide:

"For accumulator updates performed inside *actions only*, Spark guarantees
that each task’s update to the accumulator will only be applied once, i.e.
restarted tasks will not update the value. In transformations, users should
be aware of that each task’s update may be applied more than once if tasks
or job stages are re-executed."

Can anyone gives me a possible scenario of when accumulator might be
updated more than once during transformation? Thanks.

Regards,
Wei

Reply via email to