Data correctness issue with Repartition + FetchFailure

Jason Xu Sat, 12 Mar 2022 12:08:45 -0800

Hi Spark community,

I reported a data correctness issue in
https://issues.apache.org/jira/browse/SPARK-38388. In short,
non-deterministic data + Repartition + FetchFailure could result in
incorrect data, this is an issue we run into in production pipelines, I
have an example to reproduce the bug in the ticket.


I report here to bring more attention, could you help confirm it's a bug
and worth effort to further investigate and fix, thank you in advance for
help!

Thanks,
Jason Xu

Data correctness issue with Repartition + FetchFailure

Reply via email to