Hi Spark community, I reported a data correctness issue in https://issues.apache.org/jira/browse/SPARK-38388. In short, non-deterministic data + Repartition + FetchFailure could result in incorrect data, this is an issue we run into in production pipelines, I have an example to reproduce the bug in the ticket.
I report here to bring more attention, could you help confirm it's a bug and worth effort to further investigate and fix, thank you in advance for help! Thanks, Jason Xu