Hi Spark team,
We have the cluster-wide property
spark.kubernetes.executor.deleteOnTermination set to true.
During a long-running job, some of the executors that held shuffle data
got deleted. Because of this, in the subsequent stage we get a lot of
Spark shuffle fetch failure exceptions.
Please let me know
Hi All,
I have created the umbrella JIRA
https://issues.apache.org/jira/browse/SPARK-37935, and a few sub-tasks. If
you would like to contribute, please leave a comment in the sub-task
saying that you are working on it.
Yours faithfully,
Max Gekk
On Wed, Jan 12, 2022 at 9:39 PM Maxim Gekk
wrote:
> Hi