Spark on k8s: Spark 3.0.1 spark.kubernetes.executor.deleteOnTermination issue

2022-01-17 Thread Pralabh Kumar
Hi Spark team, We have the cluster-wide property spark.kubernetes.executor.deleteOnTermination set to true. During a long-running job, some of the executors that held shuffle data were deleted. Because of this, in the subsequent stage we get a lot of Spark shuffle fetch failure exceptions. Please let me know
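
For readers following the thread, here is a minimal sketch of how the property discussed above is typically passed to spark-submit as a `--conf` flag. The property name is spelled as in the Spark "Running on Kubernetes" documentation (`spark.kubernetes.executor.deleteOnTermination`). The helper `build_conf_args` is hypothetical and exists only for illustration; it is not part of Spark, and the poster's actual cluster setup is not shown in the message.

```python
# Hypothetical helper (not a Spark API): turns a dict of Spark properties
# into the "--conf key=value" arguments that spark-submit accepts.
def build_conf_args(conf):
    args = []
    # Sort keys so the generated command line is deterministic.
    for key, value in sorted(conf.items()):
        args += ["--conf", f"{key}={value}"]
    return args


# Assumption: the value "true" mirrors the setting described in the post.
conf = {"spark.kubernetes.executor.deleteOnTermination": "true"}
print(" ".join(build_conf_args(conf)))
```

The resulting arguments would be appended to a spark-submit invocation. Setting the value to "false" instead keeps terminated executor pods around for inspection, which can help when debugging why executors went away.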

Re: Migration onto error classes and their testing

2022-01-17 Thread Maxim Gekk
Hi All, I have created the umbrella JIRA https://issues.apache.org/jira/browse/SPARK-37935 and a few sub-tasks. If you would like to contribute, please leave a comment in the sub-task saying that you are working on it. Yours faithfully, Max Gekk On Wed, Jan 12, 2022 at 9:39 PM Maxim Gekk wrote: > Hi