dongjoon-hyun commented on PR #54558: URL: https://github.com/apache/spark/pull/54558#issuecomment-3976676750
Thank you for review, @peter-toth . > * What happens after a successful recovery? Will the remaining stages/tasks use the 1 core executor? Here, the successful recovery should be considered as a job completions. If OOM kills executors one by one consecutively due to the re-try, the jobs fail eventually without moving to the next stage. And, if we consider only a single stage, yes. The set of executors will not change further if there is no executor loss. > * What heppens if `spark.task.cpus` is set to >1? Ya, that's the valid corner case. Let me disable this feature for that configuration. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
