dongjoon-hyun commented on PR #54558:
URL: https://github.com/apache/spark/pull/54558#issuecomment-3976676750

   Thank you for review, @peter-toth .
   
   > * What happens after a successful recovery? Will the remaining 
stages/tasks use the 1 core executor?
   
   Here, the successful recovery should be considered as a job completions. If 
OOM kills executors one by one consecutively due to the re-try, the jobs fail 
eventually without moving to the next stage. And, if we consider only a single 
stage, yes. The set of executors will not change further if there is no 
executor loss.
   
   > * What heppens if `spark.task.cpus` is set to >1?
   
   Ya, that's the valid corner case. Let me disable this feature for that 
configuration.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to