Greetings,
We need to come up with comprehensive answers for the following scenario, 
please.
Suppose we have 6 Spark VMs, each running two executors, with the job submitted via spark-submit.
   
   - Two VMs fail at the hardware level (a rack failure)
   - We lose 4 of the 12 Spark executors
   - This happens halfway through the spark-submit job

So my humble questions are:
   
   - Will any data be missing from the final result because of the lost nodes?
   - How will RDD lineage handle this?
   - Will there be any delay in getting the final result?
   - How will the driver handle the failure of these two nodes?
   - Will additional executors be started on the surviving nodes, or will the 
existing executors take over the work of the 4 failed executors?
   - What happens if we are running in client mode and the node holding the driver dies?
   - What happens if the same failure occurs while running in cluster mode?

I searched Google but found no satisfactory answers, gurus, hence turning to the forum.
Best
A.K.
