Hi Romi,
I've observed this many times as well. So much so that on some clusters I
restart the workers every night in order to maintain these worker -> master
connections.
I couldn't find an open SPARK ticket on it so filed
https://issues.apache.org/jira/browse/SPARK-3736 with you and Piotr
ment
Hi all,
Regarding a post here a few months ago
http://apache-spark-user-list.1001560.n3.nabble.com/Workers-disconnected-from-master-sometimes-and-never-reconnect-back-tp6240.html
Is there an answer to this?
I saw workers being still active and not reconnecting after they lost
connection to the ma