I am still struggling to solve this problem.
I am fairly sure that the job should restart automatically after the
TaskManager is restarted in YARN mode. Have I misunderstood something?

The problem seems to be that *the JobManager still tries to connect to the old
TaskManager even after a new TaskManager container has been created.*
When I killed the TM on node#2, a new TM container was created on node#3, but
according to the log file the JM still tries to connect to the TM on node#2.
(This is not the log I posted before; I found it while continuing the test.
Normally the TM is recreated on the same node after being killed.)
As a result, the new TM has no information about the job, and the JM shows the
job with a failed status.

If anyone has succeeded in the same situation (YARN + TM failure), please
let me know.
That would be a big help to me.



