Hi all,

I'm running my Flink application on YARN. It is frequently suspended, although the shutdown itself is graceful. Below is a snippet of the error; I'm also attaching the full JobManager log to help debug. Please help me identify the cause and resolve the issue.
Thank you.

Regards,
Anchit

Error snippet:

2016-11-09 03:15:01,238 INFO org.apache.flink.yarn.YarnClusterClient - Shutting down YarnClusterClient from the client shutdown hook
2016-11-09 03:15:01,242 INFO org.apache.flink.yarn.YarnClusterClient - Sending shutdown request to the Application Master
2016-11-09 03:15:01,248 INFO org.apache.flink.yarn.YarnClusterClient - Start application client.
2016-11-09 03:15:01,260 INFO org.apache.flink.yarn.ApplicationClient - Notification about new leader address akka.tcp://flink@10.60.200.106:36465/user/jobmanager with session ID null.
2016-11-09 03:15:01,263 INFO org.apache.flink.yarn.ApplicationClient - Sending StopCluster request to JobManager.
2016-11-09 03:15:01,264 INFO org.apache.flink.yarn.ApplicationClient - Received address of new leader akka.tcp://flink@10.60.200.106:36465/user/jobmanager with session ID null.
2016-11-09 03:15:01,265 INFO org.apache.flink.yarn.ApplicationClient - Disconnect from JobManager null.
2016-11-09 03:15:01,269 INFO org.apache.flink.yarn.ApplicationClient - Trying to register at JobManager akka.tcp://flink@10.60.200.106:36465/user/jobmanager.
2016-11-09 03:15:01,279 INFO org.apache.flink.yarn.ApplicationClient - Successfully registered at the ResourceManager using JobManager Actor[akka.tcp://flink@10.60.200.106:36465/user/jobmanager#918758350]
2016-11-09 03:15:02,282 INFO org.apache.flink.yarn.ApplicationClient - Sending StopCluster request to JobManager.
2016-11-09 03:15:02,295 INFO org.apache.flink.yarn.YarnClusterClient - Deleting files in hdfs://ldnsns/user/a12345/.flink/application_1478099802210_11790
2016-11-09 03:15:02,319 INFO org.apache.flink.runtime.client.JobClientActor - 11/09/2016 03:15:02 Job execution switched to status SUSPENDED.
--
View this message in context: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Flink-1-1-3-Shutting-down-YarnClusterClient-from-the-client-shutdown-hook-happening-frequently-tp10019.html
Sent from the Apache Flink User Mailing List archive at Nabble.com.