Hi Rainie, I am relatively new to flink, but I suspect that your error is somewhere else in the log. I have found most of my problems by doing a search for the word “error” or “exception”. Since all of these log lines are at the info level, they are probably not highlighting any real issues. If you send more of the log or find an error line that might help others debug.
Thanks, Jesse From: Rainie Li <raini...@pinterest.com> Date: Wednesday, July 15, 2020 at 10:54 AM To: "user@flink.apache.org" <user@flink.apache.org> Subject: flink app crashed Hi All, I am new to Flink, any idea why flink app's Job Manager stuck, here is bottom part from the Job Manager log. Any suggestion will be appreciated. 2020-07-15 16:49:52,749 INFO org.apache.flink.runtime.rpc.akka.AkkaRpcService - Starting RPC endpoint for org.apache.flink.runtime.dispatcher.StandaloneDispatcher at akka://flink/user/dispatcher . 2020-07-15 16:49:52,759 INFO org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService - Starting ZooKeeperLeaderRetrievalService /leader/resource_manager_lock. 2020-07-15 16:49:52,759 INFO org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService - Starting ZooKeeperLeaderRetrievalService /leader/dispatcher_lock. 2020-07-15 16:49:52,762 INFO org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService - Starting ZooKeeperLeaderElectionService ZooKeeperLeaderElectionService{leaderPath='/leader/dispatcher_lock'}. 2020-07-15 16:49:52,790 INFO org.apache.flink.runtime.dispatcher.StandaloneDispatcher - Dispatcher /user/dispatcher was granted leadership with fencing token 2020-07-15 16:49:52,791 INFO org.apache.flink.runtime.dispatcher.StandaloneDispatcher - Recovering all persisted jobs. 2020-07-15 16:49:52,931 INFO org.apache.hadoop.yarn.client.ConfiguredRMFailoverProxyProvider - Failing over to rm1 2020-07-15 16:49:53,014 INFO org.apache.flink.yarn.YarnResourceManager - Recovered 0 containers from previous attempts ([]). 2020-07-15 16:49:53,018 INFO org.apache.hadoop.yarn.client.api.async.impl.NMClientAsyncImpl - Upper bound of the thread pool size is 500 2020-07-15 16:49:53,020 INFO org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy - yarn.client.max-cached-nodemanagers-proxies : 0 2020-07-15 16:49:53,021 INFO org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService - Starting ZooKeeperLeaderElectionService ZooKeeperLeaderElectionService{leaderPath='/leader/resource_manager_lock'}. 2020-07-15 16:49:53,042 INFO org.apache.flink.yarn.YarnResourceManager - ResourceManager akka.tcp://flink@cluster-dev-001/user/resourcemanager was granted leadership with fencing token 2020-07-15 16:49:53,046 INFO org.apache.flink.runtime.resourcemanager.slotmanager.SlotManagerImpl - Starting the SlotManager. 2020-07-15 16:50:52,217 INFO org.apache.kafka.clients.Metadata - Cluster ID: FZrfSqHiTpaZwEzIRYkCLQ Thanks Best regards Rainie