Zhuang Liu created FLINK-31463: ----------------------------------- Summary: When I use apache flink1.12.2 version, the following akka error often occurs. Key: FLINK-31463 URL: https://issues.apache.org/jira/browse/FLINK-31463 Project: Flink Issue Type: Bug Components: Runtime / Network Affects Versions: 1.15.4 Reporter: Zhuang Liu
When I use apache flink1.12.2 version, the following akka error often occurs. java.util.concurrent.TimeoutException: Remote system has been silent for too long. (more than 48.0 hours) at akka.remote.ReliableDeliverySupervisor$$anonfun$idle$1.applyOrElse(Endpoint.scala:375) at akka.actor.Actor$class.aroundReceive(Actor.scala:502) at akka.remote.ReliableDeliverySupervisor.aroundReceive(Endpoint.scala:203) at akka.actor.ActorCell.receiveMessage(ActorCell.scala:526) at akka.actor.ActorCell.invoke(ActorCell.scala:495) at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:257) at akka.dispatch.Mailbox.run(Mailbox.scala:224) at akka.dispatch.Mailbox.exec(Mailbox.scala:234) at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260) at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339) at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979) at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107) ------------------------------------------------------------------------------------------------ I checked that 48 hours ago, there was indeed a process hang inside flink, and the flink job was restarted.How to deal with this? Is this a bug in akka or flink? Thank you ! -- This message was sent by Atlassian Jira (v8.20.10#820010)