Implementation error: Unhandled exception - "Implementation error: Unhandled exception."

Richard Deurwaarder Wed, 07 Nov 2018 11:36:38 -0800

Hello,

We have a Flink job / cluster running in Kubernetes, on Flink 1.6.2 (the
same happens in 1.6.0 and 1.6.1). To upgrade our job we use the REST API.


Every so often the jobmanager seems to be stuck in a crashing state and the
logs show me this stack trace:

2018-11-07 18:43:05,815 [flink-scheduler-1] ERROR org.apache.flink.runtime.rest.handler.cluster.ClusterOverviewHandler - Implementation error: Unhandled exception.
akka.pattern.AskTimeoutException: Ask timed out on [Actor[akka://flink/user/dispatcher#1016927511]] after [10000 ms]. Sender[null] sent message of type "org.apache.flink.runtime.rpc.messages.Implementation error: Unhandled exception.".
    at akka.pattern.PromiseActorRef$$anonfun$1.apply$mcV$sp(AskSupport.scala:604)
    at akka.actor.Scheduler$$anon$4.run(Scheduler.scala:126)
    at scala.concurrent.Future$InternalCallbackExecutor$.unbatchedExecute(Future.scala:601)
    at scala.concurrent.BatchingExecutor$class.execute(BatchingExecutor.scala:109)
    at scala.concurrent.Future$InternalCallbackExecutor$.execute(Future.scala:599)
    at akka.actor.LightArrayRevolverScheduler$TaskHolder.executeTask(LightArrayRevolverScheduler.scala:329)
    at akka.actor.LightArrayRevolverScheduler$$anon$4.executeBucket$1(LightArrayRevolverScheduler.scala:280)
    at akka.actor.LightArrayRevolverScheduler$$anon$4.nextTick(LightArrayRevolverScheduler.scala:284)
    at akka.actor.LightArrayRevolverScheduler$$anon$4.run(LightArrayRevolverScheduler.scala:236)
    at java.lang.Thread.run(Thread.java:748)

If I restart the jobmanager everything is fine afterwards, but the
jobmanager will not restart by itself.

What might've caused this and is this something we can prevent?

Richard
