Aljoscha Krettek created FLINK-2133: ---------------------------------------
Summary: Possible deadlock in ExecutionGraph Key: FLINK-2133 URL: https://issues.apache.org/jira/browse/FLINK-2133 Project: Flink Issue Type: Bug Reporter: Aljoscha Krettek I had the following output on Travis: {code} Found one Java-level deadlock: ============================= "ForkJoinPool-1-worker-3": waiting to lock monitor 0x00007f1c54af7eb8 (object 0x00000000d77fa8c0, a org.apache.flink.runtime.util.SerializableObject), which is held by "flink-akka.actor.default-dispatcher-4" "flink-akka.actor.default-dispatcher-4": waiting to lock monitor 0x00007f1c5486aca0 (object 0x00000000d77fa218, a org.apache.flink.runtime.util.SerializableObject), which is held by "ForkJoinPool-1-worker-3" Java stack information for the threads listed above: =================================================== "ForkJoinPool-1-worker-3": at org.apache.flink.runtime.executiongraph.ExecutionJobVertex.resetForNewExecution(ExecutionJobVertex.java:338) - waiting to lock <0x00000000d77fa8c0> (a org.apache.flink.runtime.util.SerializableObject) at org.apache.flink.runtime.executiongraph.ExecutionGraph.restart(ExecutionGraph.java:595) - locked <0x00000000d77fa218> (a org.apache.flink.runtime.util.SerializableObject) at org.apache.flink.runtime.executiongraph.ExecutionGraph$3.call(ExecutionGraph.java:733) at akka.dispatch.Futures$$anonfun$future$1.apply(Future.scala:94) at scala.concurrent.impl.Future$PromiseCompletingRunnable.liftedTree1$1(Future.scala:24) at scala.concurrent.impl.Future$PromiseCompletingRunnable.run(Future.scala:24) at scala.concurrent.impl.ExecutionContextImpl$$anon$3.exec(ExecutionContextImpl.scala:107) at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260) at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339) at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979) at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107) "flink-akka.actor.default-dispatcher-4": at org.apache.flink.runtime.executiongraph.ExecutionGraph.jobVertexInFinalState(ExecutionGraph.java:683) - waiting to lock <0x00000000d77fa218> (a org.apache.flink.runtime.util.SerializableObject) at org.apache.flink.runtime.executiongraph.ExecutionJobVertex.subtaskInFinalState(ExecutionJobVertex.java:454) - locked <0x00000000d77fa8c0> (a org.apache.flink.runtime.util.SerializableObject) at org.apache.flink.runtime.executiongraph.ExecutionJobVertex.vertexCancelled(ExecutionJobVertex.java:426) at org.apache.flink.runtime.executiongraph.ExecutionVertex.executionCanceled(ExecutionVertex.java:565) at org.apache.flink.runtime.executiongraph.Execution.cancelingComplete(Execution.java:653) at org.apache.flink.runtime.executiongraph.ExecutionGraph.updateState(ExecutionGraph.java:784) at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$receiveWithLogMessages$1$$anonfun$applyOrElse$2.apply$mcV$sp(JobManager.scala:220) at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$receiveWithLogMessages$1$$anonfun$applyOrElse$2.apply(JobManager.scala:219) at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$receiveWithLogMessages$1$$anonfun$applyOrElse$2.apply(JobManager.scala:219) at scala.concurrent.impl.Future$PromiseCompletingRunnable.liftedTree1$1(Future.scala:24) at scala.concurrent.impl.Future$PromiseCompletingRunnable.run(Future.scala:24) at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:41) at akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:401) at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260) at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.pollAndExecAll(ForkJoinPool.java:1253) at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1346) at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979) at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107) Found 1 deadlock. {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)