Till Rohrmann created FLINK-7279: ------------------------------------ Summary: MiniCluster can deadlock at shut down Key: FLINK-7279 URL: https://issues.apache.org/jira/browse/FLINK-7279 Project: Flink Issue Type: Bug Components: Tests Affects Versions: 1.4.0 Reporter: Till Rohrmann
The {{MiniCluster}} can deadlock in case if the fatal error handler is called while the {{MiniCluster}} shuts down. The reason is that the shut down happens under a lock which is required by the fatal error handler as well. If now the {{MiniCluster}} tries to shut down the underlying RPC service which waits for all actors to terminate, it will never complete because one actor is still waiting for the lock. One solution would be to ignore the fatal error handler calls if the {{MiniCluster}} is shutting down. https://s3.amazonaws.com/archive.travis-ci.org/jobs/257811319/log.txt -- This message was sent by Atlassian JIRA (v6.4.14#64029)