[ https://issues.apache.org/jira/browse/FLINK-18138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17129596#comment-17129596 ]
Robert Metzger commented on FLINK-18138: ---------------------------------------- https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=3056&view=logs&j=5c8e7682-d68f-54d1-16a2-a09310218a49&t=45cc9205-bdb7-5b54-63cd-89fdc0983323 > KeyedComplexChainTest.testMigrationAndRestore fails with AskTimeoutException > on CI > ---------------------------------------------------------------------------------- > > Key: FLINK-18138 > URL: https://issues.apache.org/jira/browse/FLINK-18138 > Project: Flink > Issue Type: Bug > Components: Runtime / Coordination, Tests > Affects Versions: 1.12.0 > Reporter: Robert Metzger > Priority: Critical > Labels: test-stability > > https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=2723&view=logs&j=5c8e7682-d68f-54d1-16a2-a09310218a49&t=f508e270-48d6-5f1e-3138-42a17e0714f0 > {code} > 2020-06-04T12:46:11.4458612Z [ERROR] Tests run: 7, Failures: 0, Errors: 1, > Skipped: 0, Time elapsed: 18.942 s <<< FAILURE! - in > org.apache.flink.test.state.operator.restore.keyed.KeyedComplexChainTest > 2020-06-04T12:46:11.4461142Z [ERROR] testMigrationAndRestore[Migrate > Savepoint: > 1.7](org.apache.flink.test.state.operator.restore.keyed.KeyedComplexChainTest) > Time elapsed: 10.486 s <<< ERROR! > 2020-06-04T12:46:11.4464014Z java.util.concurrent.ExecutionException: > java.util.concurrent.TimeoutException: Invocation of public default > java.util.concurrent.CompletableFuture > org.apache.flink.runtime.webmonitor.RestfulGateway.triggerSavepoint(org.apache.flink.api.common.JobID,java.lang.String,boolean,org.apache.flink.api.common.time.Time) > timed out. > 2020-06-04T12:46:11.4466561Z at > java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357) > 2020-06-04T12:46:11.4468016Z at > java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1908) > 2020-06-04T12:46:11.4469645Z at > org.apache.flink.test.state.operator.restore.AbstractOperatorRestoreTestBase.migrateJob(AbstractOperatorRestoreTestBase.java:145) > 2020-06-04T12:46:11.4475048Z at > org.apache.flink.test.state.operator.restore.AbstractOperatorRestoreTestBase.testMigrationAndRestore(AbstractOperatorRestoreTestBase.java:107) > 2020-06-04T12:46:11.4476558Z at > sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) > 2020-06-04T12:46:11.4477687Z at > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) > 2020-06-04T12:46:11.4479033Z at > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > 2020-06-04T12:46:11.4480310Z at > java.lang.reflect.Method.invoke(Method.java:498) > 2020-06-04T12:46:11.4481369Z at > org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50) > 2020-06-04T12:46:11.4482508Z at > org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12) > 2020-06-04T12:46:11.4483663Z at > org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47) > 2020-06-04T12:46:11.4485020Z at > org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17) > 2020-06-04T12:46:11.4486066Z at > org.junit.rules.ExternalResource$1.evaluate(ExternalResource.java:48) > 2020-06-04T12:46:11.4487061Z at > org.junit.rules.ExternalResource$1.evaluate(ExternalResource.java:48) > 2020-06-04T12:46:11.4488097Z at > org.junit.rules.TestWatcher$1.evaluate(TestWatcher.java:55) > 2020-06-04T12:46:11.4488951Z at > org.junit.rules.RunRules.evaluate(RunRules.java:20) > 2020-06-04T12:46:11.4489934Z at > org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325) > 2020-06-04T12:46:11.4493504Z at > org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78) > 2020-06-04T12:46:11.4495100Z at > org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57) > 2020-06-04T12:46:11.4496193Z at > org.junit.runners.ParentRunner$3.run(ParentRunner.java:290) > 2020-06-04T12:46:11.4497136Z at > org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71) > 2020-06-04T12:46:11.4498138Z at > org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288) > 2020-06-04T12:46:11.4499404Z at > org.junit.runners.ParentRunner.access$000(ParentRunner.java:58) > 2020-06-04T12:46:11.4500352Z at > org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268) > 2020-06-04T12:46:11.4501250Z at > org.junit.runners.ParentRunner.run(ParentRunner.java:363) > 2020-06-04T12:46:11.4502194Z at > org.junit.runners.Suite.runChild(Suite.java:128) > 2020-06-04T12:46:11.4502980Z at > org.junit.runners.Suite.runChild(Suite.java:27) > 2020-06-04T12:46:11.4503827Z at > org.junit.runners.ParentRunner$3.run(ParentRunner.java:290) > 2020-06-04T12:46:11.4504861Z at > org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71) > 2020-06-04T12:46:11.4505812Z at > org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288) > 2020-06-04T12:46:11.4506761Z at > org.junit.runners.ParentRunner.access$000(ParentRunner.java:58) > 2020-06-04T12:46:11.4507953Z at > org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268) > 2020-06-04T12:46:11.4508872Z at > org.junit.runners.ParentRunner.run(ParentRunner.java:363) > 2020-06-04T12:46:11.4509847Z at > org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:365) > 2020-06-04T12:46:11.4510978Z at > org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:273) > 2020-06-04T12:46:11.4512223Z at > org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:238) > 2020-06-04T12:46:11.4513531Z at > org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:159) > 2020-06-04T12:46:11.4514958Z at > org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:384) > 2020-06-04T12:46:11.4516165Z at > org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:345) > 2020-06-04T12:46:11.4517277Z at > org.apache.maven.surefire.booter.ForkedBooter.execute(ForkedBooter.java:126) > 2020-06-04T12:46:11.4518404Z at > org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:418) > 2020-06-04T12:46:11.4520150Z Caused by: > java.util.concurrent.TimeoutException: Invocation of public default > java.util.concurrent.CompletableFuture > org.apache.flink.runtime.webmonitor.RestfulGateway.triggerSavepoint(org.apache.flink.api.common.JobID,java.lang.String,boolean,org.apache.flink.api.common.time.Time) > timed out. > 2020-06-04T12:46:11.4521975Z at > com.sun.proxy.$Proxy32.triggerSavepoint(Unknown Source) > 2020-06-04T12:46:11.4523089Z at > org.apache.flink.runtime.minicluster.MiniCluster.lambda$triggerSavepoint$8(MiniCluster.java:595) > 2020-06-04T12:46:11.4524212Z at > java.util.concurrent.CompletableFuture.uniApply(CompletableFuture.java:616) > 2020-06-04T12:46:11.4525492Z at > java.util.concurrent.CompletableFuture.uniApplyStage(CompletableFuture.java:628) > 2020-06-04T12:46:11.4526762Z at > java.util.concurrent.CompletableFuture.thenApply(CompletableFuture.java:1996) > 2020-06-04T12:46:11.4528045Z at > org.apache.flink.runtime.minicluster.MiniCluster.runDispatcherCommand(MiniCluster.java:621) > 2020-06-04T12:46:11.4529217Z at > org.apache.flink.runtime.minicluster.MiniCluster.triggerSavepoint(MiniCluster.java:595) > 2020-06-04T12:46:11.4530410Z at > org.apache.flink.client.program.MiniClusterClient.cancelWithSavepoint(MiniClusterClient.java:89) > 2020-06-04T12:46:11.4531806Z at > org.apache.flink.test.state.operator.restore.AbstractOperatorRestoreTestBase.migrateJob(AbstractOperatorRestoreTestBase.java:143) > 2020-06-04T12:46:11.4532899Z ... 38 more > 2020-06-04T12:46:11.4535761Z Caused by: akka.pattern.AskTimeoutException: Ask > timed out on [Actor[akka://flink/user/rpc/dispatcher_22#-2099760067]] after > [10000 ms]. Message of type > [org.apache.flink.runtime.rpc.messages.LocalFencedMessage]. A typical reason > for `AskTimeoutException` is that the recipient actor didn't send a reply. > 2020-06-04T12:46:11.4537658Z at > akka.pattern.PromiseActorRef$$anonfun$2.apply(AskSupport.scala:635) > 2020-06-04T12:46:11.4538677Z at > akka.pattern.PromiseActorRef$$anonfun$2.apply(AskSupport.scala:635) > 2020-06-04T12:46:11.4539766Z at > akka.pattern.PromiseActorRef$$anonfun$1.apply$mcV$sp(AskSupport.scala:648) > 2020-06-04T12:46:11.4540730Z at > akka.actor.Scheduler$$anon$4.run(Scheduler.scala:205) > 2020-06-04T12:46:11.4541947Z at > scala.concurrent.Future$InternalCallbackExecutor$.unbatchedExecute(Future.scala:601) > 2020-06-04T12:46:11.4543129Z at > scala.concurrent.BatchingExecutor$class.execute(BatchingExecutor.scala:109) > 2020-06-04T12:46:11.4544153Z at > scala.concurrent.Future$InternalCallbackExecutor$.execute(Future.scala:599) > 2020-06-04T12:46:11.4545439Z at > akka.actor.LightArrayRevolverScheduler$TaskHolder.executeTask(LightArrayRevolverScheduler.scala:328) > 2020-06-04T12:46:11.4546724Z at > akka.actor.LightArrayRevolverScheduler$$anon$4.executeBucket$1(LightArrayRevolverScheduler.scala:279) > 2020-06-04T12:46:11.4548065Z at > akka.actor.LightArrayRevolverScheduler$$anon$4.nextTick(LightArrayRevolverScheduler.scala:283) > 2020-06-04T12:46:11.4549389Z at > akka.actor.LightArrayRevolverScheduler$$anon$4.run(LightArrayRevolverScheduler.scala:235) > 2020-06-04T12:46:11.4550327Z at java.lang.Thread.run(Thread.java:748) > {code} -- This message was sent by Atlassian Jira (v8.3.4#803005)