[ https://issues.apache.org/jira/browse/FLINK-26514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17680496#comment-17680496 ]
Matthias Pohl commented on FLINK-26514: --------------------------------------- https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=45184&view=logs&j=5cae8624-c7eb-5c51-92d3-4d2dacedd221&t=5acec1b4-945b-59ca-34f8-168928ce5199&l=29145 > YARNSessionFIFOITCase.testDetachedMode failed on azure > ------------------------------------------------------ > > Key: FLINK-26514 > URL: https://issues.apache.org/jira/browse/FLINK-26514 > Project: Flink > Issue Type: Bug > Components: Deployment / YARN > Affects Versions: 1.15.0, 1.16.0 > Reporter: Yun Gao > Priority: Major > Labels: stale-major, test-stability > > {code:java} > /tmp/.yarn-properties-agent07_azpcontainer > 2022-03-07T03:32:11.1761848Z Mar 07 03:32:11 03:32:10,835 [ Time-limited > test] INFO org.apache.flink.yarn.YARNSessionFIFOITCase [] - > Finished testDetachedMode() > 2022-03-07T03:32:35.2261406Z Mar 07 03:32:35 [INFO] Tests run: 3, Failures: > 0, Errors: 0, Skipped: 0, Time elapsed: 70.987 s - in > org.apache.flink.yarn.YARNSessionFIFOSecuredITCase > 2022-03-07T03:32:35.6496081Z Mar 07 03:32:35 [INFO] > 2022-03-07T03:32:35.6497443Z Mar 07 03:32:35 [INFO] Results: > 2022-03-07T03:32:35.6498560Z Mar 07 03:32:35 [INFO] > 2022-03-07T03:32:35.6499136Z Mar 07 03:32:35 [ERROR] Failures: > 2022-03-07T03:32:35.6501226Z Mar 07 03:32:35 [ERROR] > YARNSessionFIFOITCase.checkForProhibitedLogContents:82->YarnTestBase.ensureNoProhibitedStringInLogFiles:591 > Found a file > /__w/1/s/flink-yarn-tests/target/flink-yarn-tests-fifo/flink-yarn-tests-fifo-logDir-nm-0_0/application_1646623837899_0001/container_1646623837899_0001_01_000001/jobmanager.log > with a prohibited string (one of [Exception, Started > SelectChannelConnector@0.0.0.0:8081]). Excerpts: > 2022-03-07T03:32:35.6502705Z Mar 07 03:32:35 [ > 2022-03-07T03:32:35.6503734Z Mar 07 03:32:35 2022-03-07 03:30:59,734 INFO > org.apache.flink.runtime.entrypoint.component.DispatcherResourceManagerComponent > [] - Closing components. > 2022-03-07T03:32:35.6505105Z Mar 07 03:32:35 2022-03-07 03:30:59,735 INFO > org.apache.flink.runtime.dispatcher.runner.DefaultDispatcherRunner [] - > DefaultDispatcherRunner was revoked the leadership with leader id > 00000000-0000-0000-0000-000000000000. Stopping the DispatcherLeaderProcess. > 2022-03-07T03:32:35.6507058Z Mar 07 03:32:35 2022-03-07 03:30:59,735 INFO > org.apache.flink.runtime.dispatcher.runner.SessionDispatcherLeaderProcess [] > - Stopping SessionDispatcherLeaderProcess. > 2022-03-07T03:32:35.6508245Z Mar 07 03:32:35 2022-03-07 03:30:59,735 INFO > org.apache.flink.runtime.dispatcher.StandaloneDispatcher [] - Stopping > dispatcher akka.tcp://flink@87ed88cbeaa9:44509/user/rpc/dispatcher_0. > 2022-03-07T03:32:35.6509555Z Mar 07 03:32:35 2022-03-07 03:30:59,735 INFO > org.apache.flink.runtime.dispatcher.StandaloneDispatcher [] - Stopping > all currently running jobs of dispatcher > akka.tcp://flink@87ed88cbeaa9:44509/user/rpc/dispatcher_0. > 2022-03-07T03:32:35.6511062Z Mar 07 03:32:35 2022-03-07 03:30:59,736 INFO > org.apache.flink.runtime.resourcemanager.ResourceManagerServiceImpl [] - > Stopping resource manager service. > 2022-03-07T03:32:35.6512171Z Mar 07 03:32:35 2022-03-07 03:30:59,736 INFO > org.apache.flink.runtime.resourcemanager.ResourceManagerServiceImpl [] - > Resource manager service is not running. Ignore revoking leadership. > 2022-03-07T03:32:35.6513468Z Mar 07 03:32:35 2022-03-07 03:30:59,737 INFO > org.apache.flink.runtime.dispatcher.StandaloneDispatcher [] - Stopped > dispatcher akka.tcp://flink@87ed88cbeaa9:44509/user/rpc/dispatcher_0. > 2022-03-07T03:32:35.6515017Z Mar 07 03:32:35 2022-03-07 03:30:59,739 ERROR > org.apache.hadoop.yarn.client.api.async.impl.AMRMClientAsyncImpl [] - > Exception on heartbeat > 2022-03-07T03:32:35.6515805Z Mar 07 03:32:35 java.io.InterruptedIOException: > Interrupted waiting to send RPC request to server > 2022-03-07T03:32:35.6516376Z Mar 07 03:32:35 java.io.InterruptedIOException: > Interrupted waiting to send RPC request to server > 2022-03-07T03:32:35.6517169Z Mar 07 03:32:35 at > org.apache.hadoop.ipc.Client.call(Client.java:1446) > ~[hadoop-common-3.1.3.jar:?] > 2022-03-07T03:32:35.6517979Z Mar 07 03:32:35 at > org.apache.hadoop.ipc.Client.call(Client.java:1388) > ~[hadoop-common-3.1.3.jar:?] > 2022-03-07T03:32:35.6518850Z Mar 07 03:32:35 at > org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:233) > ~[hadoop-common-3.1.3.jar:?] > 2022-03-07T03:32:35.6520060Z Mar 07 03:32:35 at > org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:118) > ~[hadoop-common-3.1.3.jar:?] > 2022-03-07T03:32:35.6520935Z Mar 07 03:32:35 at > com.sun.proxy.$Proxy32.allocate(Unknown Source) ~[?:?] > 2022-03-07T03:32:35.6522456Z Mar 07 03:32:35 at > org.apache.hadoop.yarn.api.impl.pb.client.ApplicationMasterProtocolPBClientImpl.allocate(ApplicationMasterProtocolPBClientImpl.java:77) > ~[hadoop-yarn-common-3.1.3.jar:?] > 2022-03-07T03:32:35.6523305Z Mar 07 03:32:35 at > sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) ~[?:1.8.0_292] > 2022-03-07T03:32:35.6524005Z Mar 07 03:32:35 at > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) > ~[?:1.8.0_292] > 2022-03-07T03:32:35.6524865Z Mar 07 03:32:35 at > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > ~[?:1.8.0_292] > 2022-03-07T03:32:35.6525468Z Mar 07 03:32:35 at > java.lang.reflect.Method.invoke(Method.java:498) ~[?:1.8.0_292] > 2022-03-07T03:32:35.6526616Z Mar 07 03:32:35 at > org.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:422) > ~[hadoop-common-3.1.3.jar:?] > 2022-03-07T03:32:35.6527629Z Mar 07 03:32:35 at > org.apache.hadoop.io.retry.RetryInvocationHandler$Call.invokeMethod(RetryInvocationHandler.java:165) > ~[hadoop-common-3.1.3.jar:?] > 2022-03-07T03:32:35.6528630Z Mar 07 03:32:35 at > org.apache.hadoop.io.retry.RetryInvocationHandler$Call.invoke(RetryInvocationHandler.java:157) > ~[hadoop-common-3.1.3.jar:?] > 2022-03-07T03:32:35.6529629Z Mar 07 03:32:35 at > org.apache.hadoop.io.retry.RetryInvocationHandler$Call.invokeOnce(RetryInvocationHandler.java:95) > ~[hadoop-common-3.1.3.jar:?] > 2022-03-07T03:32:35.6530612Z Mar 07 03:32:35 at > org.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:359) > ~[hadoop-common-3.1.3.jar:?] > 2022-03-07T03:32:35.6531218Z Mar 07 03:32:35 at > com.sun.proxy.$Proxy33.allocate(Unknown Source) ~[?:?] > 2022-03-07T03:32:35.6532041Z Mar 07 03:32:35 at > org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl.allocate(AMRMClientImpl.java:320) > ~[hadoop-yarn-client-3.1.3.jar:?] > 2022-03-07T03:32:35.6533095Z Mar 07 03:32:35 at > org.apache.hadoop.yarn.client.api.async.impl.AMRMClientAsyncImpl$HeartbeatThread.run(AMRMClientAsyncImpl.java:311) > [hadoop-yarn-client-3.1.3.jar:?] > 2022-03-07T03:32:35.6533938Z Mar 07 03:32:35 Caused by: > java.lang.InterruptedException > 2022-03-07T03:32:35.6534717Z Mar 07 03:32:35 at > java.util.concurrent.FutureTask.awaitDone(FutureTask.java:404) ~[?:1.8.0_292] > 2022-03-07T03:32:35.6535289Z Mar 07 03:32:35 at > java.util.concurrent.FutureTask.get(FutureTask.java:191) ~[?:1.8.0_292] > 2022-03-07T03:32:35.6536571Z Mar 07 03:32:35 at > org.apache.hadoop.ipc.Client$Connection.sendRpcRequest(Client.java:1158) > ~[hadoop-common-3.1.3.jar:?] > 2022-03-07T03:32:35.6537639Z Mar 07 03:32:35 at > org.apache.hadoop.ipc.Client.call(Client.java:1441) > ~[hadoop-common-3.1.3.jar:?] > 2022-03-07T03:32:35.6538117Z Mar 07 03:32:35 ... 17 more > 2022-03-07T03:32:35.6538873Z Mar 07 03:32:35 2022-03-07 03:30:59,748 ERROR > org.apache.flink.runtime.resourcemanager.active.ActiveResourceManager [] - > Fatal error occurred in ResourceManager. > 2022-03-07T03:32:35.6539532Z Mar 07 03:32:35 java.io.InterruptedIOException: > Interrupted waiting to send RPC request to server > 2022-03-07T03:32:35.6540304Z Mar 07 03:32:35 at > org.apache.hadoop.ipc.Client.call(Client.java:1446) > ~[hadoop-common-3.1.3.jar:?] > 2022-03-07T03:32:35.6541103Z Mar 07 03:32:35 at > org.apache.hadoop.ipc.Client.call(Client.java:1388) > ~[hadoop-common-3.1.3.jar:?] > 2022-03-07T03:32:35.6541963Z Mar 07 03:32:35 at > org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:233) > ~[hadoop-common-3.1.3.jar:?] > 2022-03-07T03:32:35.6543062Z Mar 07 03:32:35 at > org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:118) > ~[hadoop-common-3.1.3.jar:?] > 2022-03-07T03:32:35.6543657Z Mar 07 03:32:35 at > com.sun.proxy.$Proxy32.allocate(Unknown Source) ~[?:?] > 2022-03-07T03:32:35.6544726Z Mar 07 03:32:35 at > org.apache.hadoop.yarn.api.impl.pb.client.ApplicationMasterProtocolPBClientImpl.allocate(ApplicationMasterProtocolPBClientImpl.java:77) > ~[hadoop-yarn-common-3.1.3.jar:?] > 2022-03-07T03:32:35.6545762Z Mar 07 03:32:35 at > sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) ~[?:1.8.0_292] > 2022-03-07T03:32:35.6546357Z Mar 07 03:32:35 at > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) > ~[?:1.8.0_292] > 2022-03-07T03:32:35.6547017Z Mar 07 03:32:35 at > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > ~[?:1.8.0_292] > 2022-03-07T03:32:35.6547501Z Mar 07 03:32:35 ] {code} > https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=32584&view=logs&j=245e1f2e-ba5b-5570-d689-25ae21e5302f&t=d04c9862-880c-52f5-574b-a7a79fef8e0f&l=34410 -- This message was sent by Atlassian Jira (v8.20.10#820010)