Matthias Pohl created FLINK-31609: ------------------------------------- Summary: Fatal error in ResourceManager caused YARNSessionFIFOSecuredITCase.testDetachedMode to fail Key: FLINK-31609 URL: https://issues.apache.org/jira/browse/FLINK-31609 Project: Flink Issue Type: Bug Components: Deployment / YARN Affects Versions: 1.18.0 Reporter: Matthias Pohl
This looks like FLINK-30908. I created a follow-up ticket because we reached a new minor version. https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=47547&view=logs&j=fc5181b0-e452-5c8f-68de-1097947f6483&t=995c650b-6573-581c-9ce6-7ad4cc038461 {code} Mar 24 09:32:29 2023-03-24 09:31:50,001 ERROR org.apache.hadoop.yarn.client.api.async.impl.AMRMClientAsyncImpl [] - Exception on heartbeat Mar 24 09:32:29 java.io.InterruptedIOException: Interrupted waiting to send RPC request to server Mar 24 09:32:29 java.io.InterruptedIOException: Interrupted waiting to send RPC request to server Mar 24 09:32:29 at org.apache.hadoop.ipc.Client.call(Client.java:1461) ~[hadoop-common-2.10.2.jar:?] Mar 24 09:32:29 at org.apache.hadoop.ipc.Client.call(Client.java:1403) ~[hadoop-common-2.10.2.jar:?] Mar 24 09:32:29 at org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:230) ~[hadoop-common-2.10.2.jar:?] Mar 24 09:32:29 at org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:118) ~[hadoop-common-2.10.2.jar:?] Mar 24 09:32:29 at com.sun.proxy.$Proxy33.allocate(Unknown Source) ~[?:?] Mar 24 09:32:29 at org.apache.hadoop.yarn.api.impl.pb.client.ApplicationMasterProtocolPBClientImpl.allocate(ApplicationMasterProtocolPBClientImpl.java:77) ~[hadoop-yarn-common-2.10.2.jar:?] Mar 24 09:32:29 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) ~[?:1.8.0_292] Mar 24 09:32:29 at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) ~[?:1.8.0_292] Mar 24 09:32:29 at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) ~[?:1.8.0_292] Mar 24 09:32:29 at java.lang.reflect.Method.invoke(Method.java:498) ~[?:1.8.0_292] Mar 24 09:32:29 at org.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:433) ~[hadoop-common-2.10.2.jar:?] Mar 24 09:32:29 at org.apache.hadoop.io.retry.RetryInvocationHandler$Call.invokeMethod(RetryInvocationHandler.java:166) ~[hadoop-common-2.10.2.jar:?] Mar 24 09:32:29 at org.apache.hadoop.io.retry.RetryInvocationHandler$Call.invoke(RetryInvocationHandler.java:158) ~[hadoop-common-2.10.2.jar:?] Mar 24 09:32:29 at org.apache.hadoop.io.retry.RetryInvocationHandler$Call.invokeOnce(RetryInvocationHandler.java:96) ~[hadoop-common-2.10.2.jar:?] Mar 24 09:32:29 at org.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:362) ~[hadoop-common-2.10.2.jar:?] Mar 24 09:32:29 at com.sun.proxy.$Proxy34.allocate(Unknown Source) ~[?:?] Mar 24 09:32:29 at org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl.allocate(AMRMClientImpl.java:297) ~[hadoop-yarn-client-2.10.2.jar:?] Mar 24 09:32:29 at org.apache.hadoop.yarn.client.api.async.impl.AMRMClientAsyncImpl$HeartbeatThread.run(AMRMClientAsyncImpl.java:274) [hadoop-yarn-client-2.10.2.jar:?] Mar 24 09:32:29 Caused by: java.lang.InterruptedException Mar 24 09:32:29 at java.util.concurrent.FutureTask.awaitDone(FutureTask.java:404) ~[?:1.8.0_292] Mar 24 09:32:29 at java.util.concurrent.FutureTask.get(FutureTask.java:191) ~[?:1.8.0_292] Mar 24 09:32:29 at org.apache.hadoop.ipc.Client$Connection.sendRpcRequest(Client.java:1177) ~[hadoop-common-2.10.2.jar:?] Mar 24 09:32:29 at org.apache.hadoop.ipc.Client.call(Client.java:1456) ~[hadoop-common-2.10.2.jar:?] Mar 24 09:32:29 ... 17 more {code} -- This message was sent by Atlassian Jira (v8.20.10#820010)