[ https://issues.apache.org/jira/browse/FLINK-6187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Robert Metzger updated FLINK-6187: ---------------------------------- Component/s: YARN > Cancel job failed with option -m on yarn session > ------------------------------------------------ > > Key: FLINK-6187 > URL: https://issues.apache.org/jira/browse/FLINK-6187 > Project: Flink > Issue Type: Bug > Components: Client, YARN > Reporter: Yuhong Hong > > 1. start yarn session: ./bin/yarn-session.sh -n 3 -jm 2048 -tm 3096 > 2. submit a job: ./bin/flink run ... > 3. cancel the job with option -m: > ./bin/flink cancel -m ip:port jobid > {code} > org.apache.flink.configuration.IllegalConfigurationException: Couldn't > retrieve client for cluster > at > org.apache.flink.client.CliFrontend.retrieveClient(CliFrontend.java:912) > at > org.apache.flink.client.CliFrontend.getJobManagerGateway(CliFrontend.java:926) > at org.apache.flink.client.CliFrontend.cancel(CliFrontend.java:602) > at > org.apache.flink.client.CliFrontend.parseParameters(CliFrontend.java:1079) > at org.apache.flink.client.CliFrontend$2.call(CliFrontend.java:1120) > at org.apache.flink.client.CliFrontend$2.call(CliFrontend.java:1117) > at > org.apache.flink.runtime.security.HadoopSecurityContext$1.run(HadoopSecurityContext.java:43) > at java.security.AccessController.doPrivileged(Native Method) > at javax.security.auth.Subject.doAs(Subject.java:422) > at > org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1738) > at > org.apache.flink.runtime.security.HadoopSecurityContext.runSecured(HadoopSecurityContext.java:40) > at org.apache.flink.client.CliFrontend.main(CliFrontend.java:1116) > Caused by: java.lang.RuntimeException: Failed to retrieve JobManager address > at > org.apache.flink.client.program.ClusterClient.getJobManagerAddress(ClusterClient.java:248) > at > org.apache.flink.client.CliFrontend.retrieveClient(CliFrontend.java:908) > ... 11 more > Caused by: org.apache.flink.runtime.leaderretrieval.LeaderRetrievalException: > Could not retrieve the leader address and leader session ID. > at > org.apache.flink.runtime.util.LeaderRetrievalUtils.retrieveLeaderConnectionInfo(LeaderRetrievalUtils.java:175) > at > org.apache.flink.client.program.ClusterClient.getJobManagerAddress(ClusterClient.java:242) > ... 12 more > Caused by: java.util.concurrent.TimeoutException: Futures timed out after > [60000 milliseconds] > at > scala.concurrent.impl.Promise$DefaultPromise.ready(Promise.scala:219) > at > scala.concurrent.impl.Promise$DefaultPromise.result(Promise.scala:223) > at scala.concurrent.Await$$anonfun$result$1.apply(package.scala:190) > at > scala.concurrent.BlockContext$DefaultBlockContext$.blockOn(BlockContext.scala:53) > at scala.concurrent.Await$.result(package.scala:190) > at scala.concurrent.Await.result(package.scala) > at > org.apache.flink.runtime.util.LeaderRetrievalUtils.retrieveLeaderConnectionInfo(LeaderRetrievalUtils.java:173) > ... 13 more > {code} -- This message was sent by Atlassian JIRA (v6.3.15#6346)