[ 
https://issues.apache.org/jira/browse/FLINK-22453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Liu updated FLINK-22453:
------------------------
    Description: 
flink version: 1.12.2

yarn version : 3.1.1 (hdp 3.1.5)
h3. Starting a Flink Session on YARN

when i use ' flink stop xxxxxxxxxxx',   comond line output:

 
=========================================================
 
{noformat}
Setting HBASE_CONF_DIR=/etc/hbase/conf because no HBASE_CONF_DIR was set.
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in 
[jar:file:/data/flink-1.12.2/lib/log4j-slf4j-impl-2.12.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in 
[jar:file:/usr/hdp/3.1.5.0-152/hadoop/lib/slf4j-log4j12-1.7.25.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.apache.logging.slf4j.Log4jLoggerFactory]
2021-04-25 16:10:43,369 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli     
           [] - Found Yarn properties file under /tmp/.yarn-properties-flink.
2021-04-25 16:10:43,369 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli     
           [] - Found Yarn properties file under /tmp/.yarn-properties-flink.
Suspending job "a81c0fe295871ef278a119cd44206216" with a savepoint.
2021-04-25 16:10:45,126 INFO  org.apache.hadoop.yarn.client.AHSProxy            
           [] - Connecting to Application History server at 
adt-bd-c1-nn03.internal/172.20.33.149:10200
2021-04-25 16:10:45,174 INFO  org.apache.flink.yarn.YarnClusterDescriptor       
           [] - No path for the flink jar passed. Using the location of class 
org.apache.flink.yarn.YarnClusterDescriptor to locate the jar
2021-04-25 16:10:45,520 INFO  org.apache.flink.yarn.YarnClusterDescriptor       
           [] - Found Web Interface adt-bd-c1-flink06.internal:43379 of 
application 
'application_1618023905026_0005'.------------------------------------------------------------
The program finished with the following 
exception:org.apache.flink.util.FlinkException: Could not stop with a savepoint 
job "a81c0fe295871ef278a119cd44206216".
    at 
org.apache.flink.client.cli.CliFrontend.lambda$stop$5(CliFrontend.java:581)
    at 
org.apache.flink.client.cli.CliFrontend.runClusterAction(CliFrontend.java:1002)
    at org.apache.flink.client.cli.CliFrontend.stop(CliFrontend.java:569)
    at 
org.apache.flink.client.cli.CliFrontend.parseAndRun(CliFrontend.java:1069)
    at 
org.apache.flink.client.cli.CliFrontend.lambda$main$10(CliFrontend.java:1132)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:422)
    at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1730)
    at 
org.apache.flink.runtime.security.contexts.HadoopSecurityContext.runSecured(HadoopSecurityContext.java:41)
    at org.apache.flink.client.cli.CliFrontend.main(CliFrontend.java:1132)
Caused by: java.util.concurrent.ExecutionException: 
java.util.concurrent.TimeoutException
    at 
java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357)
    at java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1915)
    at 
org.apache.flink.client.cli.CliFrontend.lambda$stop$5(CliFrontend.java:579)
    ... 9 more
Caused by: java.util.concurrent.TimeoutException
    at 
org.apache.flink.runtime.concurrent.FutureUtils$Timeout.run(FutureUtils.java:1220)
    at 
org.apache.flink.runtime.concurrent.DirectExecutorService.execute(DirectExecutorService.java:217)
    at 
org.apache.flink.runtime.concurrent.FutureUtils.lambda$orTimeout$15(FutureUtils.java:582)
    at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
    at java.util.concurrent.FutureTask.run(FutureTask.java:266)
    at 
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)
    at 
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
    at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
    at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
    at java.lang.Thread.run(Thread.java:748){noformat}
 
=========================================================
 flink can't stop the job,  

but when i user  'flink stop -m jobmanager.server.host:port    xxxxx'  ,  it 
work well.

'-m' is an option args,  The old version does not have this problem

 

  was:
flink version: 1.12.2

yarn version : 3.1.1 (hdp 3.1.5)
h3. Starting a Flink Session on YARN

when i use ' flink stop xxxxxxxxxxx',   comond line output:
{panel:title=我的标题}
 

Setting HBASE_CONF_DIR=/etc/hbase/conf because no HBASE_CONF_DIR was set.SLF4J: 
Class path contains multiple SLF4J bindings.SLF4J: Found binding in 
[jar:file:/data/flink-1.12.2/lib/log4j-slf4j-impl-2.12.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]SLF4J:
 Found binding in 
[jar:file:/usr/hdp/3.1.5.0-152/hadoop/lib/slf4j-log4j12-1.7.25.jar!/org/slf4j/impl/StaticLoggerBinder.class]SLF4J:
 See http://www.slf4j.org/codes.html#multiple_bindings for an 
explanation.SLF4J: Actual binding is of type 
[org.apache.logging.slf4j.Log4jLoggerFactory]2021-04-25 16:10:43,369 INFO  
org.apache.flink.yarn.cli.FlinkYarnSessionCli                [] - Found Yarn 
properties file under /tmp/.yarn-properties-flink.2021-04-25 16:10:43,369 INFO  
org.apache.flink.yarn.cli.FlinkYarnSessionCli                [] - Found Yarn 
properties file under /tmp/.yarn-properties-flink.Suspending job 
"a81c0fe295871ef278a119cd44206216" with a savepoint.2021-04-25 16:10:45,126 
INFO  org.apache.hadoop.yarn.client.AHSProxy                       [] - 
Connecting to Application History server at 
adt-bd-c1-nn03.internal/172.20.33.149:102002021-04-25 16:10:45,174 INFO  
org.apache.flink.yarn.YarnClusterDescriptor                  [] - No path for 
the flink jar passed. Using the location of class 
org.apache.flink.yarn.YarnClusterDescriptor to locate the jar2021-04-25 
16:10:45,520 INFO  org.apache.flink.yarn.YarnClusterDescriptor                  
[] - Found Web Interface adt-bd-c1-flink06.internal:43379 of application 
'application_1618023905026_0005'.
------------------------------------------------------------The program 
finished with the following exception:
org.apache.flink.util.FlinkException: Could not stop with a savepoint job 
"a81c0fe295871ef278a119cd44206216". at 
org.apache.flink.client.cli.CliFrontend.lambda$stop$5(CliFrontend.java:581) at 
org.apache.flink.client.cli.CliFrontend.runClusterAction(CliFrontend.java:1002) 
at org.apache.flink.client.cli.CliFrontend.stop(CliFrontend.java:569) at 
org.apache.flink.client.cli.CliFrontend.parseAndRun(CliFrontend.java:1069) at 
org.apache.flink.client.cli.CliFrontend.lambda$main$10(CliFrontend.java:1132) 
at java.security.AccessController.doPrivileged(Native Method) at 
javax.security.auth.Subject.doAs(Subject.java:422) at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1730)
 at 
org.apache.flink.runtime.security.contexts.HadoopSecurityContext.runSecured(HadoopSecurityContext.java:41)
 at org.apache.flink.client.cli.CliFrontend.main(CliFrontend.java:1132)Caused 
by: java.util.concurrent.ExecutionException: 
java.util.concurrent.TimeoutException at 
java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357) at 
java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1915) at 
org.apache.flink.client.cli.CliFrontend.lambda$stop$5(CliFrontend.java:579) ... 
9 moreCaused by: java.util.concurrent.TimeoutException at 
org.apache.flink.runtime.concurrent.FutureUtils$Timeout.run(FutureUtils.java:1220)
 at 
org.apache.flink.runtime.concurrent.DirectExecutorService.execute(DirectExecutorService.java:217)
 at 
org.apache.flink.runtime.concurrent.FutureUtils.lambda$orTimeout$15(FutureUtils.java:582)
 at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) at 
java.util.concurrent.FutureTask.run(FutureTask.java:266) at 
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)
 at 
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
 at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) 
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) 
at java.lang.Thread.run(Thread.java:748)
{panel}
 

 flink can't stop the job,  

but when i user  'flink stop -m jobmanager.server.host:port    xxxxx'  ,  it 
work well.

'-m' is an option args,  The old version does not have this problem

 


> Can not stop job when not ust "-m" option
> -----------------------------------------
>
>                 Key: FLINK-22453
>                 URL: https://issues.apache.org/jira/browse/FLINK-22453
>             Project: Flink
>          Issue Type: Bug
>          Components: API / Core
>    Affects Versions: 1.12.2
>            Reporter: Liu
>            Priority: Minor
>
> flink version: 1.12.2
> yarn version : 3.1.1 (hdp 3.1.5)
> h3. Starting a Flink Session on YARN
> when i use ' flink stop xxxxxxxxxxx',   comond line output:
>  
> =========================================================
>  
> {noformat}
> Setting HBASE_CONF_DIR=/etc/hbase/conf because no HBASE_CONF_DIR was set.
> SLF4J: Class path contains multiple SLF4J bindings.
> SLF4J: Found binding in 
> [jar:file:/data/flink-1.12.2/lib/log4j-slf4j-impl-2.12.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> SLF4J: Found binding in 
> [jar:file:/usr/hdp/3.1.5.0-152/hadoop/lib/slf4j-log4j12-1.7.25.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an 
> explanation.
> SLF4J: Actual binding is of type [org.apache.logging.slf4j.Log4jLoggerFactory]
> 2021-04-25 16:10:43,369 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli   
>              [] - Found Yarn properties file under 
> /tmp/.yarn-properties-flink.
> 2021-04-25 16:10:43,369 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli   
>              [] - Found Yarn properties file under 
> /tmp/.yarn-properties-flink.
> Suspending job "a81c0fe295871ef278a119cd44206216" with a savepoint.
> 2021-04-25 16:10:45,126 INFO  org.apache.hadoop.yarn.client.AHSProxy          
>              [] - Connecting to Application History server at 
> adt-bd-c1-nn03.internal/172.20.33.149:10200
> 2021-04-25 16:10:45,174 INFO  org.apache.flink.yarn.YarnClusterDescriptor     
>              [] - No path for the flink jar passed. Using the location of 
> class org.apache.flink.yarn.YarnClusterDescriptor to locate the jar
> 2021-04-25 16:10:45,520 INFO  org.apache.flink.yarn.YarnClusterDescriptor     
>              [] - Found Web Interface adt-bd-c1-flink06.internal:43379 of 
> application 
> 'application_1618023905026_0005'.------------------------------------------------------------
> The program finished with the following 
> exception:org.apache.flink.util.FlinkException: Could not stop with a 
> savepoint job "a81c0fe295871ef278a119cd44206216".
>     at 
> org.apache.flink.client.cli.CliFrontend.lambda$stop$5(CliFrontend.java:581)
>     at 
> org.apache.flink.client.cli.CliFrontend.runClusterAction(CliFrontend.java:1002)
>     at org.apache.flink.client.cli.CliFrontend.stop(CliFrontend.java:569)
>     at 
> org.apache.flink.client.cli.CliFrontend.parseAndRun(CliFrontend.java:1069)
>     at 
> org.apache.flink.client.cli.CliFrontend.lambda$main$10(CliFrontend.java:1132)
>     at java.security.AccessController.doPrivileged(Native Method)
>     at javax.security.auth.Subject.doAs(Subject.java:422)
>     at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1730)
>     at 
> org.apache.flink.runtime.security.contexts.HadoopSecurityContext.runSecured(HadoopSecurityContext.java:41)
>     at org.apache.flink.client.cli.CliFrontend.main(CliFrontend.java:1132)
> Caused by: java.util.concurrent.ExecutionException: 
> java.util.concurrent.TimeoutException
>     at 
> java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357)
>     at java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1915)
>     at 
> org.apache.flink.client.cli.CliFrontend.lambda$stop$5(CliFrontend.java:579)
>     ... 9 more
> Caused by: java.util.concurrent.TimeoutException
>     at 
> org.apache.flink.runtime.concurrent.FutureUtils$Timeout.run(FutureUtils.java:1220)
>     at 
> org.apache.flink.runtime.concurrent.DirectExecutorService.execute(DirectExecutorService.java:217)
>     at 
> org.apache.flink.runtime.concurrent.FutureUtils.lambda$orTimeout$15(FutureUtils.java:582)
>     at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
>     at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>     at 
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)
>     at 
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
>     at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>     at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>     at java.lang.Thread.run(Thread.java:748){noformat}
>  
> =========================================================
>  flink can't stop the job,  
> but when i user  'flink stop -m jobmanager.server.host:port    xxxxx'  ,  it 
> work well.
> '-m' is an option args,  The old version does not have this problem
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to