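Hi,

Under the hood, the stop command is a REST call to Flink, and the CLI waits for the result. If creating the savepoint takes too long, the client can hit a timeout while the operation itself continues on the cluster, which would explain why your savepoint is still written and the job still stops. You can raise the limit with the client.timeout configuration option [1]. Alternatively, you can pass the --detached option to the stop action, in which case the CLI returns as soon as it receives a trigger ID from Flink [2].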
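
As a rough sketch of the first option, assuming a Flink 1.20 setup whose CLI reads conf/config.yaml (older setups use conf/flink-conf.yaml instead), the timeout could be raised on the machine where the CLI runs. The 10 min value is only a placeholder; pick something that comfortably covers how long your savepoints take:

```yaml
# conf/config.yaml on the client machine (the file location is an assumption about your setup)
# client.timeout has a documented default of 60 s; 10 min is an illustrative value only.
client.timeout: 10 min
```

With --detached this change should not be needed, because the CLI no longer waits for the savepoint to finish.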
Best,
Mate

[1] https://nightlies.apache.org/flink/flink-docs-release-1.20/docs/deployment/config/#client-timeout
[2] https://nightlies.apache.org/flink/flink-docs-release-1.20/docs/deployment/cli/#creating-a-savepoint

Sachin Mittal <sjmit...@gmail.com> wrote (on Fri, Oct 11, 2024, 6:19):

> Hello,
> I am running a Flink job, which I stop with a savepoint:
>
> ./bin/flink stop --savepointPath /tmp/flink-savepoints 0b3b584a298afa372491eff5e3d2160b
> Suspending job "0b3b584a298afa372491eff5e3d2160b" with a CANONICAL savepoint.
>
> However, this is what I get in the CLI:
> ------------------------------------------------------------
> The program finished with the following exception:
>
> org.apache.flink.util.FlinkException: Could not stop with a savepoint job "0b3b584a298afa372491eff5e3d2160b".
>     at org.apache.flink.client.cli.CliFrontend.lambda$stop$4(CliFrontend.java:595)
>     at org.apache.flink.client.cli.CliFrontend.runClusterAction(CliFrontend.java:1041)
>     at org.apache.flink.client.cli.CliFrontend.stop(CliFrontend.java:578)
>     at org.apache.flink.client.cli.CliFrontend.parseAndRun(CliFrontend.java:1110)
>     at org.apache.flink.client.cli.CliFrontend.lambda$mainInternal$9(CliFrontend.java:1189)
>     at org.apache.flink.runtime.security.contexts.NoOpSecurityContext.runSecured(NoOpSecurityContext.java:28)
>     at org.apache.flink.client.cli.CliFrontend.mainInternal(CliFrontend.java:1189)
>     at org.apache.flink.client.cli.CliFrontend.main(CliFrontend.java:1157)
> Caused by: java.util.concurrent.TimeoutException
>     at java.base/java.util.concurrent.CompletableFuture.timedGet(CompletableFuture.java:1950)
>     at java.base/java.util.concurrent.CompletableFuture.get(CompletableFuture.java:2085)
>     at org.apache.flink.client.cli.CliFrontend.lambda$stop$4(CliFrontend.java:591)
>     ... 7 more
>
> What I also see is that a savepoint does get generated at the specified path, and my Flink job is also stopped after a while.
>
> Is there any setting that makes the CLI program time out, and is there a way to verify that the entire savepoint was generated at the specified path?
>
> Thanks
> Sachin