Hi,

I have been experimenting with HA lately and see that it successfully recovers the job when the JobManager restarts. My question is whether this also works for a job cluster.

Following the instructions in the README ( https://github.com/apache/flink/blob/release-1.8/flink-container/docker/README.md ), I can see in docker-entrypoint.sh ( https://github.com/apache/flink/blob/release-1.8/flink-container/docker/docker-entrypoint.sh ) that in this case the following command is invoked:

    exec $FLINK_HOME/bin/standalone-job.sh start-foreground "$@"
This means that if the JobManager restarts, the following is going to happen:

1. It will use HA to restore the job that was running.
2. A new job will be submitted, overwriting the restored job and bypassing the checkpoint restore.

Am I missing something here?

Boris Lublinsky
FDP Architect
boris.lublin...@lightbend.com
https://www.lightbend.com/