Hi,
I have been experimenting with HA lately and see that it successfully
recovers the job when the JobManager restarts.
Now my question is whether this will also work for a job cluster.
Based on the instructions at
https://github.com/apache/flink/blob/release-1.8/flink-container/docker/README.md
I can see in
https://github.com/apache/flink/blob/release-1.8/flink-container/docker/docker-entrypoint.sh
that in this case the following command is invoked:
exec $FLINK_HOME/bin/standalone-job.sh start-foreground "$@"

This means that if the JobManager restarts, the following will happen:

1. It will use HA to restore the job that was running.
2. A new job will be submitted, overwriting the restored job and bypassing
the checkpoint restore.

Am I missing something here?


Boris Lublinsky
FDP Architect
boris.lublin...@lightbend.com
https://www.lightbend.com/
