Hi Lars, Currently, there is no configuration available to trigger a checkpoint immediately after the job starts in Flink.
But we can address this issue from multiple perspectives using the insights provided in this document [1]. [1] https://nightlies.apache.org/flink/flink-docs-release-1.19/docs/ops/state/large_state_tuning/ Best regards, Zhongqiang Gong Lars Skjærven <lar...@gmail.com> 于2024年5月15日周三 05:10写道: > Hello, > > When restarting jobs (e.g. after upgrade) with "large" state a task can > take some time to "initialize" (depending on the state size). During this > time I noticed that Flink attempts to checkpoint. In many cases > checkpointing will fail repeatedly, and cause the job to hit the > tolerable-failed-checkpoints limit and restart. The only way to overcome > the issue seems to be to increase the checkpoint interval, but this is > suboptimal. > > Could Flink wait to trigger checkpointing when one or more task is > initializing? > > Lars >