Hi Lars,

Currently, there is no configuration available to trigger a checkpoint
immediately after the job starts in Flink.

However, we can address this issue from several angles using the guidance
in this document [1].
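
As one possible starting point, the checkpoint settings mentioned in this
thread could be relaxed in the Flink configuration. This is only a sketch:
the values below are illustrative assumptions, not recommendations, and
need to be tuned to the actual state size and restore time of the job.

```yaml
# Illustrative values only (assumptions); adjust to your job.
# Base interval between checkpoint triggers.
execution.checkpointing.interval: 5min
# Minimum pause between the end of one checkpoint and the start of the next.
execution.checkpointing.min-pause: 2min
# Allow slow checkpoints more time before they are declared failed.
execution.checkpointing.timeout: 30min
# Number of consecutive checkpoint failures tolerated before the job fails over.
execution.checkpointing.tolerable-failed-checkpoints: 10
```

Raising the tolerable failure count or the timeout does not stop Flink from
attempting checkpoints while tasks initialize; it only makes it less likely
that those early failures restart the job.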


[1] https://nightlies.apache.org/flink/flink-docs-release-1.19/docs/ops/state/large_state_tuning/



Best regards,
Zhongqiang Gong

On Wed, May 15, 2024 at 05:10, Lars Skjærven <lar...@gmail.com> wrote:

> Hello,
>
> When restarting jobs (e.g. after upgrade) with "large" state a task can
> take some time to "initialize" (depending on the state size). During this
> time I noticed that Flink attempts to checkpoint. In many cases
> checkpointing will fail repeatedly, and cause the job to hit the
> tolerable-failed-checkpoints limit and restart. The only way to overcome
> the issue seems to be to increase the checkpoint interval, but this is
> suboptimal.
>
> Could Flink wait to trigger checkpointing while one or more tasks are
> initializing?
>
> Lars
>
