I see for every consequential checkpoint timeout fail , number of tasks
which completed checkpointing keeps decreasing, why would that happen? Does
flink try to process data beyond old checkpoint barrier which failed to
complete due to timeout?


On Tue, Mar 8, 2022 at 12:48 AM yidan zhao <hinobl...@gmail.com> wrote:

> If the checkpoint timeout leads to the job's fail, then the job will be
> recovered and data will be reprocessed from the last completed checkpoint.
> If the job doesn't fail, then not.
>
> Mahantesh Patil <mahantesh...@gmail.com> 于2022年3月8日周二 14:47写道:
>
>> Hello Team,
>>
>> What happens after checkpoint timeout?
>>
>> Does Flink reprocess data from the previous checkpoint for all tasks?
>>
>> I have one compute intensive operator with parallelism of 20 and only one
>> of the parallel tasks seems to get stuck because of data skew. On
>> checkpoint timeout , will data be reprocessed or continue processing new
>> data? If not, will increasing checkpoint timeout help.
>>
>> Checkpoint Configuration:
>>
>> CheckpointingMode.EXACTLY_ONCE;
>>
>> CheckPointTimeOut 10 min;
>>
>> MinPauseBetweenCheckpoints 30 sec;
>>
>> CheckPointingInterval 30 sec;
>>
>>
>> Thanks,
>> Mahantesh
>>
>>

Reply via email to