Hi,

I recently encountered an issue while using StreamingFileSink.

I have a flink job consuming records from various sources and write to s3
with streaming file sink. But the job sometimes fails due to checkpoint
timeout, and the root cause is checkpoint alignment failure as there is
data skewness between different data sources.

I don't want to enable unaligned checkpointing but prefer to do some
checkpoint tuning first.

My current checkpoint interval is 1 min and timeout is also 1 min. I wanna
increase *tolerable checkpoint failure number* to 5, as I believe the
unaligned subtasks will definitely update their watermark in 5 minutes. My
question is, will streaming file sink still writes to s3 even if the
checkpoint fails or just wait until next successful checkpoint? (as if we
don't tolerate checkpoint failure, the job will simply restart from last
successful checkpoint)


Thanks.

Best,
Kevin

Reply via email to