Re: Checkpointing not working

Stefan Richter Thu, 20 Sep 2018 03:06:01 -0700

Hi,

in the absence of any logs, my guess would be that your checkpoints are just 
not able to complete within 10 seconds, the state might be to large or the 
network and fs to slow. Are you using full or incremental checkpoints? For your 
relative small interval, I suggest that you try using incremental checkpoints. 
Still thinking that your timeout and interval is pretty ambitious.


Best,
Stefan

> Am 20.09.2018 um 10:17 schrieb vino yang <yanghua1...@gmail.com>:
> 
> Hi Yubraj,
> 
> Can you set your log print level to DEBUG and share it with us or share a 
> screenshot of your Flink web UI checkpoint information?
> 
> Thanks, vino.
> 
> Jörn Franke <jornfra...@gmail.com <mailto:jornfra...@gmail.com>> 
> 于2018年9月19日周三 下午2:37写道：
> What do the logfiles say?
> 
> How does the source code looks like?
> 
> Is it really needed to do checkpointing every 30 seconds?
> 
> On 19. Sep 2018, at 08:25, yuvraj singh <19yuvrajsing...@gmail.com 
> <mailto:19yuvrajsing...@gmail.com>> wrote:
> 
>> Hi , 
>> 
>> I am doing checkpointing using s3 and rocksdb , 
>> i am doing checkpointing per 30 seconds and time out is 10 seconds .
>> 
>> most of the time its failing by saying Failure Time: 11:53:17Cause: 
>> Checkpoint expired before completing .
>> I  increases the timeout  as well still it not working for me .
>> 
>> please suggest .
>> 
>> Thanks 
>> Yubraj Singh

Re: Checkpointing not working

Reply via email to