Re: Total recovery time estimation after checkpoint recovery

2020-02-19 Thread Till Rohrmann
> From: Till Rohrmann > Sent: Tuesday, February 18, 2020 4:43 PM > To: dev > Subject: Re: Total recovery time estimation after checkpoint recovery > > Hi Jessica, > > did you receive my previous email with the explanation? > > Cheers, >

Re: Total recovery time estimation after checkpoint recovery

2020-02-18 Thread Woods, Jessica Hui
Hi Till, No, I have not received any emails regarding my question. Could you please forward your response to me? Thanks From: Till Rohrmann Sent: Tuesday, February 18, 2020 4:43 PM To: dev Subject: Re: Total recovery time estimation after checkpoint

Re: Total recovery time estimation after checkpoint recovery

2020-02-18 Thread Till Rohrmann
Hi Jessica, did you receive my previous email with the explanation? Cheers, Till On Sat, Feb 15, 2020 at 11:45 PM Woods, Jessica Hui < jessica.wo...@campus.tu-berlin.de> wrote: > Hi, > > I am working with Apache Flink and am interested in knowing how one could > estimate the total amount of t

Re: Total recovery time estimation after checkpoint recovery

2020-02-14 Thread Till Rohrmann
Hi Jessica, multiple factors affect the total recovery time. First of all, Flink needs to detect that something went wrong. In the worst case this happens through the missing heartbeat of a dead machine. The heartbeat timeout defaults to 50s, but one can tune it. Next, Flink needs to ca
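
As a rough illustration of the failure-detection step described above, here is a minimal Python sketch that bounds how long a dead machine can go unnoticed. It assumes a simple model that is not stated in the thread: heartbeats are exchanged at a fixed interval and the timeout timer restarts on every heartbeat received. The function name `detection_delay_bounds` is hypothetical, and the 10s interval used in the example is an assumed value; only the 50s timeout comes from the message.

```python
def detection_delay_bounds(heartbeat_interval_s, heartbeat_timeout_s):
    """Return (best_case, worst_case) seconds between a machine dying
    and the heartbeat timeout firing, under the simple model above."""
    if heartbeat_interval_s <= 0:
        raise ValueError("heartbeat interval must be positive")
    if heartbeat_timeout_s <= heartbeat_interval_s:
        raise ValueError("timeout must be larger than the interval")
    # Worst case: the machine dies right after a heartbeat was received,
    # so the full timeout has to elapse before the failure is noticed.
    worst_case = heartbeat_timeout_s
    # Best case: the machine dies just before its next heartbeat was due,
    # so one interval of the timeout has already elapsed.
    best_case = heartbeat_timeout_s - heartbeat_interval_s
    return best_case, worst_case


# Example: 50s timeout (from the message), 10s interval (assumed).
best, worst = detection_delay_bounds(10.0, 50.0)
print(f"failure detected after {best:.0f}s to {worst:.0f}s")
```

This covers only detection through a missed heartbeat. The remaining factors in the total recovery time are not modeled here.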