Re: Checkpoint loading failure

2021-06-16 Thread Guowei Ma
Hi Padarn,

Do these errors also occur if the job graph is not modified? Also, is this the complete error stack? Is it possible that some other error caused the stream to be closed?

Best,
Guowei

On Tue, Jun 15, 2021 at 9:54 PM Padarn Wilson wrote:
> Hi all,
>
> We have a job that has a medium size state

Checkpoint loading failure

2021-06-15 Thread Padarn Wilson
Hi all,

We have a job with medium-sized state (around 4GB). After adding a new part to the job graph (which should not impact the job much), we found that every single checkpoint restore fails with the following error:

Caused by: java.io.IOException: s3a://: Stream is closed!
    at org.apach