Re: Re: Re: Checkpoint Error

2021-03-10 Thread Till Rohrmann
og?
>
> Also, have you enabled concurrent checkpoint?
>
> Best,
> Yun
>
>
> --Original Mail --
> *Sender:* Navneeth Krishnan
> *Send Date:* Mon Mar 8 13:10:46 2021
> *Recipients:* Yun Gao
> *CC:* user
> *Subject:* Re: Re: Checkpoint

Re: Re: Re: Checkpoint Error

2021-03-08 Thread Yun Gao
Hi Navneeth, Is the attached exception the root cause of the checkpoint failure? Namely, is it also reported in the JobManager log? Also, have you enabled concurrent checkpoints? Best, Yun --Original Mail -- Sender:Navneeth Krishnan Send Date:Mon Mar 8 13:10:4

Re: Re: Checkpoint Error

2021-03-07 Thread Navneeth Krishnan
Hi Yun, Thanks for the response. I checked the mounts, and only the JMs and TMs mount this EFS. I'm not sure how to debug this. Thanks On Sun, Mar 7, 2021 at 8:29 PM Yun Gao wrote: > Hi Navneeth, > > It seems from the stack that the exception is caused by the underlying EFS > problems

Re: Re: Checkpoint Error

2021-03-07 Thread Yun Gao
Hi Navneeth, It seems from the stack trace that the exception is caused by an underlying EFS problem? Have you checked whether any errors are reported for EFS, or whether the same EFS might be mounted elsewhere and someone else has deleted the directory? Best, Yun --Original