Yes. File got deleted . 2019-01-15 10:40:41,360 INFO FSNamesystem.audit: allowed=true ugi=hdfs (auth:SIMPLE) ip=/192.168.3.184 cmd=delete src=/pipeline/job/checkpoints/e9a08c0661a6c31b5af540cf352e1265/chk-470/5fb3a899-8c0f-45f6-a847-42cbb71e6d19 dst=null perm=null proto=rpc
Looks like file was deleted from job itself . Does it cause job restart then ? If checkpoint fails then it should try next checkpoint or restart job ? -- Sent from: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/