Zakelly Lan created FLINK-38347: ----------------------------------- Summary: Checkpoint file-merging manager may delete the directory unexpectedly when some RPC messages lost Key: FLINK-38347 URL: https://issues.apache.org/jira/browse/FLINK-38347 Project: Flink Issue Type: Bug Affects Versions: 1.20.2, 2.0.0, 2.1.0 Reporter: Zakelly Lan Assignee: Zakelly Lan
In FLINK-32086, we delete the orphan directories created by file-merging manager. The orphan check depends on the checkpoint notifications. So we should tolerate rpc messages lost, but current implementation using reference counting which does not verify the message completeness using checkpoint id. That may cause unexpected directory deletion, although this is rare. -- This message was sent by Atlassian Jira (v8.20.10#820010)