Re: DFS problem with removing checkpoint

2018-05-16 Thread Szymon Szczypiński
Hi, now i know why those files wasn't "remove". They remove but very slow. In my case(Flink 1.3) the problem is in line client.delete().inBackground(backgroundCallback, executor).forPath(path); where deletion is in background in executor pool where size is equal to 2. When i have more files/d

Re: DFS problem with removing checkpoint

2018-04-22 Thread Szymon Szczypiński
HI, the problem was started on 1.3.1. Now I upgraded to Flink 1.3.3. I changed my cluster to 1.3.3  because of jira https://issues.apache.org/jira/browse/FLINK-8807. I will check in debug mode why cluster doesn't remove those files, maybe i will see why. Best regards On 22.04.2018 16:59, St

Re: DFS problem with removing checkpoint

2018-04-22 Thread Stephan Ewen
Hi! Sorry for the late response... In which Flink version are you? I am wondering if this is somewhat related to that specific setup: Windows DFS filesystem mounted on Linux with CIFS - For the "completedCheckpoint", the cleanup should happen in the "ZooKeeperCompletedCheckpointStore" when d

Re: DFS problem with removing checkpoint

2018-04-06 Thread Szymon Szczypiński
Hi, in my case both doesn't deleted. In high-availability.storageDir the number of files of type "completedCheckpoint" are growing and also dirs in "state.backend.fs.checkpointdir/JobId/check-". In my case i have Windows DFS filesystem mounted on linux with cifs protocol. Can you give me a

Re: DFS problem with removing checkpoint

2018-04-02 Thread Stephan Ewen
Can you clarify which one does not get deleted? The file in the "high-availability.storageDir", or the "state.backend.fs.checkpointdir/JobId/check-", or both? Could you also tell us which file system you use? There is a known issue in some versions of Flink that S3 "directories" are not deleted.

DFS problem with removing checkpoint

2018-03-28 Thread Szymon Szczypiński
Hi, i have problem with Flink in version 1.3.1. I have standalone cluster with two JobManagers and four TaskManager, as DFS i use windows high available storage mounted by cifs protocol. And sometimes i'm starting having problem that Flink doesn't remove checkpoint dirs for job and completedChe