I have some updates: I found some weird behaviour. Please refer to the attached screenshot.
All requests were sent via the REST API.

The savepoint triggered by the stop request (ID 11018) has the status "COMPLETED [Savepoint]"; however, no checkpoint data has been persisted to S3. The folder `savepoint-5871af-c0f2d2334501/_metadata/` has been created in S3, but it contains no files. This was the command I used to send the first stop request:

curl -s -d '{"drain": false, "targetDirectory":"s3://mybucket/savepoint"}' -H 'Content-Type: application/json' -X POST http://myip:45507/jobs/5871af88ff279f30ebcc49ce741c2d75/stop

Suspecting that s3:// might be the issue, I sent another stop request (ID 11020), but mistakenly gave the path as s3s://, so it failed.

I then sent another stop request (ID 11021). This one failed after the timeout (10 minutes). The GUI says the checkpoint failed with `Checkpoint expired before completing`.

curl -s -d '{"drain": false, "targetDirectory":"s3a://mybucket/savepoint"}' -H 'Content-Type: application/json' -X POST http://myip:45507/jobs/5871af88ff279f30ebcc49ce741c2d75/stop

Finally, I sent a create-savepoint request (ID 11023), and this time it completed successfully, with files persisted to S3. Checking the Flink GUI, I could see that the job had actually resumed before that savepoint request (with the checkpoint ID 11021 created just 30 seconds after 11021 expired).

curl -s -d '{"target-directory":"s3a://mybucket/savepoint", "cancel-job": false}' -H 'Content-Type: application/json' -X POST http://myip:45507/jobs/5871af88ff279f30ebcc49ce741c2d75/savepoints

Screenshot: <http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/file/t1586/Screen_Shot_2020-11-13_at_11.png>
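In case it helps anyone reproduce this, here is a rough sketch of how the outcome of a stop request could be checked over REST instead of in the GUI. It assumes, as I understand the Flink REST API, that the POST returns a JSON body with a "request-id" field (a trigger ID, not the checkpoint ID shown in the GUI) and that the status can be read from /jobs/<jobid>/savepoints/<triggerid>, reporting "IN_PROGRESS" until the operation finishes. The function names (stop_payload, extract_request_id, savepoint_status_url, stop_and_wait) are made up for illustration, and I have not run this against the cluster.

```shell
#!/usr/bin/env bash
# Sketch only: helper names are hypothetical; the response field names
# ("request-id", "IN_PROGRESS") are assumptions based on the Flink REST docs.

FLINK_URL="${FLINK_URL:-http://myip:45507}"
JOB_ID="${JOB_ID:-5871af88ff279f30ebcc49ce741c2d75}"

# Build the JSON body for POST /jobs/<jobid>/stop.
stop_payload() {
  printf '{"drain": false, "targetDirectory": "%s"}' "$1"
}

# Read a trigger response on stdin and print the value of "request-id".
extract_request_id() {
  sed -n 's/.*"request-id"[[:space:]]*:[[:space:]]*"\([^"]*\)".*/\1/p'
}

# URL for querying the status of the asynchronous savepoint operation.
savepoint_status_url() {
  printf '%s/jobs/%s/savepoints/%s' "$FLINK_URL" "$JOB_ID" "$1"
}

# Send the stop request, then poll until the operation is no longer in progress.
# Prints the final status body, which should carry either a location or a failure cause.
stop_and_wait() {
  local target="$1" request_id body i
  request_id=$(curl -s -d "$(stop_payload "$target")" \
    -H 'Content-Type: application/json' \
    -X POST "$FLINK_URL/jobs/$JOB_ID/stop" | extract_request_id)
  if [ -z "$request_id" ]; then
    echo "no request-id in response" >&2
    return 1
  fi
  # Give up after about 10 minutes (120 polls, 5 seconds apart).
  for ((i = 0; i < 120; i++)); do
    body=$(curl -s "$(savepoint_status_url "$request_id")")
    case "$body" in
      *'"IN_PROGRESS"'*) sleep 5 ;;
      *) printf '%s\n' "$body"; return 0 ;;
    esac
  done
  echo "still in progress after polling limit" >&2
  return 1
}
```

Usage would be something like `stop_and_wait s3a://mybucket/savepoint`. Comparing the final status body with what actually lands in S3 should show whether a "COMPLETED" status really comes with a savepoint location.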