Matthias Pohl created FLINK-33500: ------------------------------------- Summary: Run storing the JobGraph an asynchronous operation Key: FLINK-33500 URL: https://issues.apache.org/jira/browse/FLINK-33500 Project: Flink Issue Type: Improvement Components: Runtime / Coordination Affects Versions: 1.17.1, 1.18.0, 1.19.0 Reporter: Matthias Pohl
Currently, submitting a job starts with storing the JobGraph (in HA setups) in the {{{}JobGraphStore{}}}. This includes writing the file to S3 (or some other remote file system). The job submission is done in the {{{}Dispatcher{}}}'s main thread. If writing the {{JobGraph}} is slow, it would block any other operation on the {{{}Dispatcher{}}}. See [Dispatcher#persistAndRunJob|https://github.com/apache/flink/blob/52cbeb90f32ca36c59590df1daa6748995c9b7f8/flink-runtime/src/main/java/org/apache/flink/runtime/dispatcher/Dispatcher.java#L645] as code reference. This Jira issue is about moving the job submission into the {{ioExecutor}} as an asynchronous call. -- This message was sent by Atlassian Jira (v8.20.10#820010)