Matthias Pohl created FLINK-33500:
-------------------------------------

             Summary: Run storing the JobGraph as an asynchronous operation
                 Key: FLINK-33500
                 URL: https://issues.apache.org/jira/browse/FLINK-33500
             Project: Flink
          Issue Type: Improvement
          Components: Runtime / Coordination
    Affects Versions: 1.17.1, 1.18.0, 1.19.0
            Reporter: Matthias Pohl


Currently, in HA setups, submitting a job starts with storing the {{JobGraph}} in 
the {{JobGraphStore}}. This includes writing the file to S3 (or some other 
remote file system). The job submission runs in the {{Dispatcher}}'s 
main thread, so if writing the {{JobGraph}} is slow, it blocks every other 
operation on the {{Dispatcher}}. See 
[Dispatcher#persistAndRunJob|https://github.com/apache/flink/blob/52cbeb90f32ca36c59590df1daa6748995c9b7f8/flink-runtime/src/main/java/org/apache/flink/runtime/dispatcher/Dispatcher.java#L645]
 as code reference.

This Jira issue is about moving the job submission into the {{ioExecutor}} as 
an asynchronous call.
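
As a rough illustration of the idea (not the actual Flink code), the blocking write could be handed to an I/O executor and the follow-up work scheduled back onto the main thread once the write completes. Everything in this sketch is hypothetical: {{SlowStore}} stands in for the {{JobGraphStore}}, plain strings stand in for the {{JobGraph}} and the job's result, and a single-threaded executor stands in for the {{Dispatcher}}'s main thread.

```java
import java.util.Map;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.Executor;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

public class AsyncPersistSketch {

    /** Hypothetical stand-in for a JobGraphStore backed by slow remote storage. */
    static final class SlowStore {
        private final Map<String, String> persisted = new ConcurrentHashMap<>();

        void put(String jobId, String jobGraph) {
            try {
                Thread.sleep(50); // simulate a slow write to S3 or a similar file system
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                throw new IllegalStateException(e);
            }
            persisted.put(jobId, jobGraph);
        }

        boolean contains(String jobId) {
            return persisted.containsKey(jobId);
        }
    }

    /**
     * Persists the job graph on the I/O executor, then continues on the
     * main-thread executor. The caller's thread is never blocked by the write.
     */
    static CompletableFuture<String> persistAndRunJob(
            SlowStore store,
            String jobId,
            String jobGraph,
            Executor ioExecutor,
            Executor mainThreadExecutor) {
        return CompletableFuture
                .runAsync(() -> store.put(jobId, jobGraph), ioExecutor)
                // State changes after persisting go back to the main thread.
                .thenApplyAsync(ignored -> "RUNNING:" + jobId, mainThreadExecutor);
    }

    public static void main(String[] args) throws Exception {
        ExecutorService ioExecutor = Executors.newFixedThreadPool(2);
        ExecutorService mainThread = Executors.newSingleThreadExecutor();
        try {
            SlowStore store = new SlowStore();
            CompletableFuture<String> submission =
                    persistAndRunJob(store, "job-1", "graph", ioExecutor, mainThread);

            // While the write is in flight, the main thread can still serve other calls.
            CompletableFuture<String> other =
                    CompletableFuture.supplyAsync(() -> "pong", mainThread);

            System.out.println(other.get());
            System.out.println(submission.get());
        } finally {
            ioExecutor.shutdown();
            mainThread.shutdown();
        }
    }
}
```

The second stage uses {{thenApplyAsync}} with the main-thread executor so that anything touching shared state still runs single-threaded; only the slow write leaves the main thread. A failed write completes the returned future exceptionally, so the caller would need to handle that case.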



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
