rmetzger opened a new pull request #13217: URL: https://github.com/apache/flink/pull/13217
## What is the purpose of the change This is changing the semantics of the job submission: Instead of completing the `Dispatcher.submitJob()` future after all the initialization happened (which can potentially involve calling external systems etc.), the `.submitJob()` call now returns as soon as the job has been accepted by the Dispatcher. The benefit of this change is that the users will see the root cause of a submission timeout, instead of an akka.ask.timeout. ## Brief change log - Introduce a `DispatcherJob` abstraction that manages the job in a new `INITIALIZING` state - Change web frontend to cope with initializing jobs - change clients to submit & poll ## Verifying this change This PR introduces various new tests for verification. ## Does this pull request potentially affect one of the following parts: - Dependencies (does it add or upgrade a dependency): no - The public API, i.e., is any changed class annotated with `@Public(Evolving)`: no - The serializers: no - The runtime per-record code paths (performance sensitive): no - Anything that affects deployment or recovery: JobManager (and its components), Checkpointing, Kubernetes/Yarn/Mesos, ZooKeeper: yes 🔥 - The S3 file system connector: no ## Documentation This change is transparent to the user and doesn't need a documentation update. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org