xingbe created FLINK-35522:
------------------------------
Summary: The source task may get stuck after a failover occurs in
batch jobs
Key: FLINK-35522
URL: https://issues.apache.org/jira/browse/FLINK-35522
Project: Flink
Issue Type: Bug
Components: Runtime / Coordination
Affects Versions: 1.18.1, 1.19.0, 1.17.2, 1.20.0
Reporter: xingbe
Fix For: 1.20.0
If the source task does not get assigned a split because the SplitEnumerator
has no more splits, and a failover occurs during the closing process, the
SourceCoordinatorContext will not resend the NoMoreSplit event to the newly
started source task, causing the source vertex to remain stuck indefinitely.
This case may only occur in batch jobs where speculative execution has been
enabled.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)