Hi all,

One major problem with Flink batch jobs is that slow tasks running on hot or bad nodes can result in very long execution times.
To solve this problem, FLIP-168: Speculative Execution for Batch Job [1] was recently introduced and approved. Zhu Zhu and I now propose supporting speculative execution of sources as one of the follow-ups to FLIP-168. You can find more details in FLIP-245 [2].

Looking forward to your feedback.

Best,
Jing Zhang

[1] https://cwiki.apache.org/confluence/display/FLINK/FLIP-168%3A+Speculative+Execution+for+Batch+Job#FLIP168:SpeculativeExecutionforBatchJob-NointegrationwithFlink'swebUI
[2] https://cwiki.apache.org/confluence/display/FLINK/FLIP-245%3A+Source+Supports+Speculative+Execution+For+Batch+Job