Thanks for writing this FLIP and initiating the discussion, Gen, Yun and Junhan!
It will be very useful to have these improvements on the web UI for
speculative execution users, allowing them to know what is happening.
I just have a few comment regarding the design details:

1. Can we also show "Blocked Slots" in the resource card, so that users
can easily figure out how many slots are available/blocked/in-use?
2. I think "speculative-attempts" is not accurate, because the
root/fastest current can be a specualtive execution attempt, and in
this case "speculative-attempts" will contain the intial execution
attempt. How about name it as "other-concurrent-attempts"?
3. I think ArchivedSpeculativeExecutionVertex is not necessarily
needed. We can rework the ArchivedExecutionVertex to contains a set of
current execution attempts. The set will have one only element in
non-speculative cases though. In this way, we can have a unified
processing for ArchivedExecutionVertex in speculative/non-speculative
cases.

Thanks,
Zhu

Gen Luo <luogen...@gmail.com> 于2022年7月5日周二 15:10写道:

>
> Hi everyone,
>
> The speculative execution for batch jobs has been proposed and accepted in
> FLIP-168[1], as well as the related blocklist mechanism in FLIP-224[2]. As
> a follow-up step, the Flink Web UI needs to be enhanced to display the
> related information if the speculative execution mechanism is enabled.
>
> Junhan Yang, Yun Gao and I would like to start the discussion about the Web
> UI enhancement and the corresponding REST API changes in FLIP-249[3],
> including:
> - show the speculative executions in the subtask list and the backpressure
> page, where the fastest is shown directly while others are folded;
> - show the number of the blocked task managers in the Task Managers and
> Slots card, when the number is not 0;
> - show the BLOCKED label in the task manager list and the task manager
> detail page for the blocked task managers.
>
> All changes expect to be transparent to users who don’t use speculative
> execution.
>
> Please see the FLIP page[3] for more details. Looking forward to your
> feedback.
>
> [1]
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-168%3A+Speculative+Execution+for+Batch+Job
> [2]
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-224%3A+Blocklist+Mechanism
> [3]
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-249%3A+Flink+Web+UI+Enhancement+for+Speculative+Execution

Reply via email to