[ 
https://issues.apache.org/jira/browse/FLINK-34025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Emre Kartoglu updated FLINK-34025:
----------------------------------
    Description: 
*Problem:* Currently users have to click on every operator and check how much 
data each subtask is processing to see if there is data skew. This is 
particularly cumbersome and error-prone for jobs with big job graphs. Data skew 
is an important metric that should be more visible.

 

*Proposed solution:*
 * Show a data skew score on each operator (see screenshot below). This would 
be an improvement, but would not be sufficient. As it would still not be easy 
to see the data skew score for jobs with very large job graphs (it'd require a 
lot of zooming in/out).
 * Show data skew score for each operator under a new tab. See screenshot below 
 !skew_tab.png|width=1226,height=719! .

 

!skew_proposal.png|width=845,height=253!

  was:
*Problem:* Currently users have to click on every operator and check how much 
data each subtask is processing to see if there is data skew. This is 
particularly cumbersome and error-prone for jobs with big job graphs. Data skew 
is an important metric that should be more visible.

 

*Proposed solution:*
 * Show a data skew score on each operator (see screenshot below). This would 
be an improvement, but would not be sufficient. As it would still not be easy 
to see the data skew score for jobs with very large job graphs (it'd require a 
lot of zooming in/out).
 * Show data skew score for each operator under a new tab. See screenshot below 
!skew_tab.png! .

 

!skew_proposal.png!


> Show data skew score on Flink Dashboard
> ---------------------------------------
>
>                 Key: FLINK-34025
>                 URL: https://issues.apache.org/jira/browse/FLINK-34025
>             Project: Flink
>          Issue Type: New Feature
>          Components: Runtime / Web Frontend
>    Affects Versions: 1.19.0
>            Reporter: Emre Kartoglu
>            Priority: Major
>              Labels: dashboard
>         Attachments: skew_proposal.png, skew_tab.png
>
>
> *Problem:* Currently users have to click on every operator and check how much 
> data each subtask is processing to see if there is data skew. This is 
> particularly cumbersome and error-prone for jobs with big job graphs. Data 
> skew is an important metric that should be more visible.
>  
> *Proposed solution:*
>  * Show a data skew score on each operator (see screenshot below). This would 
> be an improvement, but would not be sufficient. As it would still not be easy 
> to see the data skew score for jobs with very large job graphs (it'd require 
> a lot of zooming in/out).
>  * Show data skew score for each operator under a new tab. See screenshot 
> below  !skew_tab.png|width=1226,height=719! .
>  
> !skew_proposal.png|width=845,height=253!



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to