[ 
https://issues.apache.org/jira/browse/FLINK-20833?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17258774#comment-17258774
 ] 

Till Rohrmann commented on FLINK-20833:
---------------------------------------

Thanks for creating this ticket [~ZhenqiuHuang]. I like the idea in general. 
Before starting this effort, I think we need a more concrete proposal for how 
exactly to do it and where to place it. I would suggest not adding it directly 
to the {{ExecutionGraph}}, since this structure is already overloaded with 
responsibilities. A starting point could be the {{ExecutionFailureHandler}}, 
which is responsible for handling execution failures.
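
To make the discussion more concrete, here is a minimal sketch of what such a 
pluggable hook could look like if it lived in a small component of its own 
rather than in the {{ExecutionGraph}}. All names in it ({{FailureCategory}}, 
{{FailureClassifier}}, {{FailureClassification}}) are hypothetical and are not 
existing Flink classes. The plain counter map only stands in for whatever 
metrics reporting the actual proposal would use, and how the component would 
be wired into the failure handling path is left open.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.EnumMap;
import java.util.List;
import java.util.Map;
import java.util.Optional;

/** Hypothetical failure categories (assumption, not part of Flink). */
enum FailureCategory { USER_CODE, NETWORK, DEPENDENCY, UNKNOWN }

/** Hypothetical plug-in interface: one implementation per kind of analysis. */
interface FailureClassifier {
    /** Returns a category if this classifier recognizes the cause, else empty. */
    Optional<FailureCategory> classify(Throwable cause);
}

/**
 * Hypothetical component that a failure handler could delegate to.
 * It asks the registered classifiers in order and counts the result.
 */
final class FailureClassification {
    private final List<FailureClassifier> classifiers = new ArrayList<>();
    // Stand-in for real metrics: one counter per category.
    private final Map<FailureCategory, Long> counts = new EnumMap<>(FailureCategory.class);

    void register(FailureClassifier classifier) {
        classifiers.add(classifier);
    }

    /** The first classifier that recognizes the cause wins; otherwise UNKNOWN. */
    FailureCategory onFailure(Throwable cause) {
        FailureCategory category = FailureCategory.UNKNOWN;
        for (FailureClassifier classifier : classifiers) {
            Optional<FailureCategory> result = classifier.classify(cause);
            if (result.isPresent()) {
                category = result.get();
                break;
            }
        }
        counts.merge(category, 1L, Long::sum);
        return category;
    }

    long count(FailureCategory category) {
        return counts.getOrDefault(category, 0L);
    }
}

class FailureClassificationExample {
    public static void main(String[] args) {
        FailureClassification classification = new FailureClassification();
        // Example plug-in (assumption): treat every IOException as a network failure.
        classification.register(cause ->
                cause instanceof IOException
                        ? Optional.of(FailureCategory.NETWORK)
                        : Optional.empty());

        System.out.println(classification.onFailure(new IOException("connection reset")));
        System.out.println(classification.onFailure(new IllegalStateException("bug")));
        System.out.println(classification.count(FailureCategory.NETWORK));
    }
}
```

Keeping the classifiers behind a narrow interface like this would let a 
platform ship its own analysis without the core failure handling needing to 
know about specific categories.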

> Expose pluggable interface for exception analysis and metrics reporting in 
> Execution Graph
> -------------------------------------------------------------------------------------------
>
>                 Key: FLINK-20833
>                 URL: https://issues.apache.org/jira/browse/FLINK-20833
>             Project: Flink
>          Issue Type: Improvement
>          Components: Runtime / Coordination
>    Affects Versions: 1.12.0
>            Reporter: Zhenqiu Huang
>            Priority: Minor
>
> Platform users of Apache Flink usually want to classify the failure reason 
> (for example user code, networking, dependencies, etc.) for Flink jobs and 
> emit metrics for the analyzed results, so that the platform can provide an 
> accurate value for system reliability by distinguishing failures due to 
> user logic from system issues. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
