[ https://issues.apache.org/jira/browse/FLINK-20833?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17258774#comment-17258774 ]
Till Rohrmann commented on FLINK-20833: --------------------------------------- Thanks for creating this ticket [~ZhenqiuHuang]. I like the idea in general. Before starting this effort, I think we need a bit more concrete proposal how to exactly do it and where to place it. I would suggest to not add it directly to the {{ExecutionGraph}} since this structure is already too overloaded with responsibilities. A starting pointer could be the {{ExecutionFailureHandler}} which is responsible for handling execution failures. > Expose pluggable interface for exception analysis and metrics reporting in > Execution Graph > ------------------------------------------------------------------------------------------- > > Key: FLINK-20833 > URL: https://issues.apache.org/jira/browse/FLINK-20833 > Project: Flink > Issue Type: Improvement > Components: Runtime / Coordination > Affects Versions: 1.12.0 > Reporter: Zhenqiu Huang > Priority: Minor > > For platform users of Apache flink, people usually want to classify the > failure reason( for example user code, networking, dependencies and etc) for > Flink jobs and emit metrics for those analyzed results. So that platform can > provide an accurate value for system reliability by distinguishing the > failure due to user logic from the system issues. -- This message was sent by Atlassian Jira (v8.3.4#803005)