[ 
https://issues.apache.org/jira/browse/FLINK-20833?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17265123#comment-17265123
 ] 

Robert Metzger commented on FLINK-20833:
----------------------------------------

Thanks for your summary.
I haven't understood why you chose "ExecutionFailureClassifier" as the 
interface name? I wonder if "ExceptionListener" is a more suitable name.
Secondly, I don't understand why we need a default no-op operation.
Can't we just call the listener only if it's set?

How do you plan to load a custom ExceptionListener/ExecutionFailureClassifier 
implementation? Are users supposed to put a jar file with the implementation 
into Flink's classpath?

You mentioned that you want to send metrics in your own implementation: Do you 
want to use Flink's metrics system, or will your listener just establish a 
connection to a metrics system to report them there?

> Expose pluggable interface for  exception analysis and metrics reporting in 
> Execution Graph
> -------------------------------------------------------------------------------------------
>
>                 Key: FLINK-20833
>                 URL: https://issues.apache.org/jira/browse/FLINK-20833
>             Project: Flink
>          Issue Type: Improvement
>          Components: Runtime / Coordination
>    Affects Versions: 1.12.0
>            Reporter: Zhenqiu Huang
>            Priority: Minor
>
> For platform users of Apache flink, people usually want to classify the 
> failure reason( for example user code, networking, dependencies and etc) for 
> Flink jobs and emit metrics for those analyzed results. So that platform can 
> provide an accurate value for system reliability by distinguishing the 
> failure due to user logic from the system issues. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to