[ https://issues.apache.org/jira/browse/FLINK-7894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17194263#comment-17194263 ]
Matthias commented on FLINK-7894: --------------------------------- [~chesnay] Does these feature is still valuable? Should we leave this issue open? > Improve metrics around fine-grained recovery and associated checkpointing > behaviors > ----------------------------------------------------------------------------------- > > Key: FLINK-7894 > URL: https://issues.apache.org/jira/browse/FLINK-7894 > Project: Flink > Issue Type: Improvement > Components: Runtime / Metrics > Affects Versions: 1.3.2, 1.4.0 > Reporter: Zhenzhong Xu > Priority: Major > > Currently, the only metric around fine-grained recovery is "task_failures". > It's a very high level metric, it would be nice to have the following > improvements: > * Allows slice and dice into which tasks were restarted. > * Recovery duration. > * Recovery associated checkpoint behaviors: cancels, failures, etc -- This message was sent by Atlassian Jira (v8.3.4#803005)