Hi! I'm looking into integrating Flink into our stack and one of the requirements is to report metrics to an internal system. The current Accumulators are not adequate to provide visibility that we need to run such a system in production. We want much more information about the internal cluster state and ability to calculate aggregates ourselves. The core reporting API accepts a metric name, metric type (gauge, counter, timer) and a set of key value pairs that act as dimensions.
The ideal solution for us would report the metrics through such API and provide default binding to existing Accumulators, but allow overriding it to our internal reporting client. Is it something that could be added to the Flink or there are other plans for monitoring? Thanks! Maxim.