I implemented some custom Prometheus metrics that worked on Flink
1.16.2 with the following configuration:

metrics.reporter.prom.factory.class: org.apache.flink.metrics.prometheus.PrometheusReporterFactory
metrics.reporter.prom.port: 9999

I could see both the Flink metrics and my custom metrics on port 9999
of my task managers.

After upgrading to 1.17.1 with the same configuration, I can see
only the Flink metrics on port 9999 of the task managers;
the custom metrics are getting lost somewhere.
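
One way to pin down exactly which metrics disappeared is to save the scrape output from a task manager on each version and diff the metric names. The following is only a minimal sketch: it assumes port 9999 serves the standard Prometheus text exposition format, and the function names and the example URL are hypothetical, not part of Flink.

```python
import urllib.request


def metric_names(exposition_text):
    """Return the set of metric names in Prometheus text-format output."""
    names = set()
    for line in exposition_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and "# HELP" / "# TYPE" comments
        # A sample line looks like `name{labels} value` or `name value`.
        name = line.split("{", 1)[0].split(None, 1)[0]
        names.add(name)
    return names


def missing_metrics(old_text, new_text):
    """Names present in the old scrape but absent from the new one, sorted."""
    return sorted(metric_names(old_text) - metric_names(new_text))


def scrape(url):
    """Fetch the raw exposition text from a reporter endpoint (assumed URL)."""
    with urllib.request.urlopen(url, timeout=10) as resp:
        return resp.read().decode("utf-8")
```

For example, with a 1.16.2 scrape saved to a file, something like `missing_metrics(open("scrape-1.16.2.txt").read(), scrape("http://taskmanager-host:9999/"))` would list the names that are no longer exported (the file name and host are placeholders).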

The release notes for 1.17 mention
https://issues.apache.org/jira/browse/FLINK-24235,
which removes instantiating reporters by name and requires using a
factory, something I was already doing in 1.16.2. Do I need to do
anything extra after those changes so that my metrics are aggregated
with the Flink ones?

I am also seeing this error message on application startup (which I
was already seeing in 1.16.2): "Multiple implementations of the same
reporter were found in 'lib' and/or 'plugins' directories for
org.apache.flink.metrics.prometheus.PrometheusReporterFactory. It is
recommended to remove redundant reporter JARs to resolve used
versions' ambiguity." Could that also explain the missing metrics?
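
To see where the duplicate reporter comes from, one could list every JAR under 'lib' and 'plugins' that registers the factory named in the message. This is a rough sketch, not a Flink tool: it assumes reporter factories are discovered through a Java service file named after org.apache.flink.metrics.reporter.MetricReporterFactory, and the function name and the Flink home path are hypothetical.

```python
import zipfile
from pathlib import Path

# Assumption: reporter factories are registered through this service file.
SERVICE_FILE = (
    "META-INF/services/org.apache.flink.metrics.reporter.MetricReporterFactory"
)
FACTORY = "org.apache.flink.metrics.prometheus.PrometheusReporterFactory"


def jars_providing_factory(flink_home, factory=FACTORY):
    """Return the JARs under lib/ and plugins/ that register `factory`."""
    hits = []
    for sub in ("lib", "plugins"):
        root = Path(flink_home) / sub
        if not root.is_dir():
            continue
        for jar in sorted(root.rglob("*.jar")):
            try:
                with zipfile.ZipFile(jar) as zf:
                    if SERVICE_FILE not in zf.namelist():
                        continue
                    entries = zf.read(SERVICE_FILE).decode("utf-8").split()
            except zipfile.BadZipFile:
                continue  # not a readable archive; ignore it
            if factory in entries:
                hits.append(jar)
    return hits
```

Calling `jars_providing_factory("/opt/flink")` (the path is a placeholder) and getting more than one result would point at the redundant JARs the message refers to.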

Thanks,

Javier Vegas
