Farhan Khan created HDFS-15336:
----------------------------------
Summary: GC metrics are not working with Telegraf's prometheus
Key: HDFS-15336
URL: https://issues.apache.org/jira/browse/HDFS-15336
Project: Hadoop HDFS
Issue Type: Bug
Affects Versions: 3.2.1
Reporter: Farhan Khan
The GC related metric particularly GCTimeMillis is reported as counter via
StatsD. Due to this when we use Telegraf's Prometheus to observe the metric it
shows unexpected behavior. Telegraf's default behavior is to increment the
counter.
https://github.com/apache/hadoop/blob/a0da1ec01051108b77f86799dd5e97563b2a3962/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/metrics2/source/JvmMetrics.java#L128
To see the discrepancy in the metrics you could compare the logs reported by
JVM with GClog enabled and the metrics at Prometheus interface.
To fix this either it should be changed to gauge or it should support
Prometheus format.
As far as I know, HDFS does not support Prometheus yet.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]