[ 
https://issues.apache.org/jira/browse/KAFKA-6588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16377364#comment-16377364
 ] 

Navina Ramesh commented on KAFKA-6588:
--------------------------------------

[~wushujames] `time-since-last-run-ms` is a useful metric. However, it does not 
directly indicate the health of the cleaner thread. In the past, we relied on 
`max-dirty-ratio` for setting up alerts. The problem with both these metrics is 
that it is hard to tune the alerting thresholds, esp. when there are topics 
with varying loads. Eventually, these just became noise more than alert. Hence, 
we decided to add a more direct metric for monitoring the log cleaner thread, 
instead of relying on indirect metrics.

> Add a metric to monitor live log cleaner thread
> -----------------------------------------------
>
>                 Key: KAFKA-6588
>                 URL: https://issues.apache.org/jira/browse/KAFKA-6588
>             Project: Kafka
>          Issue Type: Bug
>            Reporter: Navina Ramesh
>            Assignee: Navina Ramesh
>            Priority: Minor
>
> We want to have a more direct metric to monitor the log cleaner thread. 
> Hence, adding a simple metric in `LogCleaner.scala`. 
> Additionally, making a minor change to make sure the correct offsets are 
> logged in `LogCleaner#recordStats` 
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to