I would like to monitor how far behind our consumer groups are using
wall-clock time in addition to the normal integer offset lag. This way
services that have tight latency SLAs can alert when a consumer falls
behind by N minutes.

Is there a way to do this by querying the cluster/brokers?

It's easy to get the timestamp of the high-water-mark offset, and I can
fetch the consumer's committed offset as an integer, but I can't figure out
how to derive the timestamp that corresponds to that committed offset.
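To make the target metric concrete, here is a minimal sketch of the arithmetic only. It assumes the timestamp for the consumer's offset could somehow be obtained, which is exactly the missing piece. The function names, parameter names, and the use of epoch milliseconds are hypothetical choices for illustration, not part of any Kafka API.

```python
def time_lag_ms(consumer_offset: int, high_water_offset: int,
                consumer_ts_ms: int, high_water_ts_ms: int) -> int:
    """Wall-clock lag between the consumer's position and the high-water mark.

    All inputs are assumed to be supplied by the caller; how to obtain
    consumer_ts_ms from the cluster is the open question.
    """
    # A consumer that has reached the high-water mark has no lag,
    # regardless of what the timestamps say.
    if consumer_offset >= high_water_offset:
        return 0
    # Clamp at zero in case timestamps are not strictly increasing.
    return max(0, high_water_ts_ms - consumer_ts_ms)


def is_behind(lag_ms: int, threshold_minutes: int) -> bool:
    """True when the lag exceeds the alerting threshold of N minutes."""
    return lag_ms > threshold_minutes * 60_000
```

With something like this, a service with a tight SLA could alert on `is_behind(lag, 5)` while still tracking the integer offset lag separately.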

It seems there is no way to fetch the consumer offset directly as a time,
nor to convert the integer offset to a time. Am I missing something, or is
this truly impossible from the broker side?

We could do this by instrumenting all our consumers, but given how our
teams are structured, it'd be much simpler to monitor this by querying the
cluster. For example, if someone spins up a new consumer, we would
immediately have this metric for their service.

Cheers,
Jeff



-- 

*Jeff Widman*
jeffwidman.com <http://www.jeffwidman.com/> | 740-WIDMAN-J (943-6265)
<><
