Hey Nitay,
In terms of RocksDB metrics, 2.5.1 should have a number of debug-level
metrics that could shed some light on the situation. In particular, I'd
recommend looking at WRITE_STALL_DURATION_AVG / WRITE_STALL_DURATION_TOTAL,
as well as some of the compaction metrics such as COMPACTION_TIME_MAX,
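
Since these metrics are recorded only at debug level, they need to be switched on in the Streams configuration first. A minimal sketch of how that might look, assuming the standard `metrics.recording.level` setting (default INFO); the class name, application id, and broker address below are placeholders for illustration, not taken from this thread:

```java
import java.util.Properties;

public class DebugMetricsConfig {

    // Builds a Properties object for a Kafka Streams application with
    // debug-level metrics recording turned on.
    public static Properties build(String applicationId, String bootstrapServers) {
        Properties props = new Properties();
        props.setProperty("application.id", applicationId);
        props.setProperty("bootstrap.servers", bootstrapServers);
        // The default recording level is INFO; DEBUG additionally records
        // the debug-level metrics, including the RocksDB ones.
        props.setProperty("metrics.recording.level", "DEBUG");
        return props;
    }

    public static void main(String[] args) {
        // Placeholder values; substitute the real application id and brokers.
        Properties props = build("my-streams-app", "localhost:9092");
        System.out.println(props.getProperty("metrics.recording.level"));
    }
}
```

The resulting properties would be passed to the `KafkaStreams` constructor as usual, after which the metrics should show up through JMX or whichever metrics reporter is configured.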
Hey Leah, thanks for the response.
We are running Kafka 2.5.1, and if the topology is still useful after
the next few sentences, I will share it with you (it's messy!).
It happens on a few partitions and a few internal topics, and it seems to be
kind of random which topics and which partitions ex
Hi Nitay,
What version of Kafka are you running? If you could also give the topology
you're using, that would be great. Do you have a sense of whether the lag is
happening on all partitions or just a few? Also, if you're using RocksDB,
there are some RocksDB metrics in newer versions of Kafka that could b