Hi,

We have a 22-node Kafka 3.3.1 cluster on K8s. All data is sent with null 
partitionId and null key from 20 Java producers, so it should be distributed 
evenly across partitions. All was good for days, but a couple hours ago, broker 
21 started receiving about 2x the data of the other brokers for a few topics 
(but not all). These topics are all 1x replicated and the 96 partitions are 
distributed evenly across brokers (each broker has 4 or 5 partitions). This was 
detected in Grafana, but I can also see the offsets increasing much faster for 
the partitions owned by broker 21 in KafkaOffsetsShell. What could cause this? 
I didn't see anything unusual in the broker 21 logs or the controller logs.

Looking back, I noticed that broker 11 also becomes a bit unbalanced each day 
at the time when we are processing the most data, but it is only 10-15% higher 
than the others. All other brokers are quite even, including broker 21 until 
today.

Any ideas on what I can check? Unfortunately we'll probably have to restart 
Kafka and/or the producers pretty soon.

Thanks a lot!
Meg

Reply via email to