Hi,

I'm running some tests with Kafka (4-broker setup, version 3.2.0) using
kafka-consumer-perf-test.sh. When I run it several times in a row, e.g.

  kafka-consumer-perf-test.sh --bootstrap-server <BROKER_IP>:9092 \
    --topic test-topic --messages 5000 --show-detailed-stats --print-metrics

it usually works, but sometimes it times out without consuming any messages.
The test-topic topic has three partitions with three replicas each, and as part
of a previous test I ran a partition reassignment on this topic from brokers
0,1,2 to 1,2,3.


The metrics printed at the end include

consumer-coordinator-metrics:failed-rebalance-rate-per-hour:{client-id=perf-consumer-client} : 8635.055
consumer-coordinator-metrics:failed-rebalance-total:{client-id=perf-consumer-client} : 95.000

and when I run it with the log level set to DEBUG, the following lines repeat
in the log (broker IP redacted):

[2022-08-03 17:29:03,517] INFO [Consumer clientId=perf-consumer-client, groupId=perf-consumer-61424] Requesting disconnect from last known coordinator <IP_ADDRESS>:9092 (id: 2147483645 rack: null) (org.apache.kafka.clients.consumer.internals.ConsumerCoordinator)
[2022-08-03 17:29:03,614] INFO [Consumer clientId=perf-consumer-client, groupId=perf-consumer-61424] Discovered group coordinator <IP_ADDRESS>:9092 (id: 2147483645 rack: null) (org.apache.kafka.clients.consumer.internals.ConsumerCoordinator)
[2022-08-03 17:29:03,615] INFO [Consumer clientId=perf-consumer-client, groupId=perf-consumer-61424] (Re-)joining group (org.apache.kafka.clients.consumer.internals.ConsumerCoordinator)
[2022-08-03 17:29:03,617] INFO [Consumer clientId=perf-consumer-client, groupId=perf-consumer-61424] Group coordinator <IP_ADDRESS>:9092 (id: 2147483645 rack: null) is unavailable or invalid due to cause: error response NOT_COORDINATOR.isDisconnected: false. Rediscovery will be attempted. (org.apache.kafka.clients.consumer.internals.ConsumerCoordinator)
[2022-08-03 17:29:03,617] INFO [Consumer clientId=perf-consumer-client, groupId=perf-consumer-61424] Requesting disconnect from last known coordinator <IP_ADDRESS>:9092 (id: 2147483645 rack: null) (org.apache.kafka.clients.consumer.internals.ConsumerCoordinator)
[2022-08-03 17:29:03,617] INFO [Consumer clientId=perf-consumer-client, groupId=perf-consumer-61424] JoinGroup failed: This is not the correct coordinator. Marking coordinator unknown. Sent generation was Generation{generationId=-1, memberId='', protocol='null'} (org.apache.kafka.clients.consumer.internals.ConsumerCoordinator)
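
In case it helps with reading these lines: as far as I understand the Java
client, the coordinator "id" it logs is not a broker id as such. The client
appears to derive it as Integer.MAX_VALUE minus the broker id, so that the
coordinator connection is kept separate from the ordinary connection to the
same broker. A minimal sketch of decoding it under that assumption (the
function name is my own, purely for illustration, and not part of Kafka):

```shell
# Hypothetical helper (not a Kafka tool): recover the broker id from the
# coordinator id printed by the Java consumer, ASSUMING the client computes
# it as Integer.MAX_VALUE - brokerId (Integer.MAX_VALUE = 2147483647).
coordinator_broker_id() {
  echo $(( 2147483647 - $1 ))
}

coordinator_broker_id 2147483645   # prints 2
```

If that reading is right, the consumer is repeatedly being pointed at broker 2
as the group coordinator, and broker 2 then answers NOT_COORDINATOR.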

Am I missing some obvious problem here? Is there something I could look for in 
the broker logs to give me a hint of what is causing this?


Thanks in advance,
Afonso
