We had a consumer crash and restart itself several times, after which the
group got stuck in state CompletingRebalance. All of the consumers were
dead (I confirmed that no processes were running on the host shown in the
output of kafka-consumer-groups.sh --describe --group group-name
--members), but the group was still rebalancing. When I tried to start up a
consumer, it just timed out trying to join the group.
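
For reference, the checks described above can be sketched roughly as follows.
The broker address is an assumption (a placeholder, not our real one), and
describe_group is just a hypothetical wrapper name; --state and --members are
view options of kafka-consumer-groups.sh --describe.

```shell
# Assumptions: the broker address is a placeholder for illustration only.
BOOTSTRAP="localhost:9092"
GROUP="trackers-etl"

# Hypothetical helper: run kafka-consumer-groups.sh --describe for the group,
# passing through one extra view flag such as --state or --members.
describe_group() {
  kafka-consumer-groups.sh --bootstrap-server "$BOOTSTRAP" \
    --describe --group "$GROUP" "$@"
}

# Usage (needs a reachable broker and the Kafka CLI tools on PATH):
#   describe_group --state     # coordinator and current group state
#   describe_group --members   # member ids, hosts, and client ids
```

Running the --state view alongside --members is one way to see the group
state and the member list the coordinator still holds side by side.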

I tried to delete the group, but that failed with:

* Group 'trackers-etl' could not be deleted due to:
java.util.concurrent.ExecutionException:
org.apache.kafka.common.errors.GroupNotEmptyException: The group is not
empty.

I ended up restarting the broker that was the leader for that group; that
fixed the issue but is obviously a pretty heavy-weight solution.

There was nothing interesting in the logs on that broker; the last thing it
logged about that group was:

[2020-04-13 21:37:20,308] INFO [GroupCoordinator 6]: Stabilized group
trackers-etl generation 4987 (__consumer_offsets-29)
(kafka.coordinator.group.GroupCoordinator)

Has anyone seen this before? Should I file a JIRA ticket? Is there a
better approach than restarting the broker?
-- 
James Brown
Systems Engineer
