[ https://issues.apache.org/jira/browse/KAFKA-3879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15340283#comment-15340283 ]
Ashish K Singh commented on KAFKA-3879: --------------------------------------- [~hachikuji] did you guys ever hit this? I am thinking of adding timeout to {{ConsumerCoordinator.maybeAutoCommitOffsetsSync}}. Wondering if we need a mechanism to make a retriable error, non-retriable after certain number of failures. > KafkaConsumer with auto commit enabled gets stuck when killed after broker is > dead > ---------------------------------------------------------------------------------- > > Key: KAFKA-3879 > URL: https://issues.apache.org/jira/browse/KAFKA-3879 > Project: Kafka > Issue Type: Bug > Affects Versions: 0.10.0.0 > Reporter: Ashish K Singh > Assignee: Ashish K Singh > Fix For: 0.10.0.1 > > > KafkaConsumer with auto commit enabled gets stuck when killed after broker is > dead. > * KafkaConsumer on close tries to close coordinator. > * Coordinator, if auto commit is enabled, tries to commit offsets > synchronously before closing. > * While trying to synchronously commit offsets, coordinator checks if > coordinator is alive by sending {{GroupCoordinatorRequest}}. As brokers are > dead, this returns {{NoAvailableBrokersException}}, which is a retriable > exception. > * Coordinator ready check enters into an infinite loop as it keeps retrying > to discover group coordinator. -- This message was sent by Atlassian JIRA (v6.3.4#6332)