Hi,
In addition to the above suggestion, I think you can also check the broker
logs clearly. You should see the reason. For example, here's what I saw in
my logs:

[2021-02-26 16:38:45,777] INFO [GroupCoordinator 0]: Member
consumer-groupId-1-7e24136f-8d7e-4303-a150-32dae06c18c0 in group groupId
has failed, removing it from the group
(kafka.coordinator.group.GroupCoordinator)
[2021-02-26 16:38:45,779] INFO [GroupCoordinator 0]: Preparing to rebalance
group groupId in state PreparingRebalance with old generation 2
(__consumer_offsets-18) (*reason: removing member
consumer-groupId-1-7e24136f-8d7e-4303-a150-32dae06c18c0 on heartbeat
expiration*) (kafka.coordinator.group.GroupCoordinator)

So, you should know that it's because heartbeat expired in my case.

Thanks.
Luke

On Mon, Jul 5, 2021 at 9:56 AM Shilin Wu <s...@confluent.io.invalid> wrote:

> There might be many causes, but from broker's point of view: This consumer
> is dead.
>
> Consumers are considered dead when one of the following happended:
>
> 1. didn't send out heartbeat for session.timeout.ms(defaults to 10
> seconds) period of time. (Consumers sent heart beat ever
> heartbeat.interval.ms). Did you configure this reasonably? Note that
> heartbeat runs in a background thread, so your app logic should not affect
> it.
>
> 2. didn't poll for max.poll.interval.ms (defaults to 5 minutes) period of
> time. Sometimes although the consumer is sending heartbeat (by the bg
> thread), the main thread may be dead. Is your application polled too much
> that couldn't be processed in 5 minutes so next pool is delayed by more
> than 5 minutes?
>
>
>
> [image: Confluent] <https://www.confluent.io>
> Wu Shilin
> Solution Architect
> +6581007012
> Follow us: [image: Blog]
> <
> https://www.confluent.io/blog?utm_source=footer&utm_medium=email&utm_campaign=ch.email-signature_type.community_content.blog
> >[image:
> Twitter] <https://twitter.com/ConfluentInc>[image: LinkedIn]
> <https://www.linkedin.com/company/confluent/>[image: Slack]
> <https://slackpass.io/confluentcommunity>[image: YouTube]
> <https://youtube.com/confluent>
> [image: Kafka Summit] <https://www.kafka-summit.org/>
>
>
> On Sun, Jul 4, 2021 at 6:38 PM sunil chaudhari <
> sunilmchaudhar...@gmail.com>
> wrote:
>
> > Hi,
> > There is something called as heartbet consumer thread.
> > This threads running on consumer keeps sending heartbeats at regular
> > interval as per the setting heartbeat.interval.ms. It keeps on telling
> > broker that I am very much alive.
> >
> >
> >
> https://docs.confluent.io/platform/current/installation/configuration/consumer-configs.html#consumerconfigs_heartbeat.interval.ms
> >
> > There is another setting session.timeout.ms. If the time duration
> between
> > 2
> > heartbeats exceeds the time limit set in this setting then broker removes
> > the client(consumer) from the group.
> >
> > Please read above settings in documentation.
> >
> > I hope you can co-relate this with your problem.
> >
> > Regards,
> > Sunil.
> >
> >
> >
> >
> > On Sun, 4 Jul 2021 at 1:36 PM, Kafka Life <lifekafka...@gmail.com>
> wrote:
> >
> > > Dear kafka Experts
> > >
> > > Could one of you please help to explain what this below log in broker
> > > instance mean..and what scenarios it would occur when there is no
> change
> > > done .
> > >
> > >  INFO [GroupCoordinator 9610]: Member
> > >
> > >
> >
> webhooks-retry-app-840d3107-833f-4908-90bc-ea8c394c07c3-StreamThread-2-consumer-f87c3b85-5aa1-40f5-a42f-58927421b89e
> > > in group webhooks-retry-app has failed, removing it from the group
> > > (kafka.coordinator.group.GroupCoordinator)
> > >
> > >
> > >  INFO [GroupCoordinator 9611]: Member
> > > cm.consumer.9-d65d39d3-703f-408b-bf4b-fbf087321d8c in group
> > > cm_group_apac_sy_cu_01 has failed, removing it from the group
> > > (kafka.coordinator.group.GroupCoordinator)
> > >
> > >
> > > Please help to explain .
> > >
> > > thanks
> > >
> >
>

Reply via email to