Hi everyone,

We have a Kafka cluster with 5 brokers with all topics having at least 2
replication factor. We have multiple Kafka consumers applications running
on this cluster. Most of these consumers are build using consumer APIs and
quite recently we have started using Stream applications.

We are facing a really weird issue. Just Sometimes it happens that our
Kafka cluster breaks down, By breaking down I mean that consumers and
producers start throwing disconnection exception and all of them just stop.

We use debezium connector to push Postgres events to Kafka topics. Debezium
throws the error below:
[image: image.png]


Kafka broker throws the error below:
COORDINATOR_NOT_AVAILABLE
[image: image.png]


Error on the consumer side :

[image: image.png]


In order to fix, I stop the disconnected broker and everything fixes
itself. Debezium starts flushing messages and all consumers start working
normally.  I bring the disconnected broker up and everything works as
before without any problem.

I don't understand a few things here :


   1. what could be the reason behind this disconnection exception. Even if
   one of the broker was somehow disconnected, Isn't kafka suppose to handle
   it in a cluster where all topics have a replication factor of 2.
   2. It appears that the malfunctioning broker was in a state where it was
   neither disconnected nor connected to the cluster. I could still see the
   broker visible in Kafka manager with zero bytes In, while it was
   disconnected from all the producers and consumers.
   3. Weirdly, I have noticed that this situation usually occurs when I
   start the multiple consumers of the stream application. Not sure about this
   as this error has only occurred a few times. It happened twice today and
   both the times I started 3 consumers of the same stream application.


Can anyone help me debug this problem. I don't know where to look for
possible issues with our cluster or stream application. I am attaching
streams config and stream application code for your reference.
Please feel free to ask for any more details.


Stream config :
[image: image.png]


Stream application code : https://codeshare.io/Gq6pLB

-- 
Thanks,

Ankur Rana
Software Developer
FarEye

Reply via email to