Dear All, In our project(as part of kafka failover evaluation), we have a single cluster with five kafka nodes (five partition), three consumers (attached to single group) and single Zookeeper node. As soon as cluster startups, we see leader election per partition and each consumers discovers the group co-ordinator. Now, when manually shutdown kafka service in the co-ordinator node, entire cluster goes down (means no publish/subscribe of messages happening).
As per Kafka wiki co-ordinator design<https://cwiki.apache.org/confluence/display/KAFKA/Kafka+Detailed+Consumer+Coordinator+Design#KafkaDetailedConsumerCoordinatorDesign-10.OnCoordinatorFailover> and client-assignment<https://cwiki.apache.org/confluence/display/KAFKA/Kafka+Client-side+Assignment+Proposal#KafkaClient-sideAssignmentProposal-CoordinatorStateMachine>, we see coordinator failover handling as part of kafka cluster. Kindly let us know anyone in the community has encountered this earlier or any known solution available. Thanks & Regards, Krishnakumar G