Hi We have a 3 Kafka brokers setup on 0.10.2.1. We have a requirement in our company environment that we have to first stop our 3 Kafka Broker setup, then do some operations stuff that takes about 1 hours, and then bring up Kafka (version 1.1) brokers again.
In order to achieve this, we issue: 1. Run *bin/kafka-server-stop.sh* at the same time on all three brokers. 2. Do operations on our environment for about 1 hour. 3. Run bin/kafka-server.-start.sh at the same time on all three brokers. Upon start, we observe that leadership for lot of partition is messed up. The leadership shows up as -1 for lot of partitions. And ISR has no servers. Because of this our Kafka cluster is unusable, and even restart of brokers doesn't help. 1. Could it be because we are not doing rolling stop ? 2. What's the best way to do rollling stop ? Please advise. Thanks. R