[ https://issues.apache.org/jira/browse/KAFKA-4685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15834209#comment-15834209 ]
Sinóros-Szabó Péter commented on KAFKA-4685: -------------------------------------------- Yes. I was able to check ZK only 1 hour after the issue, at about 8:00. At that time and still now: {code} WatchedEvent state:SyncConnected type:None path:null [zk: localhost:2181(CONNECTED) 0] ls /brokers/ids [] [zk: localhost:2181(CONNECTED) 1] ls /controller [] [zk: localhost:2181(CONNECTED) 2] {code} I have checked the zookeeper logs and those show some error that I do not understand (around 6:51), so it may happen that this is a ZK issue and Kafka node managed to register themself successfully, but ZK forgot some if the data. > All partitions offline, no conroller znode in ZK > ------------------------------------------------ > > Key: KAFKA-4685 > URL: https://issues.apache.org/jira/browse/KAFKA-4685 > Project: Kafka > Issue Type: Bug > Reporter: Sinóros-Szabó Péter > Attachments: kafka-0-logs.zip, kafka-1-logs.zip, kafka-2-logs.zip, > zookeeper-logs.zip > > > Setup: 3 Kafka 0.11.1.1 nodes on kubernetes (in AWS), and another 3 nodes of > Zookeeper 3.5.2-alpha also in kubernetes (in AWS). > At 2017-01-23 06:51 ZK sessions expired. It seems from the logs that kafka-2 > was elected as the new controller, but I am not sure how to read that logs. > I've checked the ZK data and both the /controller is empty and also the > /brokers/ids is empty. Kafka reports that all partitions are offline, > although it seems to be working because messages are coming and going. > We are using an alpha version, I know that it may be a problem, but I suppose > that Kafka should see that there is not any node registered as controller. > I have attached the Kafka and ZK logs -- This message was sent by Atlassian JIRA (v6.3.4#6332)