[ https://issues.apache.org/jira/browse/KAFKA-1314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13941968#comment-13941968 ]
alex kamil commented on KAFKA-1314:
-----------------------------------

it seems to be back online now

bin/kafka-list-topic.sh --zookeeper localhost --topic mytopic
topic: mytopic  partition: 0  leader: 1  replicas: 1  isr: 1
topic: mytopic  partition: 1  leader: 2  replicas: 2  isr: 2

previous error in state-change.log

[2014-03-18 16:40:19,242] ERROR Controller 3 epoch 3 initiated state change for partition [mytopic,0] from OfflinePartition to OnlinePartition failed (state.change.logger)
kafka.common.NoReplicaOnlineException: No replica for partition [mytopic,0] is alive. Live brokers are: [Set(3)], Assigned replicas are: [List(1)]
    at kafka.controller.OfflinePartitionLeaderSelector.selectLeader(PartitionLeaderSelector.scala:60)
    at kafka.controller.PartitionStateMachine.electLeaderForPartition(PartitionStateMachine.scala:304)
    at kafka.controller.PartitionStateMachine.kafka$controller$PartitionStateMachine$$handleStateChange(PartitionStateMachine.scala:153)
    at kafka.controller.PartitionStateMachine$$anonfun$triggerOnlinePartitionStateChange$1.apply(PartitionStateMachine.scala:91)
    at kafka.controller.PartitionStateMachine$$anonfun$triggerOnlinePartitionStateChange$1.apply(PartitionStateMachine.scala:89)
    at scala.collection.mutable.HashMap$$anonfun$foreach$1.apply(HashMap.scala:80)
    at scala.collection.mutable.HashMap$$anonfun$foreach$1.apply(HashMap.scala:80)
    at scala.collection.Iterator$class.foreach(Iterator.scala:631)
    at scala.collection.mutable.HashTable$$anon$1.foreach(HashTable.scala:161)
    at scala.collection.mutable.HashTable$class.foreachEntry(HashTable.scala:194)
    at scala.collection.mutable.HashMap.foreachEntry(HashMap.scala:39)
    at scala.collection.mutable.HashMap.foreach(HashMap.scala:80)
    at kafka.controller.PartitionStateMachine.triggerOnlinePartitionStateChange(PartitionStateMachine.scala:89)
    at kafka.controller.PartitionStateMachine.startup(PartitionStateMachine.scala:64)
    at kafka.controller.KafkaController.onControllerFailover(KafkaController.scala:242)
    at kafka.controller.KafkaController$$anonfun$1.apply$mcV$sp(KafkaController.scala:111)
    at kafka.server.ZookeeperLeaderElector.elect(ZookeeperLeaderElector.scala:62)
    at kafka.server.ZookeeperLeaderElector.startup(ZookeeperLeaderElector.scala:46)
    at kafka.controller.KafkaController.startup(KafkaController.scala:470)
    at kafka.server.KafkaServer.startup(KafkaServer.scala:96)
    at kafka.server.KafkaServerStartable.startup(KafkaServerStartable.scala:34)
    at kafka.Kafka$.main(Kafka.scala:46)
    at kafka.Kafka.main(Kafka.scala)

> recurring errors
> ----------------
>
>                 Key: KAFKA-1314
>                 URL: https://issues.apache.org/jira/browse/KAFKA-1314
>             Project: Kafka
>          Issue Type: Bug
>    Affects Versions: 0.8.0
>            Reporter: alex kamil
>            Priority: Critical
>
> we're getting hundreds of these errs with kafka 0.8 and topics become unavailable after running for a few days
>
> kafka error 1 (had to recreate the topic as it became unavailable after a few days)
> [2014-03-18 16:34:56,403] ERROR [KafkaApi-3] Error while fetchingmetadata for partition [mytopic,0] (kafka.server.KafkaApis)
> kafka.common.LeaderNotAvailableException: Leader not available for partition [mytopic,0]
>
> kafka error 2
> [2014-03-17 12:23:27,536] ERROR Closing socket for /<kafka consumer ip address> because of error (kafka.network.Processor)
> kafka.common.KafkaException: Wrong request type 768
>
> kafka error 3
> ERROR Closing socket for /<kafka consumer ip address> because of error (kafka.network.Processor)
> java.io.IOException: Connection reset by peer
>
> kafka error 4
> ERROR Closing socket for /<kafka broker ip address> because of error (kafka.network.Processor)
>
> zookeeper error
> 2014-03-18 16:40:02,794 [myid:3] - WARN [QuorumPeer[myid=3]/0.0.0.0:2181:QuorumCnxManager@368] - Cannot open channel to 1 at election address /<kafka broker ip address>:3888
> java.net.ConnectException: Connection refused

--
This message was sent by Atlassian JIRA
(v6.2#6252)