Re: Recovering from Kafka NoReplicaOnlineException with one node

2015-08-10 Thread Mike Thomsen
Thanks, I'll give that a shot. I noticed that our configuration used the default timeouts for session and sync, so I upped those zookeeper configuration settings for kafka as well. On Mon, Aug 10, 2015 at 4:37 PM, Gwen Shapira wrote: > Maybe it is not ZooKeeper itself, but the Broker connection

Re: Recovering from Kafka NoReplicaOnlineException with one node

2015-08-10 Thread Gwen Shapira
Maybe it is not ZooKeeper itself, but the Broker connection to ZK timed-out and caused the controller to believe that the broker is dead and therefore attempted to elect a new leader (which doesn't exist, since you have just one node). Increasing the zookeeper session timeout value may help. Also,

Recovering from Kafka NoReplicaOnlineException with one node

2015-08-10 Thread Mike Thomsen
We have a really simple Kafka set up in our development lab. It's just one node. Periodically, we run into this error: [2015-08-10 13:45:52,405] ERROR Controller 0 epoch 488 initiated state change for partition [test-data,1] from OfflinePartition to OnlinePartition failed (state.change.logger) kaf