Hi guys: we have used kafka-0.8.1 under three machines (broker 0,broker 1,broker 3). After running several months, some partions is out of sync; how to recover from this situation ?
topic desc: Topic:analyze PartitionCount:8 ReplicationFactor:3 Configs: Topic: analyze Partition: 0 Leader: 1 Replicas: 1,2,0 Isr: 1,2 Topic: analyze Partition: 1 Leader: 2 Replicas: 2,0,1 Isr: 2,1 Topic: analyze Partition: 2 Leader: 0 Replicas: 0,1,2 Isr: 1,2,0 Topic: analyze Partition: 3 Leader: 1 Replicas: 1,0,2 Isr: 1,2 Topic: analyze Partition: 4 Leader: 2 Replicas: 2,1,0 Isr: 2,1 Topic: analyze Partition: 5 Leader: 0 Replicas: 0,2,1 Isr: 2,1,0 Topic: analyze Partition: 6 Leader: 1 Replicas: 1,2,0 Isr: 1,2 Topic: analyze Partition: 7 Leader: 2 Replicas: 2,0,1 Isr: 2,1 zk info: [zk: localhost:2181(CONNECTED) 2] get /brokers/topics/analyze/partitions/0/state {"controller_epoch":1,"leader":1,"version":1,"leader_epoch":1,"isr":[1,2]} Here is the related log in borker 0 state-change.log ./state-change.log.2015-08-01:[2015-08-01 06:01:51,449] WARN Broker 0 received invalid LeaderAndIsr request with correlation id 5 from controller 2 epoch 2 with an older leader epoch 0 for partition [analyze,0], current leader epoch is 0 (state.change.logger) controller.log ./controller.log:[2015-08-01 06:01:51,395] WARN [Channel manager on controller 0]: Not sending request Name: StopReplicaRequest; Version: 0; CorrelationId: 39; ClientId: ; DeletePartitions: false; ControllerId: 0; ControllerEpoch: 1; Partitions: [analyze,0] to broker 0, since it is offline. (kafka.controller.ControllerChannelManager) Thanks and regards