Help understanding what happened

2016-03-20 Thread Scott Reynolds
In a test in our staging environment, we ran kill -9 on the broker. It was restarted by runit and began recovering. We are seeing errors like this: WARN Found an corrupted index file, /mnt/services/kafka/data/TOPIC-17/16763460.index, deleting and rebuilding index... (kafka.log.Log) The f

Re: Help understanding what happened

2016-03-19 Thread Jay Kreps
If you hard kill the broker, then when it restarts it doesn't know the status of its on-disk files. It will need to run through the last log segment to validate the checksums of messages and rebuild the index from it to ensure consistency. (Why does it need to do this validation? Because in the event of
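
To make the recovery pass described above concrete, here is a minimal sketch of the idea: scan a segment from the start, verify each message's checksum, stop at the first record that is incomplete or fails validation, and rebuild the index from the records that passed. This is not Kafka's actual code or on-disk format. The record framing (4-byte length, 4-byte CRC32, payload) and the names append_record, recover_segment and index_interval are all assumptions made up for illustration.

```python
import struct
import zlib

# Hypothetical framing, NOT Kafka's real format:
# 4-byte big-endian payload length, 4-byte CRC32 of the payload, then the payload.
HEADER = struct.Struct(">II")


def append_record(buf: bytearray, payload: bytes) -> None:
    """Append one record to buf using the hypothetical framing above."""
    buf += HEADER.pack(len(payload), zlib.crc32(payload)) + payload


def recover_segment(data: bytes, index_interval: int = 1):
    """Scan a segment, validating each record's checksum.

    Returns (valid_bytes, index): valid_bytes is the length of the prefix
    that passed validation, and index is a list of
    (record_number, byte_position) entries rebuilt from that prefix.
    """
    index = []
    pos = 0
    count = 0
    while pos + HEADER.size <= len(data):
        length, crc = HEADER.unpack_from(data, pos)
        end = pos + HEADER.size + length
        if end > len(data):
            break  # record was only partially written before the kill
        if zlib.crc32(data[pos + HEADER.size:end]) != crc:
            break  # checksum mismatch: treat everything from here on as invalid
        if count % index_interval == 0:
            index.append((count, pos))  # sparse index entry
        pos = end
        count += 1
    return pos, index


# Example: three good records followed by a torn write.
segment = bytearray()
for payload in (b"a", b"bb", b"ccc"):
    append_record(segment, payload)
segment += HEADER.pack(100, 0) + b"xx"  # header claims 100 bytes, only 2 present

valid_bytes, index = recover_segment(bytes(segment))
```

In this sketch the caller would then truncate the segment file to valid_bytes and write out the rebuilt index, which is why a recovering broker can report that it is deleting and rebuilding an index file.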