If ssh fails on the broker machine, I'm not sure why the broker would be in a state where it maintains its registration in ZooKeeper. If the broker loses that registration, it will automatically be marked dead and will not get elected as the leader. Have you seen a case where such a broker became the leader? Could you elaborate on the type of failure you describe?
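To check whether a suspect broker is still registered, something like the following rough sketch might help. It relies on brokers registering ephemeral znodes under /brokers/ids. The helper names are made up for illustration, and the kazoo client library and the host string are assumptions, not something from your setup.

```python
BROKER_IDS_PATH = "/brokers/ids"  # brokers register ephemeral znodes here


def registered_broker_ids(zk):
    """Return the sorted broker ids currently registered in ZooKeeper.

    `zk` is any client object exposing get_children(path), for example
    a started kazoo.client.KazooClient (assumed, not required).
    """
    return sorted(int(child) for child in zk.get_children(BROKER_IDS_PATH))


def is_registered(zk, broker_id):
    """True if the broker's ephemeral znode is still present."""
    return broker_id in registered_broker_ids(zk)


def check_with_kazoo(hosts, broker_id):
    """Hypothetical convenience wrapper; needs the third-party kazoo package."""
    from kazoo.client import KazooClient  # imported lazily so the helpers above stay dependency-free

    zk = KazooClient(hosts=hosts)
    zk.start()
    try:
        return is_registered(zk, broker_id)
    finally:
        zk.stop()
```

For example, check_with_kazoo("zk1:2181", 3) would report whether broker 3 still holds its registration, where "zk1:2181" is a placeholder for your ZooKeeper connect string. If the half-dead broker still shows up there, that would suggest its ZooKeeper session is being kept alive despite the hardware problem.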

On Tue, Jun 24, 2014 at 6:20 PM, Lung, Paul <pl...@ebay.com> wrote:

> Hi All,
>
> Is there any way to blacklist brokers? Sometimes we run into situations
> where there are certain hardware failures on a broker machine, and the
> machine goes into a “half dead” state. The broker process is up and
> participating in the cluster, but it can’t actually transmit messages
> properly. Sometimes, even SSH fails on the machine, and we can’t log in to
> shut down the broker process. The only way to resolve this issue at this
> point is to shut down the machine, which is not always easy in a large data
> center scenario. The consequences are disastrous if this half dead broker
> is elected as the leader. So in this situation, it would be nice to be able
> to quickly blacklist a broker.
>
> Thank you.
> Paul Lung
>