> - when I "restart" bookies I issue a kill -9 (I think this could be the > reason why I can't reproduce the issue on testcases) With a clean shutdown of bookies we close the channels, and it should do the tcp shutdown handshake. -9 will kill the process before it gets to do any of that, but the kernel will kill the socket. Pretty sure it doesn't do the fin-ack handshake. I suspect that netty should be throwing up a different exception in this case.
You can't do a kill -9 with unit tests, but with integration tests it's possible. -Ivan