Thanks for all the help. I removed the chaos monkey and started a new run. I don't think I will see any errors this model has been running in my production for 2 years. The only difference is that this is using a newer version of ActiveMQ and LevelDB on a Linux cluster.
Granted I have not had to many outages in production and it's a single server. (only due to some REST requirements) :( Short term my goal is to move to a more robust model then over time move to a more full featured ESB (maybe Service Mix or Fuse etc.) Production today: ActiveMQ 5.10 running on Windows Server with KahaDB. New Redhat cluster: ActiveMQ 5.13.1 running zookeeper / leveldb I looked for any DLQ or other error queues and I don't see any messages. I attached my ActiveMQ conf and zookeeper conf for reference. ActiveMQFourm.zip <http://activemq.2283324.n4.nabble.com/file/n4708716/ActiveMQFourm.zip> Thanks -- View this message in context: http://activemq.2283324.n4.nabble.com/Help-with-a-Failover-testing-that-shows-missing-messages-tp4707916p4708716.html Sent from the ActiveMQ - User mailing list archive at Nabble.com.