I am encountering some issues with master/slave failover with ActiveMQ 5.8.0 running on RHEL and would appreciate some recommendations/help from the community if possible. I have tried searching but nothing seems to resolve my issue. Using the ActiveMQ example apps (per http://activemq.apache.org/examples.html) I setup the consumer and producer using the failover transport and then walked through a failure on master. Steps to reproduce, along with the commands are included below.
The consumer starts eating the messages that the producer sends to my TEST.FOO queue. Next, when I kill the ActiveMQ master broker, I see that the the producer reconnects to the slave/new-master and will continue sending messages where it left off. However, the consumer re-connects but then fails to receive any messages from the slave/new-master after failover. I see that the TEST.FOO queue was (re)created on the slave/new-master after the failover takes place. The consumer reports the following; [java] 00:00:35 WARN Transport (tcp://master_IP:61616) failed, reason: java.net.SocketException: Connection reset, attempting to automatically reconnect [java] 00:00:39 INFO Successfully reconnected to tcp://slave:61616 Steps to reproduce: 1. Start ActiveMQ broker master and slave 2. Start ActiveMQ producer and consumer ( parenthesis and ampersands are escaped for good measure, otherwise BASH complains as expected ) ant producer -Durl=failover://\(tcp://master:61616,tcp://slave:61616\)?randomize=false\&timeout=3000 -Dtopic=false -DtimeToLive=30000 -Ddurable=false -Dmax=80000 ant consumer -Durl=failover://\(tcp://master:61616,tcp://slave:61616\)?randomize=false\&timeout=3000 -Dtopic=false -Ddurable=false -Dmax=80000 3. Kill ActiveMQ master and observe ActiveMQ slave takeover. Look for string "Becoming the master on dataSource: org.apache.commons.dbcp.BasicDataSource@2482c23d" in ActiveMQ logs located on the slave. 4. Observe timeout/reconnect operations in both the consumer and producer that was started in step #2. The producer will reconnect Has anyone encountered this issue before with a two-node master/slave implementation of ActiveMQ? If so, what was the resolution? Best regards, -David Laube -- View this message in context: http://activemq.2283324.n4.nabble.com/Cannot-consume-messages-on-queue-after-master-slave-failover-in-ActiveMQ-5-8-0-tp4672708.html Sent from the ActiveMQ - User mailing list archive at Nabble.com.