Re: [Pacemaker] [Openais] very slow pacemaker/corosync shutdown

Lists Thu, 19 Sep 2013 17:02:13 -0700

On 09/18/2013 06:49 PM, Andrew Beekhof wrote:

On 19/09/2013, at 8:25 AM, David Lang <da...@lang.hm> wrote:

What's the best way to see what it's getting stuck doing?

Log files.

Is there a good way to tell if this is a pacemaker or corosync problem (so I 
can drop one of the lists from the thread)?

Not without further information

We've had the same problem here, trying to get HA dns/named serviceworking. Works great for a day or so, then seizes up, simple commandslike `crm_standby -v true` timeout after 120 seconds, etc. We're testingfor release, and keep running into issues like this. At first wesuspected firewall issues, but even after confirmed operation andseveral hand-offs of HA services back and forth, it still dies within aday or so.

We're on CentOS 6/64 with yum packages augmented fromhttp://download.opensuse.org/repositories/network:/ha-clustering:/Stable/RedHat_RHEL-6/

with exclude=pacemaker* corosync*

In order to make the log files visible, I've snipped out a time periodduring which it becomes unresponsive visible athttp://hal.schoolpathways.com/details/

I don't know the exact moment, this is a test cluster and not beingmonitored by a netmon. Any other details I could provide that would beuseful/helpful?




_______________________________________________
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org

Re: [Pacemaker] [Openais] very slow pacemaker/corosync shutdown

Reply via email to