Ok,
So for now the fix seems to be to remove the latest version of corosync:
1.2.2-1.1 and install the older rpms 1.2.1-1
Here is what I did:
[r...@phys-ha01 corosync]# rpm -e --nodeps corosynclib corosync
[r...@phys-ha01 corosync]# rpm -ivh
http://www.clusterlabs.org/rpm/epel-5/x86_64/corosync-1.2.1-1.el5.x86_64.rpm
http://www.clusterlabs.org/rpm/epel-5/x86_64/corosynclib-1.2.1-1.el5.x86_64.rpm
[r...@phys-ha01 corosync]# /etc/init.d/corosync start
Starting Corosync Cluster Engine (corosync): [ OK ]
[r...@phys-ha01 corosync]# crm status
============
Last updated: Thu May 27 11:46:53 2010
Stack: openais
Current DC: NONE
2 Nodes configured, 2 expected votes
3 Resources configured.
============
OFFLINE: [ phys-ha02.physics.gatech.edu phys-ha01.physics.gatech.edu ]
Diego
Steven Dake wrote:
This is a known issue on some platforms, although the exact cause is
unknown. I have tried RHEL 5.5 as well as CentOS 5.5 with clusterrepo
rpms and been unable to reproduce. I'll keep looking.
Regards
-steve
On 05/27/2010 06:07 AM, Diego Remolina wrote:
Hi,
I was running the old rpms from the opensuse repo and wanted to change
over to the latest packages from the clusterlabs repo in my RHEL 5.5
machines.
Steps I took
1. Disabled the old repo
2. Set the nodes to standby (two node drbd cluster) and turned of openais
3. Enabled the new repo.
4. Performed an update with yum -y update which replaced all packages.
5. The configuration file for ais was renamed openais.conf.rpmsave
6. I ran corosync-keygen and copied the key to the second machine
7. I copied the file openais.conf.rpmsave to /etc/corosync/corosync.conf
and modified it by removing the service section and moving that to
/etc/corosync/service.d/pcmk
8. I copied the configurations to the other machine.
9. When I try to start either openais or corosync with the init scripts
I get a failure and nothing that can really point me to an error in the
logs.
Updated packages:
May 26 14:29:32 Updated: cluster-glue-libs-1.0.5-1.el5.x86_64
May 26 14:29:32 Updated: resource-agents-1.0.3-2.el5.x86_64
May 26 14:29:34 Updated: cluster-glue-1.0.5-1.el5.x86_64
May 26 14:29:34 Installed: libibverbs-1.1.3-2.el5.x86_64
May 26 14:29:34 Installed: corosync-1.2.2-1.1.el5.x86_64
May 26 14:29:34 Installed: librdmacm-1.0.10-1.el5.x86_64
May 26 14:29:34 Installed: corosynclib-1.2.2-1.1.el5.x86_64
May 26 14:29:34 Installed: openaislib-1.1.0-2.el5.x86_64
May 26 14:29:34 Updated: openais-1.1.0-2.el5.x86_64
May 26 14:29:34 Installed: libnes-0.9.0-2.el5.x86_64
May 26 14:29:35 Installed: heartbeat-libs-3.0.3-2.el5.x86_64
May 26 14:29:35 Updated: pacemaker-libs-1.0.8-6.1.el5.x86_64
May 26 14:29:36 Updated: heartbeat-3.0.3-2.el5.x86_64
May 26 14:29:36 Updated: pacemaker-1.0.8-6.1.el5.x86_64
Apparently corosync is sec faulting when run from the command line:
# /usr/sbin/corosync -f
Segmentation fault
Any help would be greatly appreciated.
Diego
_______________________________________________
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker
Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
--
Diego Julian Remolina
System Administrator - Systems Support Specialist IV
School of Physics
Georgia Institute of Technology
Phone: (404) 385-3499
_______________________________________________
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker
Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf