Hello, I have a working active/active ocfs2+drbd cluster, based on cman stack, with dlm_controld from cman packages. This cluster is working on top of ubuntu 12.04 LTS, which does not provide a dlm_controld with pcmk stack, so I had to add all cman stack.
But I have some problems with fencing, fenced daemon logs errors, and before spending extra time, I thought it would be a good idea to try to run things only in corosync/openais/pacemaker stack, building from sources. Not a good idea by now ;) The fist problem was that cluster removed support for dlm_controld with pacemaker stack. Last version with support was 3.0.17. But this was done some years ago, and as far as I have been able to understand, things are still broken. A brief details about the problems I have found: The most relevant info found about this issue are these threads from Andrew Beekhof and Vladislav Bogdanov, wich suggest to compile dlm_controld from Cluster, applying some patches. They report it worked (whith some remaining issues): http://oss.clusterlabs.org/pipermail/pacemaker/2009-October/003064.html http://www.mail-archive.com/[email protected]/msg09959.html But most recent issue about this is a year ago, and seems that things are still broken. I haven't been able to compile, with lots of errors, so I'm currently asking if this is the right way, becouse seems that nobody else is willing to use this... At cluster page, they state that now DLM code has been separated from cluster: https://fedorahosted.org/cluster/wiki/HomePage But this dlm project (that seems to have pcmk support), depends on corosync 2.0, so it can't run with last pacemaker (1.1.8). (can it?) http://git.fedorahosted.org/git/dlm.git Before spending more time with this, I wanted to ask for the right way to do things. So Questions are: (1) Is by now factible an ocfs2 corosync + pacemaker cluster, without cman, and dlm_controld with pcmk stack? (if yes which repos/versions)? (2) What is the future roadmap about this? Will future corosync2.0 cluster have dlm issues addressed? Also, I have read (also Andrew post) that OCFS2 cluster could have problems on top of corosync 2.0, as OCFS2 has't ben ported (GFS2 was ported). http://www.gossamer-threads.com/lists/linuxha/pacemaker/78538 so: (3) Is GSF2 a better future option in terms of support, for linux-ha clusters? More details about pcmk dlm_controld: I found that Suse have always been mantaining cman-free cluster stack, so I have tried to find dlm in its packages. Found: http://rpmfind.net//linux/RPM/opensuse/factory/x86_64/libdlm-3.00.01-24.5.x86_64.html But also I have had compilation problems (with pacemaker 1.1.7 has a libcib error, but can't compile with 1.1.8 as is not supported) Thanks and Regards, Bernardo -- APSL BERNARDO CABEZAS SERRA RESPONSABLE SISTEMAS Ada Byron, edificio NTIC 2ÂșA 07121 ParcBit Mail: [email protected] Skype: bernat.cabezas Tel: 971439771 _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
