Hello,

On 11/23/2011 03:30 PM, Vadim Bulst wrote:
> Hi list,
>
> I try to bring up a 3-node cluster running Ubuntu Oneiric. The packages
> I used are all from the Ubuntu-repo: pacemaker 1.1.5, corosync 1.3.0,
> clvm 2.02.66.
>
> I'm using teamed and bridged interfaces for networking. Every node has
> only one address for cluster management.
> All nodes are connected to a FC-san and see the same volumes. Iptables
> --list shows no rules.
>
> My problem:
>
> clvm resource is not coming up and when I commit some changes on
> resources in most cases one node is dying.

There are some constraints missing in your config: clvmd has to start
after dlm, on the same node, and nothing in the config says so. Or
better, use a cloned group, see below. Also use Corosync 1.4.x with
redundant rings and the automatic ring recovery feature enabled.

>
> crm_mon:
>
> ============
> Last updated: Wed Nov 23 14:47:58 2011
> Stack: openais
> Current DC: bbzclnode07 - partition with quorum
> Version: 1.1.5-01e86afaaa6d4a8c4836f68df80ababd6ca3902f
> 3 Nodes configured, 3 expected votes
> 2 Resources configured.
> ============
>
> Online: [ bbzclnode06 bbzclnode04 bbzclnode07 ]
>
>  Clone Set: dlm-clone [dlm]
>      Started: [ bbzclnode06 bbzclnode07 bbzclnode04 ]
>
> Failed actions:
>     clvm:0_start_0 (node=bbzclnode06, call=31, rc=1, status=complete): unknown error
>     clvm:1_start_0 (node=bbzclnode07, call=5, rc=1, status=complete): unknown error
>     clvm:0_start_0 (node=bbzclnode04, call=29, rc=1, status=complete): unknown error
>
>
> I configured corosync like this:
>
> totem {
>         version: 2
>         token: 3000
>         token_retransmits_before_loss_const: 10
>         join: 60
>         consensus: 3600
>         vsftype: none
>         max_messages: 20
>         clear_node_high_bit: yes
>         secauth: on
>         threads: 8
>         rrp_mode: none
>         interface {
>                 ringnumber: 0
>                 bindnetaddr: 192.168.128.0
>                 mcastaddr: 226.94.1.1
>                 mcastport: 5405
>         }
> }
> amf {
>         mode: disabled
> }
> service {
>         ver: 0
>         name: pacemaker
> }
> aisexec {
>         user: root
>         group: root
> }
> logging {
>         fileline: off
>         to_stderr: yes
>         to_logfile: no
>         to_syslog: yes
>         syslog_facility: daemon
>         debug: off
>         timestamp: on
>         logger_subsys {
>                 subsys: AMF
>                 debug: off
>                 tags: enter|leave|trace1|trace2|trace3|trace4|trace6
>         }
> }
>
> and my cib looks like this:
>
> node bbzclnode04
> node bbzclnode06
> node bbzclnode07
> primitive clvm ocf:lvm2:clvmd \
>         params daemon_timeout="30" \
>         meta target-role="started"
> primitive dlm ocf:pacemaker:controld \
>         meta target-role="started"
> clone clvm-clone clvm \
>         meta clone-max="3" clone-node-max="1"
> clone dlm-clone dlm \
>         meta clone-max="3" clone-node-max="1"

Omit those two clones and use a cloned group instead. The group starts
dlm before clvm and keeps both on the same node. The clone needs its own
id; c_lock below is just an example name:

group g_lock dlm clvm
clone c_lock g_lock \
        meta interleave="true"

> property $id="cib-bootstrap-options" \
>         dc-version="1.1.5-01e86afaaa6d4a8c4836f68df80ababd6ca3902f" \
>         cluster-infrastructure="openais" \
>         expected-quorum-votes="3" \
>         no-quorum-policy="ignore" \
>         stonith-enabled="false" \
>         last-lrm-refresh="1322049979"

Don't forget to set up stonith in a production system when using shared
storage.

Regards,
Andreas

--
Need help with Pacemaker?
http://www.hastexo.com/now

>
> Thanks for your help.
>
> Cheers,
>
> Vadim
>
> _______________________________________________
> Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
>
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs: http://bugs.clusterlabs.org
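
For the redundant ring suggestion, a minimal sketch of what the totem section might look like with two rings. This is an illustration only: ring 0 is taken from the quoted config, while the second network (10.0.0.0), its multicast address and its port are made-up placeholders, and it assumes every node gets a second interface on that network. The remaining totem options from the quoted config would stay as they are.

```
totem {
        version: 2
        secauth: on
        threads: 8
        # "passive" or "active" turns on the redundant ring protocol;
        # "none" (as in the quoted config) allows only a single ring
        rrp_mode: passive
        interface {
                # existing cluster network, copied from the quoted config
                ringnumber: 0
                bindnetaddr: 192.168.128.0
                mcastaddr: 226.94.1.1
                mcastport: 5405
        }
        interface {
                # hypothetical second network, all values are placeholders
                ringnumber: 1
                bindnetaddr: 10.0.0.0
                mcastaddr: 226.94.1.2
                mcastport: 5407
        }
}
```

After restarting corosync, `corosync-cfgtool -s` should list both rings and their status on each node. As far as I know, the 1.4 series re-enables a ring marked faulty on its own once it works again, whereas older releases need a manual `corosync-cfgtool -r`.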