Re: [Pacemaker] Problem with CRMD restart

2010-11-21 Thread JiaQiang Xu
2010/11/22 Andrew Beekhof:
>>> No, there is nothing else that needs to be done.
>>> Other parts of pacemaker look at that peer data and will shoot the
>>> node if necessary.
>>>
>>
>> Which part of pacemaker will shoot the node on which crmd has been down?
>
> the policy engine will order it and s

Re: [Pacemaker] Problem with CRMD restart

2010-11-21 Thread Andrew Beekhof
On Sun, Nov 21, 2010 at 3:03 PM, JiaQiang Xu wrote:
> 2010/11/20 Andrew Beekhof:
>>> I read through the code and found that in this situation the ais
>>> plugin will send out a notification message to other nodes in the
>>> cluster. But now the nodes won't do anything more than updating p

Re: [Pacemaker] Problem with CRMD restart

2010-11-21 Thread JiaQiang Xu
2010/11/20 Andrew Beekhof:
>> I read through the code and found that in this situation the ais
>> plugin will send out a notification message to other nodes in the
>> cluster. But now the nodes won't do anything more than updating peer
>> information upon receiving this notification.
>>
>> I

Re: [Pacemaker] Problem with CRMD restart

2010-11-20 Thread Andrew Beekhof
On Fri, Nov 19, 2010 at 10:11 AM, JiaQiang Xu wrote:
> Hi,
>
> I'm using pacemaker 1.0.9 and corosync 1.2.7.
> Recently I found a problem with CRMD restart.
>
> If CRMD crashes or is manually killed, for now corosync will try to restart it
> up to 100 times (done in lib/ais/plugin.c). But what if