Re: [Pacemaker] Reason for cluster resource migration

2013-02-15 Thread Andrew Martin
- Original Message - > From: "Ante Karamatić" > To: pacemaker@oss.clusterlabs.org > Sent: Thursday, February 14, 2013 3:57:38 AM > Subject: Re: [Pacemaker] Reason for cluster resource migration > > On 13.02.2013 16:27, Andrew Martin wrote:: > > > Unf

Re: [Pacemaker] Reason for cluster resource migration

2013-02-14 Thread Ante Karamatić
On 13.02.2013 16:27, Andrew Martin wrote:: > Unfortunately the pacemaker and corosync packages in the Ubuntu > repositories are too old. Due to bugs in these versions, I > upgraded to the latest Pacemaker 1.1.8 and Corosync 2.1.0 (it was > the latest at that time). We tend to backport security

Re: [Pacemaker] Reason for cluster resource migration

2013-02-13 Thread Andrew Beekhof
On Thu, Feb 14, 2013 at 4:28 AM, Andrew Martin wrote: > - Original Message - >> From: "Andrew Beekhof" >> To: "The Pacemaker cluster resource manager" >> Sent: Tuesday, February 12, 2013 10:52:23 PM >> Subject: Re: [Pacemaker] Reason for

Re: [Pacemaker] Reason for cluster resource migration

2013-02-13 Thread Andrew Martin
- Original Message - > From: "Andrew Beekhof" > To: "The Pacemaker cluster resource manager" > Sent: Tuesday, February 12, 2013 10:52:23 PM > Subject: Re: [Pacemaker] Reason for cluster resource migration > > On Wed, Feb 13, 2013 at 2:04 AM, Andr

Re: [Pacemaker] Reason for cluster resource migration

2013-02-13 Thread Andrew Martin
- Original Message - > From: "Ante Karamatic" > To: pacemaker@oss.clusterlabs.org > Sent: Wednesday, February 13, 2013 1:53:34 AM > Subject: Re: [Pacemaker] Reason for cluster resource migration > > On 13.02.2013 05:57, Andrew Beekhof wrote:: > >

Re: [Pacemaker] Reason for cluster resource migration

2013-02-12 Thread Ante Karamatic
On 13.02.2013 05:57, Andrew Beekhof wrote:: > This link has some useful info: > > https://wiki.ubuntu.com/DebuggingProgramCrash#Debug_Symbol_Packages For corosync, we build -dbg package, so one can just install corosync-dbg. Since this is a pacemaker related problem, one should add ddebs archive

Re: [Pacemaker] Reason for cluster resource migration

2013-02-12 Thread Andrew Beekhof
ebruary 11, 2013 10:11:53 PM >>> Subject: Re: [Pacemaker] Reason for cluster resource migration >>> >>> On Tue, Feb 12, 2013 at 3:07 PM, Andrew Beekhof >>> wrote: >>> > On Tue, Feb 12, 2013 at 3:01 PM, Andrew Beekhof >>> > wrote: >>

Re: [Pacemaker] Reason for cluster resource migration

2013-02-12 Thread Andrew Beekhof
On Wed, Feb 13, 2013 at 2:04 AM, Andrew Martin wrote: > - Original Message - >> From: "Andrew Beekhof" >> To: "The Pacemaker cluster resource manager" >> Sent: Monday, February 11, 2013 10:11:53 PM >> Subject: Re: [Pacemaker] Reason for

Re: [Pacemaker] Reason for cluster resource migration

2013-02-12 Thread Andrew Beekhof
On Tue, Feb 12, 2013 at 5:28 PM, Vladislav Bogdanov wrote: > 12.02.2013 07:11, Andrew Beekhof wrote: >> On Tue, Feb 12, 2013 at 3:07 PM, Andrew Beekhof wrote: > [...] >>> So we'll still need the crm_report, it will have more detail on the >>> "Child process pengine terminated with signal 6 (pid=1

Re: [Pacemaker] Reason for cluster resource migration

2013-02-12 Thread Andrew Martin
- Original Message - > From: "Andrew Beekhof" > To: "The Pacemaker cluster resource manager" > Sent: Monday, February 11, 2013 10:11:53 PM > Subject: Re: [Pacemaker] Reason for cluster resource migration > > On Tue, Feb 12, 2013 at 3:07 PM, Andre

Re: [Pacemaker] Reason for cluster resource migration

2013-02-11 Thread Vladislav Bogdanov
12.02.2013 07:11, Andrew Beekhof wrote: > On Tue, Feb 12, 2013 at 3:07 PM, Andrew Beekhof wrote: [...] >> So we'll still need the crm_report, it will have more detail on the >> "Child process pengine terminated with signal 6 (pid=19357, core=128)" >> part. > > Signal 6 is an assertion failure, bu

Re: [Pacemaker] Reason for cluster resource migration

2013-02-11 Thread Andrew Beekhof
the crash. > > So we'll still need the crm_report, it will have more detail on the > "Child process pengine terminated with signal 6 (pid=19357, core=128)" > part. Signal 6 is an assertion failure, but strangely there is no mention of one in syslog. Can you grep /var/log

Re: [Pacemaker] Reason for cluster resource migration

2013-02-11 Thread Andrew Beekhof
ave more detail on the "Child process pengine terminated with signal 6 (pid=19357, core=128)" part. The core file will likely be somewhere under /var/lib/pacemaker/cores but crm_report should be able to find it. > >> >> Thanks, >> >> Andrew >> >> &

Re: [Pacemaker] Reason for cluster resource migration

2013-02-11 Thread Andrew Beekhof
Also very important is the contents of: /var/lib/pacemaker/pengine/pe-core-c9aef461-386c-4e4f-b509-0c9c8d80409b.bz2 > > Thanks, > > Andrew > > > > > - Original Message ----- >> From: "Andrew Martin" >> To: "The Pacemaker cluster resource manager" >&

Re: [Pacemaker] Reason for cluster resource migration

2013-02-11 Thread Andrew Martin
aker cluster resource manager" > Sent: Friday, February 1, 2013 4:32:26 PM > Subject: Re: [Pacemaker] Reason for cluster resource migration > > - Original Message - > > From: "Andrew Beekhof" > > To: "The Pacemaker cluster resource manager" &g

Re: [Pacemaker] Reason for cluster resource migration

2013-02-01 Thread Andrew Martin
- Original Message - > From: "Andrew Beekhof" > To: "The Pacemaker cluster resource manager" > Sent: Thursday, December 6, 2012 8:36:27 PM > Subject: Re: [Pacemaker] Reason for cluster resource migration > > On Wed, Dec 5, 2012 at 8:29 AM, Andrew Ma

Re: [Pacemaker] Reason for cluster resource migration

2012-12-06 Thread Andrew Beekhof
On Wed, Dec 5, 2012 at 8:29 AM, Andrew Martin wrote: > Hello, > > I am running a 3-node Pacemaker cluster (2 "real" nodes and 1 quorum node in > standby) on Ubuntu 12.04 server (amd64) with Pacemaker 1.1.8 and Corosync > 2.1.0. My cluster configuration is: > http://pastebin.com/6TPkWtbt > > Recent