On Wed, Feb 6, 2013 at 11:41 PM, James Guthrie <j...@open.ch> wrote:
> Hi David,
>
> Unfortunately crm_report doesn't work correctly on my hosts as we have 
> compiled from source with custom paths and apparently the crm_report and 
> associated tools are not built to use the paths that can be customised with 
> autoconf.

It certainly tries to:

   https://github.com/beekhof/pacemaker/blob/master/tools/report.common#L99

What does it say on your system (or, what paths did you give to autoconf)?

>
> Despite that, I have done some investigation and think I may have found an 
> inconsistency. I have attached the pacemaker-relevant syslog, including the 
> pe-input files.

Great, I'll take a look now.

> The logfile starts where pacemaker detects that sub-squid is not running on 
> mu. It then fails over to nu, where two further failures take place. In order 
> to recover from these failures, the pengine produces transitions 106, 107, 
> 108 and 109, with the corresponding pe-input files 46, 47, 48 and 49.
>
> The way I understand it, pacemaker works through the transitions until 
> something happens from outside, at which point the transitions are 
> recalculated and pacemaker continues on.
>
> Using crm_simulate to observe the transitions that should happen tells me 
> that the transitions that were calculated from pe-input-49 ought to have 
> resulted in the resources conntrackd, condition, sub-ospfd, sub-ripd and 
> sub-squid being promote to master. In fact, this never happens, but the crmd 
> reports the transition as being complete. It appears as though nowhere is it 
> acknowledged that the current state is not the desired outcome as calculated 
> by the pengine. Is it possible that this is a bug?
>
> Regards,
> James
>
>
>
> On Feb 5, 2013, at 7:41 PM, David Vossel <dvos...@redhat.com> wrote:
>
>>
>>
>> ----- Original Message -----
>>> From: "James Guthrie" <j...@open.ch>
>>> To: "The Pacemaker cluster resource manager" <pacemaker@oss.clusterlabs.org>
>>> Sent: Tuesday, February 5, 2013 8:12:57 AM
>>> Subject: Re: [Pacemaker] Pacemaker resource migration behaviour
>>>
>>> Hi all,
>>>
>>> as a follow-up to this, I realised that I needed to slightly change
>>> the way the resource constraints are put together, but I'm still
>>> seeing the same behaviour.
>>>
>
>>> Below are an excerpt from the logs on the host and the revised xml
>>> configuration. In this case, I caused two failures on the host mu,
>>> which forced the resources onto nu then I forced two failures on nu.
>>> What can be seen in the logs are the two detected failures on nu
>>> (the "warning: update_failcount:" lines). After the two failures on
>>> nu, the VIP is migrated back to mu, but none of the "support"
>>> resources are promoted with it.
>>
>> I can't tell much from this output.
>>
>> Run the steps you use to reproduce this and create a crm_report of the issue 
>> so we can see both the logs and pengine transition files that proceed this.
>>
>> -- Vossel
>>
>>
>>> Regards,
>>> James
>>>
>>
>> _______________________________________________
>> Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
>> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
>>
>> Project Home: http://www.clusterlabs.org
>> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
>> Bugs: http://bugs.clusterlabs.org
>
>
> _______________________________________________
> Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
>
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs: http://bugs.clusterlabs.org
>

_______________________________________________
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org

Reply via email to