Hi Andrew and thanks , but what is the value of "expiry period" ? on the cluster I face the problem, the failcount has not been reset after several days ... Alain
De : Andrew Beekhof <[email protected]> A : General Linux-HA mailing list <[email protected]> Date : 08/01/2013 00:07 Objet : Re: [Linux-HA] Question about failcounts Envoyé par : [email protected] On Mon, Jan 7, 2013 at 6:43 PM, <[email protected]> wrote: > Hi, > > I thought that if for example we have a migration-threshold=3 , and if > the primitive fails to start the first time, but starts on second try on > same node, the failcount for this primitive was automatically reseted for > this node, am I wrong ? Yes. It is removed after the expiry period, not after the next success. Consider a resource for which the start succeeds but monitors always fail. > > because I've found a configuration where crm_mon does not display any > resource "Failed ...", and the primitive was running fine, but the > failcount for this primitive remains 1 for the node where this primitive > was currently running ... > > Is this behavior normal ? > > This would mean that, even if crm_mon does not display any resource > Failed, we must check by crm resource failcount ... the value of all > failcounts ... and execute a cleanup if the failcount is not 0 ? > > Thanks for your answer. > Regards > Alain > _______________________________________________ > Linux-HA mailing list > [email protected] > http://lists.linux-ha.org/mailman/listinfo/linux-ha > See also: http://linux-ha.org/ReportingProblems _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
