On Mon, 05 Jul 2010 18:34:24 +0300, Dan Frincu <dfri...@streamwide.ro>
wrote:
 
> /The errors from the log file are DRBD specific, they occur when you're
> trying to mount a resource in a Secondary state. 
> Increase the "op start interval" for both the DRBD and Filesystem
> primitives to ~15 seconds. Having configured a start 
> interval of 0 (zero) seconds, the change of DRBD resource from Primary
to
> Secondary on node2 and then promotion to 
> Primary on node1 is not instantaneous, therefore Pacemaker attempts to
> mount the filesystem without having the DRBD 
> resource in a Primary state, it goes into that huuuge 300 second
timeout,
> but as it waits for one resource (DRBD) to 
> timeout, it executes the next one, which is the mount, which fails, with
> the given errors, for the aforementioned reasons.
> 
> I'd also suggest adding an "op monitor" for each resource, with a
> reasonable interval and timeout, and also a mail alert.
> 
> Regards,
> Dan

Ok, that almost solved the problem.
But now the Filesystem primitives run in an endless loop.
The get unmounted and mounted again.


> therefore Pacemaker attempts to
> mount the filesystem without having the DRBD 
> resource in a Primary state

Hm, until now I thought this is handled by
the 3 "order" restrictions.

I see I have to find out which intervalls and timeouts I need to adjust.
Thanks for giving me a hint to the right direction so quickly.

If you have some other ideas to improve the config, just let me now.

Cheers, Sven

_______________________________________________
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker

Reply via email to