On 2012-08-16T17:54:06, "EXTERNAL Konold Martin (erfrakon, RtP2/TEF72)"
<[email protected]> wrote:
Hi Martin,
> From my experience with SLES11 SP2 (with all current updates) I conclude that
> actually nobody is seriously running SP2 without local bugfixes.
That isn't quite true.
> E.g. Even the most simple examples from the official SuSE documentation don't
> work as expected.
Which ones?
> A trivial example is ocf:heartbeat:exportfs as distributed by SuSE with SP2
> causes unlimited growth of .rmtab files (goes fast in the gigabytes for
> serious NFS servers). I could work around this issue using some shell
> scripting.
This is an annoying bug, yes. It's an upstream bug that's been fixed
now. I'll check if the maintenance update is already released.
> There are other issues which are more than annoying and actually make the
> SLES SP2 HA Extension unusable for production systems. E.g. clvmd cannot be
> made less verbose from the cluster configuration. (No daemon_options="-d0"
> does not help!)
It shouldn't log that much even at the regular loglevel. It's reasonably
quiet (except on fail-/switch-overs, of course). What do you find
excessive?
> Not funny is also the fact that the official SLES 11 SP2 kernels crash
> seriously (when a node rejoins the cluster) when using STCP as
> recommended in the SLES HA documentation and offered via the wizards.
> It took me a while to find out what was going on.
We've not observed this. Have you reported a bug?
> When setting up a system with many (rather simple) resources funny things
> happen due to race conditions all over the place. (can be worked around
> mostly using arbitrary start-delay options.
I've not encountered this either. Sorry for asking this, but: did you
report a bug?
> Oh, did I mention that situations which are actually forbidden by constraints
> (e.g. using a score of INFINITY) actually do happen... Depending on the
> environment this can lead to not so funny effects.
That would be a serious bug in the policy engine (and not just limited
to SLE HA 11 SP2).
> E.g. I defined the following constraints:
>
> colocation c17 inf: p_lsb_ccslogserver p_fs_daten
> order o34 inf: p_fs_daten p_lsb_ccslogserver:start
>
> I can proof from the logs that ccslogserver (an application) got migrated
> from node A to node B while p_fs_daten (a filesystem on top of drbd) was
> definitely still running on node A
I'd be very, very interested in seeing these logs. The rules you
specified above should not allow for that, and I can't immediately
imagine other rules that still might allow for it.
> Reporting bugs is not possible without a direct support contract. (You must
> enter into a support contract with SuSE before you can even report a bug or
> provide a patch ....)
Strangely enough, enterprise distributions target paying customers. This
is not, I believe, a SUSE-specific constraint.
You can always file a bug against the upstream projects in the
respective communities; these will then possibly tell you to upgrade to
latest upstream first and reproduce. Eventually, these bugs will trickle
back into the enterprise distributions as well. That may just take a
while.
But yes, SLE HA (and RHEL clustering too) sort-of target customers who
have support contracts, either will SUSE/RHT or a strong consulting
partner (who preferably is a high-grade technology partner with the
distributor).
I admit I don't find this particular complaint convincing.
Regards,
Lars
--
Architect Storage/HA
SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer, HRB
21284 (AG Nürnberg)
"Experience is the name everyone gives to their mistakes." -- Oscar Wilde
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems