Hi,

On Mon, Mar 14, 2011 at 10:57:27AM -0000, darren.mans...@opengi.co.uk wrote:
> Hello everyone.
>
> I built and put into production without adequate testing a 2 node
> cluster running Ubuntu 10.04 LTS with Pacemaker and associated packages
> from the Ubuntu-HA-maintainers repo
> (https://launchpad.net/~ubuntu-ha-maintainers/+archive/ppa).

Not good to go live without sufficient testing. Testing is as
important as anything else. Or even more important. If there
isn't enough time for testing, then better to go without
clustering.

> I've always had many problems with my build, mainly because it was
> over-complicated and I didn't have adequate time to test it and tweak it
> before putting it live. If I list my problems below, could anyone have a
> look and see if there is anything obvious? Thanks.
>
[...]
> 2. Crm shell won't load from a text file. When I use crm configure
> < crm.txt, it will run through the file, complaining about the default
> timeout being less than 240, but doesn't load anything. So I go into the
> crm shell and set default-action-timeout to 240, commit and exit and do
> the same. This time it just exits silently, without loading the config.

Strange. I assume that you run version 1.0.x which I don't use
very often, but I cannot recall seeing this problem.

> If I go into the crm shell and use load replace crm.txt it will work.

Loading from a file was really meant to be done with "configure
load". Now, if there are errors/warnings in the configuration,
what happens depends on check-* options for semantic checks.

> 3. Crm shell tab completes don't work unless you put an incorrect
> entry in first. I'm sure this is a python readline problem, as it also
> happens in SLE 11 HAE SP1 (but not in pre-SP1). I assume everyone
> associated (Dejan?) is aware of the problem, but highlighting it just in
> case.

No, I'm not aware of it. Tab completion works here, though a bit
differently from 1.0 due to lazy creation of the completion
tables. You need to enter another level at least once before the
tab completion is going to work for that level.
For instance, it won't work in this case:

    crm(live)# resource <TAB><TAB>

But it would once the user enters the resource level:

    crm(live)resource# <TAB><TAB>
    bye         failcount   move        restart     unmigrate
    cd          help        param       show        unmove
    cleanup     list        promote     start       up
    demote      manage      quit        status      utilization
    end         meta        refresh     stop
    exit        migrate     reprobe     unmanage

Can you elaborate "put incorrect entry first"?

Thanks,

Dejan

> I've attached my crm config, cib XML, /etc/drbd.conf for reference.
> Please forgive my SSH STONITH, I've not had chance to get the IBM RSA
> configured on it yet.
>
> Thanks all!
>
> Best regards,
>
> Darren Mansell
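
To illustrate the lazy creation of completion tables mentioned above, here is a rough Python sketch. It is only a toy model of the behaviour, not the crm shell's actual code: the class name, the method names and the short command list are all made up for the example.

```python
class LazyCompleter:
    """Toy model: one completion table per level, built on first entry."""

    def __init__(self, levels):
        # levels: mapping of level name -> list of commands (hypothetical data)
        self._levels = levels
        self._tables = {}  # filled lazily, one entry per visited level

    def enter(self, level):
        # Entering a level is what triggers creation of its table.
        if level not in self._tables:
            self._tables[level] = sorted(self._levels[level])

    def complete(self, level, prefix=""):
        # No table yet means no completions for that level.
        table = self._tables.get(level)
        if table is None:
            return []
        return [word for word in table if word.startswith(prefix)]


shell = LazyCompleter({"resource": ["start", "stop", "status", "migrate"]})
before = shell.complete("resource", "st")  # level not entered yet
shell.enter("resource")
after = shell.complete("resource", "st")   # table now exists
print(before, after)
```

In this model, completion for the resource level returns nothing until the level has been entered once, which matches the two prompts shown above.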