Well, I am not sure whether what I did is right, but I added the crm directive to ha.cf:

# vim /etc/ha.d/ha.cf
crm on
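For anyone checking the same thing, here is a minimal sketch of a test for that directive. The function name has_crm_enabled is made up for illustration, and the accepted values (on, yes, respawn) are my assumption about what Heartbeat's ha.cf allows, so adjust them to your version.

```shell
#!/bin/sh
# Hypothetical helper (not part of Heartbeat or Pacemaker):
# succeeds if the given ha.cf contains an active, uncommented
# "crm" directive set to on, yes, or respawn.
# The list of accepted values is an assumption.
has_crm_enabled() {
    cf="${1:-/etc/ha.d/ha.cf}"
    grep -Eq '^[[:space:]]*crm[[:space:]]+(on|yes|respawn)([[:space:]]|$)' "$cf"
}
```

Called with no argument it looks at /etc/ha.d/ha.cf; it returns 0 only when an uncommented crm line with one of those values is present, so a commented-out "#crm on" does not count.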
// now crm_mon displays things as usual!

# crm_mon -i5
============
Last updated: Wed Jan 6 00:49:04 2010
Stack: Heartbeat
Current DC: node2.mydomain.com (8e8ca99f-ff34-45c7-814b-d73d69889441) - partition with quorum
Version: 1.0.6-f709c638237cdff7556cb6ab615f32826c0f8c06
2 Nodes configured, unknown expected votes
0 Resources configured.
============

Online: [ node1.mydomain.com node2.mydomain.com ]

Now I'll configure my resources under Pacemaker as I always did and let you know of any progress or problems.

Thank you, Dejan, for keeping up with me on this issue :)

=====================================================================================

On Wed, Jan 6, 2010 at 12:08 AM, Angie T. Muhammad <angie.taw...@gmail.com> wrote:

> Hello,
> Thank you for the prompt reply.
>
> All permissions are correct, and here is the output of ulimit:
> # cd /var/lib/heartbeat/cores/
> # ulimit -a
> core file size          (blocks, -c) 0
> data seg size           (kbytes, -d) unlimited
> scheduling priority             (-e) 0
> file size               (blocks, -f) unlimited
> pending signals                 (-i) 73728
> max locked memory       (kbytes, -l) 32
> max memory size         (kbytes, -m) unlimited
> open files                      (-n) 1024
> pipe size            (512 bytes, -p) 8
> POSIX message queues     (bytes, -q) 819200
> real-time priority              (-r) 0
> stack size              (kbytes, -s) 10240
> cpu time               (seconds, -t) unlimited
> max user processes              (-u) 73728
> virtual memory          (kbytes, -v) unlimited
> file locks                      (-x) unlimited
>
> *What should I do in this respect?*
>
>
> On Tue, Jan 5, 2010 at 10:37 PM, Dejan Muhamedagic <deja...@fastmail.fm> wrote:
>
>> Hi,
>>
>> On Tue, Jan 05, 2010 at 09:47:46PM +0200, Angie T. Muhammad wrote:
>> > mmm, I truncated the logs to regenerate the error and send you the file, but
>> > the error no longer appears at /var/log/messages now. There were the words
>> > "kernel" and "segfault" on the last line !!!
>>
>> Did you enable coredumps (ulimit -c)? Please check
>> /var/lib/heartbeat/cores/*.
>>
>> > Any way, I'll try to regenerate the error at /var/log/messages and send it.
>> > Till then, would you please let me know which files exactly you mean have
>> > wrong permissions?
>>
>> d /var/lib/heartbeat 0755 root root
>> d /var/lib/pengine 0750 hacluster haclient
>> d /var/lib/heartbeat/crm 0750 hacluster haclient
>> d /var/run/crm 0750 hacluster haclient
>>
>> Thanks,
>>
>> Dejan
>>
>> > Thank you
>> >
>> >
>> > On Tue, Jan 5, 2010 at 9:29 PM, Dejan Muhamedagic <deja...@fastmail.fm> wrote:
>> >
>> > > Hi,
>> > >
>> > > On Tue, Jan 05, 2010 at 09:19:16PM +0200, Angie T. Muhammad wrote:
>> > > > Hello all
>> > > >
>> > > > Thank you Dejan and Dr. Schwartzkopff
>> > > > But please bear with me because I'm still suffering a problem. Here is what
>> > > > I did:
>> > > >
>> > > > # wget -O /etc/yum.repos.d/clusterlabs.repo http://clusterlabs.org/rpm/epel-5/clusterlabs.repo
>> > > > # yum install pacemaker pacemaker-libs cluster-glue cluster-glue-libs resource-agents heartbeat
>> > > >
>> > > > ============================================================
>> > > >  Package            Arch    Version       Repository   Size
>> > > > ============================================================
>> > > > Installing:
>> > > >  cluster-glue       x86_64  1.0.1-1.el5   clusterlabs  262 k
>> > > >  cluster-glue-libs  x86_64  1.0.1-1.el5   clusterlabs  130 k
>> > > >  heartbeat          x86_64  3.0.1-1.el5   clusterlabs  193 k
>> > > >  pacemaker          x86_64  1.0.6-1.el5   clusterlabs  689 k
>> > > >  pacemaker-libs     x86_64  1.0.6-1.el5   clusterlabs  310 k
>> > > >  resource-agents    x86_64  1.0.1-1.el5   clusterlabs  179 k
>> > > > Installing for dependencies:
>> > > >  corosync           x86_64  1.1.2-1.el5   clusterlabs  163 k
>> > > >  corosynclib        x86_64  1.1.2-1.el5   clusterlabs  163 k
>> > > >  heartbeat-libs     x86_64  3.0.1-1.el5   clusterlabs  292 k
>> > > >  libesmtp           x86_64  1.0.4-5.el5   epel          60 k
>> > > >  libibverbs         x86_64  1.1.2-4.el5   base          44 k
>> > > >  librdmacm          x86_64  1.0.8-5.el5   base          22 k
>> > > >  openhpi-libs       x86_64  2.14.0-5.el5  base         168 k
>> > > >  openib             noarch  1.4.1-3.el5   base          20 k
>> > > >
>> > > > Transaction Summary
>> > > > ============================================================
>> > > > Install 14 Package(s)
>> > > > Update 0 Package(s)
>> > > > Remove 0 Package(s)
>> > > >
>> > > > Total download size: 2.6 M
>> > > >
>> > > > # vim /etc/ha.d/ha.cf
>> > > > keepalive 2
>> > > > deadtime 30
>> > > > warntime 10
>> > > > initdead 120
>> > > > udpport 694
>> > > > ucast eth1 10.0.0.101
>> > > > auto_failback on
>> > > > node node1.mydomain.com
>> > > > node node2.mydomain.com
>> > > > use_logd yes
>> > > >
>> > > > // and I changed the ucast directive properly for each node
>> > > >
>> > > > # vim /etc/ha.d/authkeys
>> > > > # chmod 600 /etc/ha.d/authkeys
>> > > > # /etc/init.d/heartbeat start
>> > > > Starting High-Availability services: [ OK ]
>> > > > // started properly on both nodes
>> > > >
>> > > > # crm_mon -i5
>> > > > Attempting connection to the cluster....
>> > > >
>> > > > # strace -o hb-again crm_mon -i5
>> > > > // the file is attached
>> > > >
>> > > > // I didn't find perl on the system, so I installed it
>> > > > # yum install perl
>> > > >
>> > > > // indeed, I believe the error is at around 92% of the strace output file
>> > > > when it attempts to:
>> > > >
>> > > > connect(3, {sa_family=AF_FILE, path="/var/run/crm/cib_ro"...}, 110) = -1 ENOENT (No such file or directory)
>> > > > close(3) = 0
>> > > > socket(PF_FILE, SOCK_STREAM, 0) = 3
>> > > > fcntl(3, F_GETFL) = 0x2 (flags O_RDWR)
>> > > > fcntl(3, F_SETFL, O_RDWR|O_NONBLOCK) = 0
>> > > > connect(3, {sa_family=AF_FILE, path="/var/run/crm/cib_callback"...}, 110) = -1 ENOENT (No such file or directory)
>> > >
>> > > Looks like cib didn't start. The logs should say why. Perhaps
>> > > there are permission problems?
>> > >
>> > > Thanks,
>> > >
>> > > Dejan
>> > >
>> > > > I can't understand why it can not run :( ..
>> > > > Version 1.0.5 of pacemaker and openais 0.80.5 worked like a charm on the
>> > > > same nodes.
>> > > > Now I have to shift to heartbeat because of the unicast directive. Please help!
>> > > >
>> > > > Thank you in advance
>> > > >
>> > > >
>> > > > On Tue, Jan 5, 2010 at 2:17 PM, Michael Schwartzkopff <mi...@multinet.de> wrote:
>> > > >
>> > > > > On Tuesday, 5 January 2010 13:00:44, Dejan Muhamedagic wrote:
>> > > > > > Hi,
>> > > > > >
>> > > > > > On Tue, Jan 05, 2010 at 01:51:38PM +0200, Angie T. Muhammad wrote:
>> > > > > > > Hello all,
>> > > > > > > Hope you spent good time on holidays!
>> > > > > > >
>> > > > > > > Our data center does not support multicast and I have been googling
>> > > > > > > "unicast site:openais.org" but no results.
>> > > > > > > And changing our data center is not an option at the moment.
>> > > > > > >
>> > > > > > > I wonder does any beta version of openais support unicast?
>> > > > > >
>> > > > > > I think that the latest corosync (1.2.0) supports broadcast.
>> > > > > >
>> > > > > > > If not, do you have any link to pacemaker installation with heartbeat
>> > > > > > > stack?
>> > > > > >
>> > > > > > clusterlabs.org has some installation docs and there are also
>> > > > > > brand new docs at http://linux-ha.org/wiki/Documentation
>> > > > > >
>> > > > > > Thanks,
>> > > > > >
>> > > > > > Dejan
>> > > > > >
>> > > > > > > Indeed, I would be very grateful if you could suggest me any other
>> > > > > > > solution?
>> > > > >
>> > > > >
>> > > > > Perhaps you could use a tunnel (gre, ...) to route the multicast.
>> > > > >
>> > > > > --
>> > > > > Dr. Michael Schwartzkopff
>> > > > > MultiNET Services GmbH
>> > > > > Address: Bretonischer Ring 7; 85630 Grasbrunn; Germany
>> > > > > Tel: +49 - 89 - 45 69 11 0
>> > > > > Fax: +49 - 89 - 45 69 11 21
>> > > > > mob: +49 - 174 - 343 28 75
>> > > > >
>> > > > > mail: mi...@multinet.de
>> > > > > web: www.multinet.de
>> > > > >
>> > > > > Registered office: 85630 Grasbrunn
>> > > > > Commercial register: Amtsgericht München HRB 114375
>> > > > > Managing directors: Günter Jurgeneit, Hubert Martens
>> > > > >
>> > > > > ---
>> > > > >
>> > > > > PGP Fingerprint: F919 3919 FF12 ED5A 2801 DEA6 AA77 57A4 EDD8 979B
>> > > > > Skype: misch42
>> > > > >
>> > > > > _______________________________________________
>> > > > > Pacemaker mailing list
>> > > > > Pacemaker@oss.clusterlabs.org
>> > > > > http://oss.clusterlabs.org/mailman/listinfo/pacemaker
>> > > >
>> > > > --
>> > > > All the best,
>> > > > Angie
>> >
>> > --
>> > All the best,
>> > Angie
>
> --
> All the best,
> Angie

--
All the best,
Angie