Ok, reinstalled and re-tested. What I've learned:
- HA only works now if OOB is configured, the old way HA no longer applies - this can be good and bad, not everyone has IPMIs - HA only works if IPMI is reachable. I've pulled the cord on a HV and HA failed to do its thing, leaving me with a HV down along with all the VMs running there. That's bad. I've opened this ticket for it: https://issues.apache.org/jira/browse/CLOUDSTACK-10234 Let me know if you need any extra info or stuff to test. Regards, Lucian -- Sent from the Delta quadrant using Borg technology! Nux! www.nux.ro ----- Original Message ----- > From: "Nux!" <n...@li.nux.ro> > To: "dev" <dev@cloudstack.apache.org> > Sent: Tuesday, 16 January, 2018 11:35:58 > Subject: Re: HA issues > I'll reinstall my setup and try again, just to be sure I'm working on a clean > slate. > > -- > Sent from the Delta quadrant using Borg technology! > > Nux! > www.nux.ro > > ----- Original Message ----- >> From: "Rohit Yadav" <rohit.ya...@shapeblue.com> >> To: "dev" <dev@cloudstack.apache.org> >> Sent: Tuesday, 16 January, 2018 11:29:51 >> Subject: Re: HA issues > >> Hi Lucian, >> >> >> If you're talking about the new HostHA feature (with KVM+nfs+ipmi), please >> refer >> to following docs: >> >> http://docs.cloudstack.apache.org/projects/cloudstack-administration/en/latest/hosts.html#out-of-band-management >> >> https://cwiki.apache.org/confluence/display/CLOUDSTACK/Host+HA >> >> >> We'll need to you look at logs perhaps create a JIRA ticket with the logs and >> details? If you saw ipmi based reboot, then host-ha indeed tried to recover >> i.e. reboot the host, once hostha has done its work it would schedule HA for >> VM >> as soon as the recovery operation succeeds (we've simulator and kvm based >> marvin tests for such scenarios). >> >> >> Can you see it making attempt to schedule VM ha in logs, or any failure? >> >> >> - Rohit >> >> <https://cloudstack.apache.org> >> >> >> >> ________________________________ >> From: Nux! <n...@li.nux.ro> >> Sent: Tuesday, January 16, 2018 12:47:56 AM >> To: dev >> Subject: [4.11] HA issues >> >> Hi, >> >> I see there's a new HA engine for KVM and IPMI support which is really nice, >> however it seems hit and miss. >> I have created an instance with HA offering, kernel panicked one of the >> hypervisors - after a while the server was rebooted via IPMI probably, but >> the >> instance never moved to a running hypervisor and even after the original >> hypervisor came back it was still left in Stopped state. >> Is there any extra things I need to set up to have proper HA? >> >> Regards, >> Lucian >> >> -- >> Sent from the Delta quadrant using Borg technology! >> >> Nux! >> www.nux.ro >> >> rohit.ya...@shapeblue.com >> www.shapeblue.com >> 53 Chandos Place, Covent Garden, London WC2N 4HSUK > > @shapeblue