Re: [Openstack] HA Compute & Instance Evacuation

2018-05-02 Thread John Petrini
Take this with a grain of salt because we're using the original version before the project moved under the Big Tent and I'm not sure how much it's evolved since then. I assume the basic functions are the same though. You're correct; Corosync and Pacemaker are used to determine if a compute node go

Re: [Openstack] HA Compute & Instance Evacuation

2018-05-02 Thread Torin Woltjer
I'm vaguely familiar with Pacemaker/Corosync, as I'm using it with HAProxy on my controller nodes. I'm assuming in this instance that you use Pacemaker on your compute hosts so masakari can detect host outages? If possible could you go into more detail about the configuration? I would like to us

Re: [Openstack] HA Compute & Instance Evacuation

2018-05-02 Thread Torin Woltjer
> There is no HA behaviour for compute nodes.
>
> You are referring to HA of workloads running on compute nodes, not HA of
> compute nodes themselves.

It was a mistake for me to say HA when referring to compute and instances. Really I want to avoid a situation where one of my compute hosts gives u

Re: [Openstack] HA Compute & Instance Evacuation

2018-05-02 Thread John Petrini
We're using the original Masakari project for this and it works really well. In fact, just last week we lost a compute node and all of the VMs were successfully migrated to a reserve host in under 5 minutes. It's a really nice feeling when your infrastructure heals itself before you even get a chance t

Re: [Openstack] [masakari] HA Compute & Instance Evacuation

2018-05-02 Thread Jay Pipes
On 05/02/2018 04:39 PM, Torin Woltjer wrote:
> There is no HA behaviour for compute nodes.
>
> You are referring to HA of workloads running on compute nodes, not HA of
> compute nodes themselves.

It was a mistake for me to say HA when referring to compute and instances. Really I want to avoi

Re: [Openstack] HA Compute & Instance Evacuation

2018-05-02 Thread Jay Pipes
On 05/02/2018 02:43 PM, Torin Woltjer wrote:
> I am working on setting up Openstack for HA and one of the last orders of business is getting HA behavior out of the compute nodes.

There is no HA behaviour for compute nodes.

> Is there a project that will automatically evacuate instances from a dow

[Openstack] HA Compute & Instance Evacuation

2018-05-02 Thread Torin Woltjer
I am working on setting up Openstack for HA and one of the last orders of business is getting HA behavior out of the compute nodes. Is there a project that will automatically evacuate instances from a downed or failed compute host, and automatically reboot them on their new host? I'm curious wha

Re: [Openstack] SRIOV Enablement in openstack error

2018-05-02 Thread Moshe Levi
Hi Navdeep, The yellow lines that you outlined come from the OVS mechanism driver and indicate that it can't bind a direct port, which is expected. (/usr/lib/python2.7/dist-packages/neutron/plugins/ml2/drivers/mech_agent.py is part of the OVS mechanism driver.) The error with the sriov me

[Openstack] SRIOV Enablement in openstack error

2018-05-02 Thread Navdeep Uniyal
Dear all, I am trying to enable the SR-IOV interfaces in my OpenStack Pike installation. I am following the guide: https://docs.openstack.org/neutron/pike/admin/config-sriov.html I am getting the following error in neutron while doing so: 2018-05-02 17:08:19.492 75833 ERROR neutron.plugins.ml2.manag