** Description changed: [ Impact ] TBD. [ Test Plan ] Even though this problem happens only when using nested VMs with Intel CPUs, it is still recommended to perform the following tests on a bare metal machine also with an Intel CPU. In theory it should be possible to reproduce this on a host using an AMD CPU, but you'd have to explicitly tell LXD to create VMs with Intel CPUs. Credits to Guillaume Boutry for providing scripts automating most of the reproduction steps. Let's create two Noble VMs using LXD: $ lxc launch ubuntu:noble --vm --config limits.cpu=4 --config limits.memory=8GiB -d root,size=80GiB libvirt-1 $ lxc launch ubuntu:noble --vm --config limits.cpu=4 --config limits.memory=8GiB -d root,size=80GiB libvirt-2 You will need to generate an SSH keypair for the "ubuntu" user on libvirt-1 and install the public key on libvirt-2 so that "ssh libvirt-2.lxd" works. The rest of this test plan assumes you have done that. Inside libvirt-1: # apt update # apt install -y libvirt-daemon-system uuid # echo "host_uuid = \"00000000-0000-0000-0000-$(printf "%012x" "${RANDOM}")\"" >> /etc/libvirt/libvirtd.conf # systemctl restart libvirtd.service # su - ubuntu $ cd /tmp $ wget http://cloud-images.ubuntu.com/noble/current/noble-server-cloudimg-amd64.img $ sudo chown libvirt-qemu:kvm noble-server-cloudimg-amd64.img $ cd $ cat > domain.xml << _EOF_ <domain type="kvm"> - <uuid>$(uuidgen)</uuid> - <name>test-domain</name> - <memory>1048576</memory> - <vcpu>2</vcpu> - <os> - <type arch="x86_64" machine="pc">hvm</type> - <boot dev="hd"/> - </os> - <features> - <acpi/> - <apic/> - <vmcoreinfo/> - </features> - <clock offset="utc"> - <timer name="pit" tickpolicy="delay"/> - <timer name="rtc" tickpolicy="catchup"/> - <timer name="hpet" present="no"/> - </clock> - <cpu mode="host-model" match="exact"> - <topology sockets="2" cores="1" threads="1"/> - </cpu> - <devices> - <disk type="file" device="disk"> - <driver name="qemu" type="qcow2" cache="none"/> - <source file="/tmp/noble-server-cloudimg-amd64.img"/> - <target dev="vda" bus="virtio"/> - </disk> - <video> - <model type="qxl"/> - </video> - <rng model="virtio"> - <backend model="random">/dev/urandom</backend> - </rng> - <controller type="usb" index="0" model="none"/> - <memballoon model="virtio"> - <stats period="10"/> - </memballoon> - </devices> + <uuid>$(uuidgen)</uuid> + <name>test-domain</name> + <memory>1048576</memory> + <vcpu>2</vcpu> + <os> + <type arch="x86_64" machine="pc">hvm</type> + <boot dev="hd"/> + </os> + <features> + <acpi/> + <apic/> + <vmcoreinfo/> + </features> + <clock offset="utc"> + <timer name="pit" tickpolicy="delay"/> + <timer name="rtc" tickpolicy="catchup"/> + <timer name="hpet" present="no"/> + </clock> + <cpu mode="host-model" match="exact"> + <topology sockets="2" cores="1" threads="1"/> + </cpu> + <devices> + <disk type="file" device="disk"> + <driver name="qemu" type="qcow2" cache="none"/> + <source file="/tmp/noble-server-cloudimg-amd64.img"/> + <target dev="vda" bus="virtio"/> + </disk> + <video> + <model type="qxl"/> + </video> + <rng model="virtio"> + <backend model="random">/dev/urandom</backend> + </rng> + <controller type="usb" index="0" model="none"/> + <memballoon model="virtio"> + <stats period="10"/> + </memballoon> + </devices> </domain> _EOF_ $ virsh define domain.xml $ virsh start test-domain $ virsh dumpxml --migratable test-domain > migratable.xml Inside libvirt-2: # apt update # apt install -y libvirt-daemon-system # cd /tmp # wget http://cloud-images.ubuntu.com/noble/current/noble-server-cloudimg-amd64.img # chown libvirt-qemu:kvm noble-server-cloudimg-amd64.img # cd Now, back to libvirt-1, we are ready to test the migration: - $ virsh migrate test-domain qemu+ssh://libvirt-2/system --live + $ virsh migrate test-domain qemu+ssh://libvirt-2.lxd/system --live --persistent --undefinesource --copy-storage-inc --migrate-disks vda --persistent-xml migratable.xml --xml migratable.xml On Noble, you should see the following error: error: unsupported configuration: Target CPU feature count 28 does not match source 96 [ Where problems could occur ] TBD. [ Original Description ] This is issue is reproduced consistently from the snap-openstack- hypervisor built from https://git.launchpad.net/ubuntu/+source/libvirt@ubuntu/noble-updates (with patches applied). When creating a nova instance, live migrating between two hosts always fails because of: error: unsupported configuration: Target CPU feature count 44 does not match source 109 Command that reproduces a Nova migration using libvirt client (and reproduces the same error): virsh migrate instance-00000002 qemu+tls://juju-596fd1-1.lxd/system --live --p2p --persistent --undefinesource --copy-storage-inc --migrate- disks vda --xml migratable.xml --persistent-xml migratable.xml --bandwidth 0 Attached to this bug you will find: - instance.xml: domain dumped through virsh - migratable.xml: domain drump through virsh using --migratable (same flags as nova updated xml) - libvirtd.log: libvirt daemon debug logs showcasing why it refused to migrate As you can see in the logs from libvirtd.log, the method virDomainDefCheckABIStabilityFlags fails because the src has 65 VMX additional features that are not found on the destination. (Both hypervisors are hosted on LXD VMs on the same physical machines i.e. same cpu flags)
-- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2083986 Title: Live migration fails because VMX features are missing on target cpu definition To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/libvirt/+bug/2083986/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs