Re: [RFC V1 1/6] Revert "vhost-backend: remove vhost_kernel_reset_device()"

Euan Turner Tue, 03 Sep 2024 05:04:59 -0700

Hi Steve,

On 30/08/2024 12:56, Steve Sistare wrote:

This reverts commit e6383293eb01928692047e617665a742cca87e23.
The reset function is needed for CPR.


Signed-off-by: Steve Sistare <steven.sist...@oracle.com>
---
  hw/virtio/vhost-backend.c | 6 ++++++
  1 file changed, 6 insertions(+)

diff --git a/hw/virtio/vhost-backend.c b/hw/virtio/vhost-backend.c
index 833804d..9b75141 100644
--- a/hw/virtio/vhost-backend.c
+++ b/hw/virtio/vhost-backend.c
@@ -221,6 +221,11 @@ static int vhost_kernel_set_owner(struct vhost_dev *dev)
      return vhost_kernel_call(dev, VHOST_SET_OWNER, NULL);
  }

+static int vhost_kernel_reset_device(struct vhost_dev *dev)

+{
+    return vhost_kernel_call(dev, VHOST_RESET_OWNER, NULL);
+}
+

How does this series avoid falling foul ofc0c4f147291f37765a5275aa24c3e1195468903b (which follows the commitreverted here)?

I've been playing around with this patch series a bit, in the context ofcpr-transfer, and am seeing the issues highlighted in that c0c4...commit message:Since vhost-kernel now has a reset_device, this is called invirtio_reset as part of qemu_machine_creation_done. (I have the fullbacktrace if it's helpful). Subsequent ioctls then fail (with ownershiperrors) due to the RESET_OWNER:


2024-09-02T15:40:56.860541Z qemu-kvm: vhost_set_vring_call failed 1
2024-09-02T15:40:56.860908Z qemu-kvm: vhost_set_vring_call failed 1

2024-09-02T15:40:56.861253Z qemu-kvm: vhost_set_mem_table failed:Operation not permitted (1)

2024-09-02T15:40:56.861586Z qemu-kvm: vhost_set_vring_call failed 1
2024-09-02T15:40:56.861831Z qemu-kvm: vhost_set_vring_call failed 1

2024-09-02T15:40:56.862199Z qemu-kvm: unable to start vhost net: 1:falling back on userspace virtio

For me the NIC then fails during the migration, although the migrationas a whole appears to succeed. (At least, prior the the migration, Icould ssh into the VM and ping out to 8.8.8.8, but then I lose the sshconnection during the migration, and cannot ssh back in again afterwardson the new QEMU).

Do you think this could be because of QEMU falling back from the vhostbackend to use virtio?

It may be down to some misconfiguration on my part, here's the netdevcommand line I had for reference:

On the source QEMU:

-netdev'{"type":"tap","fd":"39","vhost":true,"vhostfd":"40","id":"hostua-43bc0eaf-ff55-44e6-87ec-a4798f592db1"}'\-device'{"driver":"virtio-net-pci","rx_queue_size":256,"netdev":"hostua-43bc0eaf-ff55-44e6-87ec-a4798f592db1","id":"ua-43bc0eaf-ff55-44e6-87ec-a4798f592db1","mac":"50:6b:8d:0c:03:e0","bus":"pci.1","addr":"0x0"}'\


On the destination QEMU:

-netdev'{"type":"tap","fd":"-1","vhostfd":"-1","id":"hostua-43bc0eaf-ff55-44e6-87ec-a4798f592db1"}'\-device'{"driver":"virtio-net-pci","rx_queue_size":256,"netdev":"hostua-43bc0eaf-ff55-44e6-87ec-a4798f592db1","id":"ua-43bc0eaf-ff55-44e6-87ec-a4798f592db1","mac":"50:6b:8d:0c:03:e0","bus":"pci.1","addr":"0x0"}'\

  static int vhost_kernel_get_vq_index(struct vhost_dev *dev, int idx)
  {
      assert(idx >= dev->vq_index && idx < dev->vq_index + dev->nvqs);
@@ -345,6 +350,7 @@ const VhostOps kernel_ops = {
          .vhost_get_features = vhost_kernel_get_features,
          .vhost_set_backend_cap = vhost_kernel_set_backend_cap,
          .vhost_set_owner = vhost_kernel_set_owner,
+        .vhost_reset_device = vhost_kernel_reset_device,
          .vhost_get_vq_index = vhost_kernel_get_vq_index,
          .vhost_vsock_set_guest_cid = vhost_kernel_vsock_set_guest_cid,
          .vhost_vsock_set_running = vhost_kernel_vsock_set_running,


Thanks,
Euan

Re: [RFC V1 1/6] Revert "vhost-backend: remove vhost_kernel_reset_device()"

Reply via email to