On 02.12.2023 23:22, Michael S. Tsirkin wrote:
> On Fri, Dec 01, 2023 at 01:40:41PM +0300, Arseniy Krasnov wrote:
>>
>>
>> On 01.12.2023 12:48, Stefano Garzarella wrote:
>>> On Fri, Dec 01, 2023 at 11:35:56AM +0300, Arseniy Krasnov wrote:
>>>>
>>>>
>>>> On 01.12.2023 11:27, Stefano Garzarella wrote:
>>>>> On Thu, Nov 30, 2023 at 12:40:43PM -0500, Michael S. Tsirkin wrote:
>>>>>> On Thu, Nov 30, 2023 at 03:11:19PM +0100, Stefano Garzarella wrote:
>>>>>>> On Thu, Nov 30, 2023 at 08:58:58AM -0500, Michael S. Tsirkin wrote:
>>>>>>>> On Thu, Nov 30, 2023 at 04:43:34PM +0300, Arseniy Krasnov wrote:
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> On 30.11.2023 16:42, Michael S. Tsirkin wrote:
>>>>>>>>>> On Thu, Nov 30, 2023 at 04:08:39PM +0300, Arseniy Krasnov wrote:
>>>>>>>>>>> Send credit update message when SO_RCVLOWAT is updated and it is 
>>>>>>>>>>> bigger
>>>>>>>>>>> than number of bytes in rx queue. It is needed, because 'poll()' 
>>>>>>>>>>> will
>>>>>>>>>>> wait until number of bytes in rx queue will be not smaller than
>>>>>>>>>>> SO_RCVLOWAT, so kick sender to send more data. Otherwise mutual 
>>>>>>>>>>> hungup
>>>>>>>>>>> for tx/rx is possible: sender waits for free space and receiver is
>>>>>>>>>>> waiting data in 'poll()'.
>>>>>>>>>>>
>>>>>>>>>>> Signed-off-by: Arseniy Krasnov <avkras...@salutedevices.com>
>>>>>>>>>>> ---
>>>>>>>>>>>   Changelog:
>>>>>>>>>>>   v1 -> v2:
>>>>>>>>>>>    * Update commit message by removing 'This patch adds XXX' manner.
>>>>>>>>>>>    * Do not initialize 'send_update' variable - set it directly 
>>>>>>>>>>> during
>>>>>>>>>>>      first usage.
>>>>>>>>>>>   v3 -> v4:
>>>>>>>>>>>    * Fit comment in 'virtio_transport_notify_set_rcvlowat()' to 80 
>>>>>>>>>>> chars.
>>>>>>>>>>>   v4 -> v5:
>>>>>>>>>>>    * Do not change callbacks order in transport structures.
>>>>>>>>>>>
>>>>>>>>>>>   drivers/vhost/vsock.c                   |  1 +
>>>>>>>>>>>   include/linux/virtio_vsock.h            |  1 +
>>>>>>>>>>>   net/vmw_vsock/virtio_transport.c        |  1 +
>>>>>>>>>>>   net/vmw_vsock/virtio_transport_common.c | 27 
>>>>>>>>>>> +++++++++++++++++++++++++
>>>>>>>>>>>   net/vmw_vsock/vsock_loopback.c          |  1 +
>>>>>>>>>>>   5 files changed, 31 insertions(+)
>>>>>>>>>>>
>>>>>>>>>>> diff --git a/drivers/vhost/vsock.c b/drivers/vhost/vsock.c
>>>>>>>>>>> index f75731396b7e..4146f80db8ac 100644
>>>>>>>>>>> --- a/drivers/vhost/vsock.c
>>>>>>>>>>> +++ b/drivers/vhost/vsock.c
>>>>>>>>>>> @@ -451,6 +451,7 @@ static struct virtio_transport vhost_transport 
>>>>>>>>>>> = {
>>>>>>>>>>>           .notify_buffer_size       = 
>>>>>>>>>>> virtio_transport_notify_buffer_size,
>>>>>>>>>>>
>>>>>>>>>>>           .read_skb = virtio_transport_read_skb,
>>>>>>>>>>> +        .notify_set_rcvlowat      = 
>>>>>>>>>>> virtio_transport_notify_set_rcvlowat
>>>>>>>>>>>       },
>>>>>>>>>>>
>>>>>>>>>>>       .send_pkt = vhost_transport_send_pkt,
>>>>>>>>>>> diff --git a/include/linux/virtio_vsock.h 
>>>>>>>>>>> b/include/linux/virtio_vsock.h
>>>>>>>>>>> index ebb3ce63d64d..c82089dee0c8 100644
>>>>>>>>>>> --- a/include/linux/virtio_vsock.h
>>>>>>>>>>> +++ b/include/linux/virtio_vsock.h
>>>>>>>>>>> @@ -256,4 +256,5 @@ void virtio_transport_put_credit(struct 
>>>>>>>>>>> virtio_vsock_sock *vvs, u32 credit);
>>>>>>>>>>>   void virtio_transport_deliver_tap_pkt(struct sk_buff *skb);
>>>>>>>>>>>   int virtio_transport_purge_skbs(void *vsk, struct sk_buff_head 
>>>>>>>>>>> *list);
>>>>>>>>>>>   int virtio_transport_read_skb(struct vsock_sock *vsk, 
>>>>>>>>>>> skb_read_actor_t read_actor);
>>>>>>>>>>> +int virtio_transport_notify_set_rcvlowat(struct vsock_sock *vsk, 
>>>>>>>>>>> int val);
>>>>>>>>>>>   #endif /* _LINUX_VIRTIO_VSOCK_H */
>>>>>>>>>>> diff --git a/net/vmw_vsock/virtio_transport.c 
>>>>>>>>>>> b/net/vmw_vsock/virtio_transport.c
>>>>>>>>>>> index af5bab1acee1..8007593a3a93 100644
>>>>>>>>>>> --- a/net/vmw_vsock/virtio_transport.c
>>>>>>>>>>> +++ b/net/vmw_vsock/virtio_transport.c
>>>>>>>>>>> @@ -539,6 +539,7 @@ static struct virtio_transport virtio_transport 
>>>>>>>>>>> = {
>>>>>>>>>>>           .notify_buffer_size       = 
>>>>>>>>>>> virtio_transport_notify_buffer_size,
>>>>>>>>>>>
>>>>>>>>>>>           .read_skb = virtio_transport_read_skb,
>>>>>>>>>>> +        .notify_set_rcvlowat      = 
>>>>>>>>>>> virtio_transport_notify_set_rcvlowat
>>>>>>>>>>>       },
>>>>>>>>>>>
>>>>>>>>>>>       .send_pkt = virtio_transport_send_pkt,
>>>>>>>>>>> diff --git a/net/vmw_vsock/virtio_transport_common.c 
>>>>>>>>>>> b/net/vmw_vsock/virtio_transport_common.c
>>>>>>>>>>> index f6dc896bf44c..1cb556ad4597 100644
>>>>>>>>>>> --- a/net/vmw_vsock/virtio_transport_common.c
>>>>>>>>>>> +++ b/net/vmw_vsock/virtio_transport_common.c
>>>>>>>>>>> @@ -1684,6 +1684,33 @@ int virtio_transport_read_skb(struct 
>>>>>>>>>>> vsock_sock *vsk, skb_read_actor_t recv_acto
>>>>>>>>>>>   }
>>>>>>>>>>>   EXPORT_SYMBOL_GPL(virtio_transport_read_skb);
>>>>>>>>>>>
>>>>>>>>>>> +int virtio_transport_notify_set_rcvlowat(struct vsock_sock *vsk,
>>>>>>>>>>> int val)
>>>>>>>>>>> +{
>>>>>>>>>>> +    struct virtio_vsock_sock *vvs = vsk->trans;
>>>>>>>>>>> +    bool send_update;
>>>>>>>>>>> +
>>>>>>>>>>> +    spin_lock_bh(&vvs->rx_lock);
>>>>>>>>>>> +
>>>>>>>>>>> +    /* If number of available bytes is less than new SO_RCVLOWAT 
>>>>>>>>>>> value,
>>>>>>>>>>> +     * kick sender to send more data, because sender may sleep in
>>>>>>>>>>> its
>>>>>>>>>>> +     * 'send()' syscall waiting for enough space at our side.
>>>>>>>>>>> +     */
>>>>>>>>>>> +    send_update = vvs->rx_bytes < val;
>>>>>>>>>>> +
>>>>>>>>>>> +    spin_unlock_bh(&vvs->rx_lock);
>>>>>>>>>>> +
>>>>>>>>>>> +    if (send_update) {
>>>>>>>>>>> +        int err;
>>>>>>>>>>> +
>>>>>>>>>>> +        err = virtio_transport_send_credit_update(vsk);
>>>>>>>>>>> +        if (err < 0)
>>>>>>>>>>> +            return err;
>>>>>>>>>>> +    }
>>>>>>>>>>> +
>>>>>>>>>>> +    return 0;
>>>>>>>>>>> +}
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> I find it strange that this will send a credit update
>>>>>>>>>> even if nothing changed since this was called previously.
>>>>>>>>>> I'm not sure whether this is a problem protocol-wise,
>>>>>>>>>> but it certainly was not envisioned when the protocol was
>>>>>>>>>> built. WDYT?
>>>>>>>>>
>>>>>>>>> >From virtio spec I found:
>>>>>>>>>
>>>>>>>>> It is also valid to send a VIRTIO_VSOCK_OP_CREDIT_UPDATE packet 
>>>>>>>>> without previously receiving a
>>>>>>>>> VIRTIO_VSOCK_OP_CREDIT_REQUEST packet. This allows communicating 
>>>>>>>>> updates any time a change
>>>>>>>>> in buffer space occurs.
>>>>>>>>> So I guess there is no limitations to send such type of packet, e.g. 
>>>>>>>>> it is not
>>>>>>>>> required to be a reply for some another packet. Please, correct me if 
>>>>>>>>> im wrong.
>>>>>>>>>
>>>>>>>>> Thanks, Arseniy
>>>>>>>>
>>>>>>>>
>>>>>>>> Absolutely. My point was different - with this patch it is possible
>>>>>>>> that you are not adding any credits at all since the previous
>>>>>>>> VIRTIO_VSOCK_OP_CREDIT_UPDATE.
>>>>>>>
>>>>>>> I think the problem we're solving here is that since as an optimization 
>>>>>>> we
>>>>>>> avoid sending the update for every byte we consume, but we put a 
>>>>>>> threshold,
>>>>>>> then we make sure we update the peer.
>>>>>>>
>>>>>>> A credit update contains a snapshot and sending it the same as the 
>>>>>>> previous
>>>>>>> one should not create any problem.
>>>>>>
>>>>>> Well it consumes a buffer on the other side.
>>>>>
>>>>> Sure, but we are already speculating by not updating the other side when
>>>>> we consume bytes before a certain threshold. This already avoids to
>>>>> consume many buffers.
>>>>>
>>>>> Here we're only sending it once, when the user sets RCVLOWAT, so
>>>>> basically I expect it won't affect performance.
>>>>
>>>> Moreover I think in practice setting RCVLOWAT is rare case, while this 
>>>> patch
>>>> fixes real problem I guess
>>>>
>>>>
>>>>>
>>>>>>
>>>>>>> My doubt now is that we only do this when we set RCVLOWAT , should we 
>>>>>>> also
>>>>>>> do something when we consume bytes to avoid the optimization we have?
>>>>>>>
>>>>>>> Stefano
>>>>>>
>>>>>> Isn't this why we have credit request?
>>>>>
>>>>> Yep, but in practice we never use it. It would also consume 2 buffers,
>>>>> one at the transmitter and one at the receiver.
>>>>>
>>>>> However I agree that maybe we should start using it before we decide not
>>>>> to send any more data.
>>>>>
>>>>> To be compatible with older devices, though, I think for now we also
>>>>> need to send a credit update when the bytes in the receive queue are
>>>>> less than RCVLOWAT, as Arseniy proposed in the other series.
>>>>
>>>> Looks like (in theory of course), that credit request is considered to be
>>>> paired with credit update. While current usage of credit update is 
>>>> something
>>>> like ACK packet in TCP, e.g. telling peer that we are ready to receive more
>>>> data.
>>>
>>> I don't honestly know what the original author's choice was, but I think we 
>>> reduce latency this way.
>>
>> Ah I see,ok
>>
>>>
>>> Effectively though, if we never send any credit update when we consume 
>>> bytes and always leave it up to the transmitter to ask for an update before 
>>> transmission, we save even more buffer than the optimization we have, but 
>>> maybe the latency would grow a lot.
>>
>> I think:
>> 1) Way where sender must request current credit status before sending packet 
>> requires rework of kernel part, and for me this approach is not
>>    so clear than current simple implementation (send RW, reply with 
>> CREDIT_UPDATE if needed).
>> 2) In theory yes, we need one more buffer for such CREDIT_UPDATE, but in 
>> practice I don't know how big is this trouble.
>>
>> Thanks, Arseniy
> 
> I just worry that yes, normal users will only call RCVLOWAT once,
> but a bad user might call it many times causing a ton of
> credit updates. This is why I feel it's prudent to at least
> keep track of last credit update and if nothing changed
> do not resend it on a repeated RCVLOWAT.

I see, agree, I'll add this check in the next version!

Thanks, Arseniy

> 
> 
> 
> 
>>>
>>> Stefano
>>>
> 

Reply via email to