> -----Original Message----- > From: Willem de Bruijn [mailto:willemdebruijn.ker...@gmail.com] > Sent: Wednesday, December 9, 2020 10:43 PM > To: wangyunjian <wangyunj...@huawei.com> > Cc: Michael S. Tsirkin <m...@redhat.com>; Jason Wang > <jasow...@redhat.com>; virtualizat...@lists.linux-foundation.org; Network > Development <netdev@vger.kernel.org>; Lilijun (Jerry) > <jerry.lili...@huawei.com>; chenchanghu <chenchan...@huawei.com>; > xudingke <xudin...@huawei.com> > Subject: Re: [PATCH net v2] tun: fix ubuf refcount incorrectly on error path > > On Wed, Dec 9, 2020 at 8:03 AM wangyunjian <wangyunj...@huawei.com> > wrote: > > > > From: Yunjian Wang <wangyunj...@huawei.com> > > > > After setting callback for ubuf_info of skb, the callback > > (vhost_net_zerocopy_callback) will be called to decrease the refcount > > when freeing skb. But when an exception occurs > > With exception, you mean if tun_get_user returns an error that propagates to > the sendmsg call in vhost handle_tx, correct?
Yes > > > afterwards, the error handling in vhost handle_tx() will try to > > decrease the same refcount again. This is wrong and fix this by delay > > copying ubuf_info until we're sure there's no errors. > > I think the right approach is to address this in the error paths, rather than > complicate the normal datapath. > > Is it sufficient to suppress the call to vhost_net_ubuf_put in the handle_tx > sendmsg error path, given that vhost_zerocopy_callback will be called on > kfree_skb? We can not call kfree_skb() until the skb was created. > > Or alternatively clear the destructor in drop: The uarg->callback() is called immediately after we decide do datacopy even if caller want to do zerocopy. If another error occurs later, the vhost handle_tx() will try to decrease it again. Thanks > > > > > Fixes: 4477138fa0ae ("tun: properly test for IFF_UP") > > Fixes: 90e33d459407 ("tun: enable napi_gro_frags() for TUN/TAP > > driver") > > > > Signed-off-by: Yunjian Wang <wangyunj...@huawei.com> > > --- > > v2: > > Updated code, fix by delay copying ubuf_info > > --- > > drivers/net/tun.c | 29 +++++++++++++++++++---------- > > 1 file changed, 19 insertions(+), 10 deletions(-) > > > > diff --git a/drivers/net/tun.c b/drivers/net/tun.c index > > 2dc1988a8973..2ea822328e73 100644 > > --- a/drivers/net/tun.c > > +++ b/drivers/net/tun.c > > @@ -1637,6 +1637,20 @@ static struct sk_buff *tun_build_skb(struct > tun_struct *tun, > > return NULL; > > } > > > > +/* copy ubuf_info for callback when skb has no error */ static inline > > +void tun_copy_ubuf_info(struct sk_buff *skb, bool zerocopy, void > > +*msg_control) { > > + if (zerocopy) { > > + skb_shinfo(skb)->destructor_arg = msg_control; > > + skb_shinfo(skb)->tx_flags |= SKBTX_DEV_ZEROCOPY; > > + skb_shinfo(skb)->tx_flags |= SKBTX_SHARED_FRAG; > > + } else if (msg_control) { > > + struct ubuf_info *uarg = msg_control; > > + > > + uarg->callback(uarg, false); > > + } > > +} > > + > > /* Get packet from user space buffer */ static ssize_t > > tun_get_user(struct tun_struct *tun, struct tun_file *tfile, > > void *msg_control, struct iov_iter *from, > > @@ -1812,16 +1826,6 @@ static ssize_t tun_get_user(struct tun_struct > *tun, struct tun_file *tfile, > > break; > > } > > > > - /* copy skb_ubuf_info for callback when skb has no error */ > > - if (zerocopy) { > > - skb_shinfo(skb)->destructor_arg = msg_control; > > - skb_shinfo(skb)->tx_flags |= SKBTX_DEV_ZEROCOPY; > > - skb_shinfo(skb)->tx_flags |= SKBTX_SHARED_FRAG; > > - } else if (msg_control) { > > - struct ubuf_info *uarg = msg_control; > > - uarg->callback(uarg, false); > > - } > > - > > skb_reset_network_header(skb); > > skb_probe_transport_header(skb); > > skb_record_rx_queue(skb, tfile->queue_index); @@ -1830,6 > > +1834,7 @@ static ssize_t tun_get_user(struct tun_struct *tun, struct > tun_file *tfile, > > struct bpf_prog *xdp_prog; > > int ret; > > > > + tun_copy_ubuf_info(skb, zerocopy, msg_control); > > local_bh_disable(); > > rcu_read_lock(); > > xdp_prog = rcu_dereference(tun->xdp_prog); @@ > -1881,6 > > +1886,7 @@ static ssize_t tun_get_user(struct tun_struct *tun, struct > tun_file *tfile, > > return -ENOMEM; > > } > > > > + tun_copy_ubuf_info(skb, zerocopy, msg_control); > > local_bh_disable(); > > napi_gro_frags(&tfile->napi); > > local_bh_enable(); > > @@ -1889,6 +1895,7 @@ static ssize_t tun_get_user(struct tun_struct *tun, > struct tun_file *tfile, > > struct sk_buff_head *queue = > &tfile->sk.sk_write_queue; > > int queue_len; > > > > + tun_copy_ubuf_info(skb, zerocopy, msg_control); > > spin_lock_bh(&queue->lock); > > __skb_queue_tail(queue, skb); > > queue_len = skb_queue_len(queue); @@ -1899,8 > +1906,10 > > @@ static ssize_t tun_get_user(struct tun_struct *tun, struct tun_file > > *tfile, > > > > local_bh_enable(); > > } else if (!IS_ENABLED(CONFIG_4KSTACKS)) { > > + tun_copy_ubuf_info(skb, zerocopy, msg_control); > > tun_rx_batched(tun, tfile, skb, more); > > } else { > > + tun_copy_ubuf_info(skb, zerocopy, msg_control); > > netif_rx_ni(skb); > > } > > rcu_read_unlock(); > > -- > > 2.23.0 > >