> -----Original Message----- > From: Joyce Kong <joyce.k...@arm.com> > Sent: Tuesday, January 26, 2021 5:58 PM > To: maxime.coque...@redhat.com; david.march...@redhat.com; > i.maxim...@ovn.org; Ruifeng Wang <ruifeng.w...@arm.com>; Honnappa > Nagarahalli <honnappa.nagaraha...@arm.com> > Cc: dev@dpdk.org; nd <n...@arm.com>; Subhi Masri <sub...@nvidia.com> > Subject: [PATCH v2] net/virtio: fix compiling issue for vectorized NEON path > > In file included from ../drivers/net/virtio/virtio_rxtx_packed.c:22:0: > ../drivers/net/virtio/virtio_rxtx_packed_neon.h: In function > ‘virtqueue_enqueue_batch_packed_vec’: > ../drivers/net/virtio/virtio_rxtx_packed_neon.h:74:2: warning: > implicit declaration of function ‘vreinterpretq_p128_u32’ > [-Wimplicit-function-declaration] > poly128_t cmp1 = vreinterpretq_p128_u32(~vceqq_u32(ref_seg, > def_ref_seg)); ^ > > The message shows ‘vreinterpretq_p128_u32’ intrinsic is not supported > because an old version gcc (gcc 4.8.5) is used. So fix the issue with > implementing the logic with other intrinsics. > > Bugzilla ID: 621 > Fixes: 530887469350 ("net/virtio: add vectorized packed ring NEON Tx") > Fixes: 5971ce5e2a59 ("net/virtio: add vectorized packed ring NEON Rx") > > Reported-by: Subhi Masri <sub...@nvidia.com> > Signed-off-by: Joyce Kong <joyce.k...@arm.com> > --- > drivers/net/virtio/virtio_rxtx_packed_neon.h | 8 ++++---- > 1 file changed, 4 insertions(+), 4 deletions(-) > > diff --git a/drivers/net/virtio/virtio_rxtx_packed_neon.h > b/drivers/net/virtio/virtio_rxtx_packed_neon.h > index 01c77b712..00dd04277 100644 > --- a/drivers/net/virtio/virtio_rxtx_packed_neon.h > +++ b/drivers/net/virtio/virtio_rxtx_packed_neon.h > @@ -71,8 +71,8 @@ virtqueue_enqueue_batch_packed_vec(struct > virtnet_tx *txvq, > uint32x4_t def_ref_seg = vdupq_n_u32(0x10001); > /* Check refcnt and nb_segs. */ > uint32x4_t ref_seg = vreinterpretq_u32_u8(vqtbl2q_u8(mbuf, > ref_seg_msk)); > - poly128_t cmp1 = vreinterpretq_p128_u32(~vceqq_u32(ref_seg, > def_ref_seg)); > - if (unlikely(cmp1)) > + uint64x2_t cmp1 = vreinterpretq_u64_u32(~vceqq_u32(ref_seg, > def_ref_seg)); > + if (unlikely(vgetq_lane_u64(cmp1, 0) || vgetq_lane_u64(cmp1, 1))) > return -1; > > /* Check headroom is enough. */ > @@ -225,10 +225,10 @@ virtqueue_dequeue_batch_packed_vec(struct > virtnet_rx *rxvq, > if (vq->vq_packed.used_wrap_counter) > v_used_flag = vdupq_n_u32(PACKED_FLAGS_MASK); > > - poly128_t desc_stats = vreinterpretq_p128_u32(~vceqq_u32(v_flag, > v_used_flag)); > + uint64x2_t desc_stats = vreinterpretq_u64_u32(~vceqq_u32(v_flag, > +v_used_flag)); > > /* Check all descs are used. */ > - if (desc_stats) > + if (unlikely(vgetq_lane_u64(desc_stats, 0) || > +vgetq_lane_u64(desc_stats, 1))) > return -1; > > /* Load 2 mbuf pointers per time. */ > -- > 2.30.0
Reviewed-by: Ruifeng Wang <ruifeng.w...@arm.com>