On 1/27/21 11:37 AM, Subhi Masri wrote:
> Hi,
>
>> -----Original Message-----
>> From: Maxime Coquelin <maxime.coque...@redhat.com>
>> Sent: Wednesday, January 27, 2021 10:44 AM
>> To: Subhi Masri <sub...@nvidia.com>
>> Cc: dev@dpdk.org; n...@arm.com; Joyce Kong <joyce.k...@arm.com>;
>> david.march...@redhat.com; honnappa.nagaraha...@arm.com;
>> ruifeng.w...@arm.com; i.maxim...@ovn.org
>> Subject: Re: [PATCH v2] net/virtio: fix compiling issue for vectorized NEON
>> path
>>
>> External email: Use caution opening links or attachments
>>
>>
>> Hi Subhi,
>>
>> On 1/26/21 10:57 AM, Joyce Kong wrote:
>>> In file included from ../drivers/net/virtio/virtio_rxtx_packed.c:22:0:
>>> ../drivers/net/virtio/virtio_rxtx_packed_neon.h: In function
>>> ‘virtqueue_enqueue_batch_packed_vec’:
>>> ../drivers/net/virtio/virtio_rxtx_packed_neon.h:74:2: warning:
>>> implicit declaration of function ‘vreinterpretq_p128_u32’
>>> [-Wimplicit-function-declaration]
>>> poly128_t cmp1 = vreinterpretq_p128_u32(~vceqq_u32(ref_seg,
>>> def_ref_seg)); ^
>>>
>>> The message shows ‘vreinterpretq_p128_u32’ intrinsic is not supported
>>> because an old version gcc (gcc 4.8.5) is used. So fix the issue with
>>> implementing the logic with other intrinsics.
>>>
>>> Bugzilla ID: 621
>>> Fixes: 530887469350 ("net/virtio: add vectorized packed ring NEON Tx")
>>> Fixes: 5971ce5e2a59 ("net/virtio: add vectorized packed ring NEON Rx")
>>>
>>> Reported-by: Subhi Masri <sub...@nvidia.com>
>>> Signed-off-by: Joyce Kong <joyce.k...@arm.com>
>>> ---
>>> drivers/net/virtio/virtio_rxtx_packed_neon.h | 8 ++++----
>>> 1 file changed, 4 insertions(+), 4 deletions(-)
>>>
>>> diff --git a/drivers/net/virtio/virtio_rxtx_packed_neon.h
>>> b/drivers/net/virtio/virtio_rxtx_packed_neon.h
>>> index 01c77b712..00dd04277 100644
>>> --- a/drivers/net/virtio/virtio_rxtx_packed_neon.h
>>> +++ b/drivers/net/virtio/virtio_rxtx_packed_neon.h
>>> @@ -71,8 +71,8 @@ virtqueue_enqueue_batch_packed_vec(struct
>> virtnet_tx *txvq,
>>> uint32x4_t def_ref_seg = vdupq_n_u32(0x10001);
>>> /* Check refcnt and nb_segs. */
>>> uint32x4_t ref_seg = vreinterpretq_u32_u8(vqtbl2q_u8(mbuf,
>> ref_seg_msk));
>>> - poly128_t cmp1 = vreinterpretq_p128_u32(~vceqq_u32(ref_seg,
>> def_ref_seg));
>>> - if (unlikely(cmp1))
>>> + uint64x2_t cmp1 = vreinterpretq_u64_u32(~vceqq_u32(ref_seg,
>> def_ref_seg));
>>> + if (unlikely(vgetq_lane_u64(cmp1, 0) || vgetq_lane_u64(cmp1,
>>> + 1)))
>>> return -1;
>>>
>>> /* Check headroom is enough. */
>>> @@ -225,10 +225,10 @@ virtqueue_dequeue_batch_packed_vec(struct
>> virtnet_rx *rxvq,
>>> if (vq->vq_packed.used_wrap_counter)
>>> v_used_flag = vdupq_n_u32(PACKED_FLAGS_MASK);
>>>
>>> - poly128_t desc_stats = vreinterpretq_p128_u32(~vceqq_u32(v_flag,
>> v_used_flag));
>>> + uint64x2_t desc_stats = vreinterpretq_u64_u32(~vceqq_u32(v_flag,
>>> + v_used_flag));
>>>
>>> /* Check all descs are used. */
>>> - if (desc_stats)
>>> + if (unlikely(vgetq_lane_u64(desc_stats, 0) ||
>>> + vgetq_lane_u64(desc_stats, 1)))
>>> return -1;
>>>
>>> /* Load 2 mbuf pointers per time. */
>>>
>>
>> Can you confirm the patch works for you?
>>
> Yes it does , you can add my tag.
> Tested-by: Subhi Masri <sub...@nvidia.com>
Thanks!
Maxime
>> Thanks,
>> Maxime
>
>
> Regards,
> Subhi Masri
>