> -----Original Message-----
> From: Andrew Rybchenko <andrew.rybche...@oktetlabs.ru>
> Sent: Monday, October 3, 2022 22:21
> To: Guo, Junfeng <junfeng....@intel.com>; Zhang, Qi Z
> <qi.z.zh...@intel.com>; Wu, Jingjing <jingjing...@intel.com>; Xing,
> Beilei <beilei.x...@intel.com>
> Cc: dev@dpdk.org; Wang, Xiao W <xiao.w.w...@intel.com>; Wu,
> Wenjun1 <wenjun1...@intel.com>; Thomas Monjalon
> <tho...@monjalon.net>
> Subject: Re: [PATCH v2 13/14] net/idpf: add AVX512 data path for single
> queue model
> 
> On 9/5/22 13:58, Junfeng Guo wrote:
> > Add support of AVX512 vector data path for single queue model.
> >
> > Signed-off-by: Wenjun Wu <wenjun1...@intel.com>
> > Signed-off-by: Junfeng Guo <junfeng....@intel.com>
> 
> [snip]
> 
> > +static __rte_always_inline void
> > +idpf_singleq_rearm(struct idpf_rx_queue *rxq)
> > +{
> 
> [snip]
> 
> > +   const __m512i iova_offsets =  _mm512_set1_epi64(offsetof
> > +                                                   (struct rte_mbuf,
> buf_iova));
> > +   const __m512i headroom =
> _mm512_set1_epi64(RTE_PKTMBUF_HEADROOM);
> > +
> > +#ifndef RTE_LIBRTE_IDPF_16BYTE_RX_DESC
> 
> As far as I remember, support for such build time options is
> deprecated in DPDK and new options are not accepted.
> Cc Thomas to correct me if I'm wrong.

Yes, we will also remove this MACRO in the coming version. Thanks!

> 
> > +   /* to shuffle the addresses to correct slots. Values 4-7 will
> contain
> > +    * zeros, so use 7 for a zero-value.
> > +    */
> > +   const __m512i permute_idx = _mm512_set_epi64(7, 7, 3, 1, 7, 7,
> 2, 0);
> > +#else
> > +   const __m512i permute_idx = _mm512_set_epi64(7, 3, 6, 2, 5, 1,
> 4, 0);
> > +#endif
> 
> [snip]
> 
> > +#ifdef IDPF_RX_PTYPE_OFFLOAD

This one is only a local MACRO defined in this file. So it may have no impact.
Thanks!

> 
> Same here.
> 
> > +           /**
> > +            * to get packet types, shift 64-bit values down 30 bits
> > +            * and so ptype is in lower 8-bits in each
> > +            */
> > +           const __m512i ptypes4_7 = _mm512_srli_epi64(desc4_7,
> 16);
> > +           const __m256i ptypes6_7 =
> _mm512_extracti64x4_epi64(ptypes4_7, 1);
> > +           const __m256i ptypes4_5 =
> _mm512_extracti64x4_epi64(ptypes4_7, 0);
> > +           const uint8_t ptype7 = _mm256_extract_epi8(ptypes6_7,
> 16);
> > +           const uint8_t ptype6 = _mm256_extract_epi8(ptypes6_7,
> 0);
> > +           const uint8_t ptype5 = _mm256_extract_epi8(ptypes4_5,
> 16);
> > +           const uint8_t ptype4 = _mm256_extract_epi8(ptypes4_5,
> 0);
> > +
> > +           const __m512i ptype4_7 = _mm512_set_epi32
> > +                   (0, 0, 0, type_table[ptype7],
> > +                    0, 0, 0, type_table[ptype6],
> > +                    0, 0, 0, type_table[ptype5],
> > +                    0, 0, 0, type_table[ptype4]);
> > +           mb4_7 = _mm512_mask_blend_epi32(0x1111, mb4_7,
> ptype4_7);
> > +#endif
> 
> [snip]

Reply via email to