OK. Will send v4 later. Thanks.

> -----Original Message-----
> From: Zhang, Qi Z
> Sent: Wednesday, September 5, 2018 20:22
> To: Li, Xiaoyun <xiaoyun...@intel.com>; Xing, Beilei <beilei.x...@intel.com>
> Cc: dev@dpdk.org; Yang, Zhiyong <zhiyong.y...@intel.com>; Richardson,
> Bruce <bruce.richard...@intel.com>; Hunt, David <david.h...@intel.com>
> Subject: RE: [PATCH v3] net/i40e: add interface to choose latest vector path
> 
> Hi Xiaoyun:
> 
> > -----Original Message-----
> > From: Li, Xiaoyun
> > Sent: Tuesday, September 4, 2018 7:40 PM
> > To: Xing, Beilei <beilei.x...@intel.com>; Zhang, Qi Z
> > <qi.z.zh...@intel.com>
> > Cc: dev@dpdk.org; Yang, Zhiyong <zhiyong.y...@intel.com>; Richardson,
> > Bruce <bruce.richard...@intel.com>; Hunt, David
> > <david.h...@intel.com>; Li, Xiaoyun <xiaoyun...@intel.com>
> > Subject: [PATCH v3] net/i40e: add interface to choose latest vector
> > path
> >
> > Right now, the vector path is limited to use only on later platforms.
> > This patch adds a devarg, enable-latest-vec, that lets users select
> > the latest vector path the platform supports. For example, it makes
> > using the AVX2 vector path on Broadwell possible.
> >
> > Signed-off-by: Xiaoyun Li <xiaoyun...@intel.com>
> > ---
> > v3:
> >  * Polish the doc and commit log.
> > v2:
> >  * Correct the calling of the wrong function last time.
> >  * Fix seg fault bug.
> >
> >  doc/guides/nics/i40e.rst               |  8 ++++++
> >  doc/guides/rel_notes/release_18_11.rst |  4 +++
> >  drivers/net/i40e/i40e_ethdev.c         | 38 ++++++++++++++++++++++++++
> >  drivers/net/i40e/i40e_ethdev.h         |  1 +
> >  drivers/net/i40e/i40e_rxtx.c           | 27 ++++++++++++++++++
> >  5 files changed, 78 insertions(+)
> >
> > diff --git a/doc/guides/nics/i40e.rst b/doc/guides/nics/i40e.rst
> > index 65d87f869..6158e7c34 100644
> > --- a/doc/guides/nics/i40e.rst
> > +++ b/doc/guides/nics/i40e.rst
> > @@ -163,6 +163,14 @@ Runtime Config Options
> >    Currently hot-plugging of representor ports is not supported so all required
> >    representors must be specified on the creation of the PF.
> >
> > +- ``Enable latest vector`` (default ``disable``)
> > +
> > +  Vector path was limited to use only on later platform. But users may want the
> > +  latest vector path. For example, VPP users may want to use
> > +  AVX2 vector path on HSW/BDW because it can get better perf. So
> > +  ``devargs`` parameter ``enable-latest-vec`` is introduced, for example::
> > +    -w 84:00.0,enable-latest-vec=1
> 
> 
> How about "use_latest_vec" or "use-latest-vpmd"?
> 
> > +
> >  Driver compilation and testing
> >  ------------------------------
> >
> > diff --git a/doc/guides/rel_notes/release_18_11.rst b/doc/guides/rel_notes/release_18_11.rst
> > index 3ae6b3f58..f8b0f3189 100644
> > --- a/doc/guides/rel_notes/release_18_11.rst
> > +++ b/doc/guides/rel_notes/release_18_11.rst
> > @@ -54,6 +54,10 @@ New Features
> >       Also, make sure to start the actual text at the margin.
> >       =========================================================
> >
> > +* **Added a devarg to enable the latest vector path.**
> > +  A new devarg ``enable-latest-vec`` was introduced to allow users to choose
> > +  the latest vector path that the platform supported. For example, VPP users
> > +  can use AVX2 vector path on BDW/HSW to get better performance.
> >
> >  API Changes
> >  -----------
> > diff --git a/drivers/net/i40e/i40e_ethdev.c b/drivers/net/i40e/i40e_ethdev.c
> > index 85a6a867f..16b5345fb 100644
> > --- a/drivers/net/i40e/i40e_ethdev.c
> > +++ b/drivers/net/i40e/i40e_ethdev.c
> > @@ -12513,6 +12513,44 @@ i40e_config_rss_filter(struct i40e_pf *pf,
> >     return 0;
> >  }
> >
> > +#define ETH_I40E_ENABLE_LATEST_VEC "enable-latest-vec"
> 
> This should be defined alongside the other existing devargs; see
> ETH_I40E_SUPPORT_MULTI_DRIVER for reference. It should also be
> registered with RTE_PMD_REGISTER_PARAM_STRING.
> 
> > +
> > +bool
> > +i40e_parse_latest_vec(struct rte_eth_dev *dev)
> > +{
> > +   static const char *const valid_keys[] = {
> > +           ETH_I40E_ENABLE_LATEST_VEC, NULL};
> > +   int enable_latest_vec;
> > +   struct rte_kvargs *kvlist;
> > +
> > +   if (!dev->device->devargs)
> > +           return 0;
> > +
> > +   kvlist = rte_kvargs_parse(dev->device->devargs->args, valid_keys);
> > +   if (!kvlist)
> > +           return -EINVAL;
> > +
> > +   if (!rte_kvargs_count(kvlist, ETH_I40E_ENABLE_LATEST_VEC))
> > +           return 0;
> > +
> > +   if (rte_kvargs_count(kvlist, ETH_I40E_ENABLE_LATEST_VEC) > 1)
> > +           PMD_DRV_LOG(WARNING, "More than one argument \"%s\" and only "
> > +                       "the first one is used !",
> > +                       ETH_I40E_ENABLE_LATEST_VEC);
> > +
> > +   enable_latest_vec = atoi((&kvlist->pairs[0])->value);
> > +
> > +   rte_kvargs_free(kvlist);
> > +
> > +   if (enable_latest_vec != 0 && enable_latest_vec != 1)
> > +           PMD_DRV_LOG(WARNING, "Value should be 0 or 1, set it as 1!");
> > +
> > +   if (enable_latest_vec)
> > +           return true;
> > +   else
> > +           return false;
> > +}
> 
> We currently call rte_kvargs_parse in a different place for each parameter,
> which is unnecessary. It would be better to merge these calls into one
> parse_devargs function called at dev_init, so that all corresponding fields
> of i40e_adapter are configured in the same place.
> That is outside this patch's scope, but it would still be better to
> introduce a field like i40e_adapter->use_latest_vec and have
> i40e_parse_latest_vec just assign the value, which can be used later.
> This will make future code cleanup easier, and it also avoids calling
> i40e_parse_latest_vec multiple times in set_rx/tx_function.
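
A rough sketch of what "parse once, cache in the adapter" could look like.
It deliberately avoids the real rte_kvargs API so that it builds on its own:
struct sketch_adapter and sketch_parse_devargs() are hypothetical stand-ins
for i40e_adapter and a merged parse_devargs function, and the key name is
the one from this v3 patch, which may still change.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-in for the relevant part of struct i40e_adapter. */
struct sketch_adapter {
	bool use_latest_vec;
};

/* Key name taken from the v3 patch; an assumption until the name is settled. */
#define SKETCH_ENABLE_LATEST_VEC "enable-latest-vec"

/*
 * Walk a comma-separated "key=value" devargs string once and record the
 * result in the adapter. Returns 0 on success (including "key absent"),
 * -1 if the value is anything other than "0" or "1".
 */
static int
sketch_parse_devargs(struct sketch_adapter *ad, const char *args)
{
	const size_t klen = strlen(SKETCH_ENABLE_LATEST_VEC);
	const char *p = args;

	assert(ad != NULL);
	ad->use_latest_vec = false;	/* default: disabled */

	while (p != NULL && *p != '\0') {
		if (strncmp(p, SKETCH_ENABLE_LATEST_VEC, klen) == 0 &&
		    p[klen] == '=') {
			const char *v = p + klen + 1;

			/* Accept exactly "0" or "1", ended by ',' or NUL. */
			if ((v[0] != '0' && v[0] != '1') ||
			    (v[1] != ',' && v[1] != '\0'))
				return -1;
			ad->use_latest_vec = (v[0] == '1');
			return 0;	/* the first occurrence wins */
		}
		/* Skip to the start of the next key=value pair, if any. */
		p = strchr(p, ',');
		if (p != NULL)
			p++;
	}
	return 0;
}
```

With this shape, set_rx/tx_function would only read ad->use_latest_vec
instead of re-parsing the devargs each time, and an invalid value is
rejected rather than silently treated as 1.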
> 
> > +
> >  RTE_INIT(i40e_init_log)
> >  {
> >     i40e_logtype_init = rte_log_register("pmd.net.i40e.init");
> > diff --git a/drivers/net/i40e/i40e_ethdev.h b/drivers/net/i40e/i40e_ethdev.h
> > index 3fffe5a55..cdf68cd93 100644
> > --- a/drivers/net/i40e/i40e_ethdev.h
> > +++ b/drivers/net/i40e/i40e_ethdev.h
> > @@ -1243,6 +1243,7 @@ int i40e_config_rss_filter(struct i40e_pf *pf,
> >             struct i40e_rte_flow_rss_conf *conf, bool add);
> >  int i40e_vf_representor_init(struct rte_eth_dev *ethdev, void *init_params);
> >  int i40e_vf_representor_uninit(struct rte_eth_dev *ethdev);
> > +bool i40e_parse_latest_vec(struct rte_eth_dev *dev);
> >
> >  #define I40E_DEV_TO_PCI(eth_dev) \
> >     RTE_DEV_TO_PCI((eth_dev)->device)
> > diff --git a/drivers/net/i40e/i40e_rxtx.c b/drivers/net/i40e/i40e_rxtx.c
> > index 2a28ee348..75f8ec284 100644
> > --- a/drivers/net/i40e/i40e_rxtx.c
> > +++ b/drivers/net/i40e/i40e_rxtx.c
> > @@ -2960,6 +2960,15 @@ i40e_set_rx_function(struct rte_eth_dev *dev)
> >                     if (rte_cpu_get_flag_enabled(RTE_CPUFLAG_AVX512F))
> >                             dev->rx_pkt_burst =
> >                                     i40e_recv_scattered_pkts_vec_avx2;
> > +                   /*
> > +                    * Give users chance to use the latest vector path
> > +                    * that the platform supported.
> > +                    */
> > +                   if (i40e_parse_latest_vec(dev)) {
> > +                           if (rte_cpu_get_flag_enabled(RTE_CPUFLAG_AVX2))
> > +                                   dev->rx_pkt_burst =
> > +                                   i40e_recv_scattered_pkts_vec_avx2;
> > +                   }
> 
> 
> How about:
> 
>                       if (adapter->use_latest_vec)
>                               dev->rx_pkt_burst = get_latest_rx_vec();
>                       else
>                               dev->rx_pkt_burst = get_recommended_rx_vec();
> 
> This makes the code easier to understand, and if we add an AVX512 vPMD in
> the future, the code will be easy to modify.
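
One possible shape for those two helpers, as a self-contained sketch rather
than driver code. Everything prefixed sketch_ is hypothetical: the enum
stands in for the burst function pointers, struct sketch_cpu stands in for
rte_cpu_get_flag_enabled(), and "default" means whatever vector path is
selected before the AVX2 check in the existing code.

```c
#include <stdbool.h>

/* Stand-in for the burst functions the driver would assign. */
enum sketch_rx_path {
	SKETCH_RX_VEC_DEFAULT,	/* vector path chosen before the AVX2 check */
	SKETCH_RX_VEC_AVX2	/* e.g. i40e_recv_pkts_vec_avx2 */
};

/* Stand-in for the CPU flag queries. */
struct sketch_cpu {
	bool avx2;
	bool avx512f;
};

/* Existing behaviour in the diff: AVX2 path only when AVX512F is present. */
static enum sketch_rx_path
sketch_get_recommended_rx_vec(const struct sketch_cpu *cpu)
{
	return cpu->avx512f ? SKETCH_RX_VEC_AVX2 : SKETCH_RX_VEC_DEFAULT;
}

/* Opt-in behaviour: newest path the CPU supports, so AVX2 is enough. */
static enum sketch_rx_path
sketch_get_latest_rx_vec(const struct sketch_cpu *cpu)
{
	return cpu->avx2 ? SKETCH_RX_VEC_AVX2 : SKETCH_RX_VEC_DEFAULT;
}

/* The if/else from the review comment, with the cached devarg as input. */
static enum sketch_rx_path
sketch_select_rx_vec(const struct sketch_cpu *cpu, bool use_latest_vec)
{
	return use_latest_vec ? sketch_get_latest_rx_vec(cpu)
			      : sketch_get_recommended_rx_vec(cpu);
}
```

If an AVX512 path is added later, only sketch_get_latest_rx_vec() (and
possibly the recommended variant) would need a new branch; the callers in
set_rx/tx_function would stay unchanged.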
> 
> 
> >  #endif
> >             } else {
> >                     PMD_INIT_LOG(DEBUG, "Using a Scattered with bulk "
> > @@ -2989,6 +2998,15 @@ i40e_set_rx_function(struct rte_eth_dev *dev)
> >              */
> >             if (rte_cpu_get_flag_enabled(RTE_CPUFLAG_AVX512F))
> >                     dev->rx_pkt_burst = i40e_recv_pkts_vec_avx2;
> > +           /*
> > +            * Give users chance to use the latest vector path
> > +            * that the platform supported.
> > +            */
> > +           if (i40e_parse_latest_vec(dev)) {
> > +                   if (rte_cpu_get_flag_enabled(RTE_CPUFLAG_AVX2))
> > +                           dev->rx_pkt_burst =
> > +                                   i40e_recv_pkts_vec_avx2;
> > +           }
> 
> Same as above
> >  #endif
> >     } else if (ad->rx_bulk_alloc_allowed) {
> >             PMD_INIT_LOG(DEBUG, "Rx Burst Bulk Alloc Preconditions are "
> > @@ -3083,6 +3101,15 @@ i40e_set_tx_function(struct rte_eth_dev *dev)
> >                      */
> >                     if (rte_cpu_get_flag_enabled(RTE_CPUFLAG_AVX512F))
> >                             dev->tx_pkt_burst = i40e_xmit_pkts_vec_avx2;
> > +                   /*
> > +                    * Give users chance to use the latest vector path
> > +                    * that the platform supported.
> > +                    */
> > +                   if (i40e_parse_latest_vec(dev)) {
> > +                           if (rte_cpu_get_flag_enabled(RTE_CPUFLAG_AVX2))
> > +                                   dev->tx_pkt_burst =
> > +                                   i40e_xmit_pkts_vec_avx2;
> > +                   }
> 
> Same as above
> 
> >  #endif
> >             } else {
> >                     PMD_INIT_LOG(DEBUG, "Simple tx finally be used.");
> > --
> > 2.17.1
> 
> Regards
> Qi
