> -----Original Message-----
> From: Ilya Maximets [mailto:i.maxim...@samsung.com]
> Sent: Wednesday, September 12, 2018 4:05 PM
> To: Zhang, Qi Z <qi.z.zh...@intel.com>; dev@dpdk.org
> Cc: Lu, Wenzhuo <wenzhuo...@intel.com>; Ananyev, Konstantin
> <konstantin.anan...@intel.com>; Laurent Hardy
> <laurent.ha...@6wind.com>; Dai, Wei <wei....@intel.com>;
> sta...@dpdk.org
> Subject: Re: [dpdk-dev] [PATCH] net/ixgbe: fix busy polling while fiber link
> update
> 
> On 12.09.2018 09:49, Zhang, Qi Z wrote:
> >
> >
> >> -----Original Message-----
> >> From: Ilya Maximets [mailto:i.maxim...@samsung.com]
> >> Sent: Monday, September 10, 2018 11:09 PM
> >> To: Zhang, Qi Z <qi.z.zh...@intel.com>; dev@dpdk.org
> >> Cc: Lu, Wenzhuo <wenzhuo...@intel.com>; Ananyev, Konstantin
> >> <konstantin.anan...@intel.com>; Laurent Hardy
> >> <laurent.ha...@6wind.com>; Dai, Wei <wei....@intel.com>;
> >> sta...@dpdk.org
> >> Subject: Re: [dpdk-dev] [PATCH] net/ixgbe: fix busy polling while
> >> fiber link update
> >>
> >> On 04.09.2018 09:08, Zhang, Qi Z wrote:
> >>> Hi Ilya:
> >>>
> >>>> -----Original Message-----
> >>>> From: dev [mailto:dev-boun...@dpdk.org] On Behalf Of Ilya Maximets
> >>>> Sent: Friday, August 31, 2018 8:40 PM
> >>>> To: dev@dpdk.org
> >>>> Cc: Lu, Wenzhuo <wenzhuo...@intel.com>; Ananyev, Konstantin
> >>>> <konstantin.anan...@intel.com>; Laurent Hardy
> >>>> <laurent.ha...@6wind.com>; Dai, Wei <wei....@intel.com>; Ilya
> >>>> Maximets <i.maxim...@samsung.com>; sta...@dpdk.org
> >>>> Subject: [dpdk-dev] [PATCH] net/ixgbe: fix busy polling while fiber
> >>>> link update
> >>>>
> >>>> If the multispeed fiber link is in DOWN state, ixgbe_setup_link
> >>>> could take around a second of busy polling. This is highly
> >>>> inconvenient for the case where single thread periodically checks the 
> >>>> link
> statuses.
> >>>> For example, OVS main thread periodically updates the link statuses
> >>>> and hangs for a really long time busy waiting on ixgbe_setup_link()
> >>>> for a DOWN fiber ports. For case with 3 down ports it hangs for a 3
> >>>> seconds and unable to do anything including packet processing.
> >>>> Fix that by shifting that workaround to a separate thread by alarm
> >>>> handler that will try to set up link if it is DOWN.
> >>>
> >>> Does that mean we will block the interrupt thread for 3 seconds?
> >>
> >> Three times for one second. Other work could be scheduled between.
> >> IMHO, it's much better than blocking usual caller for 3 seconds.
> >>
> >>> Also, can we guarantee there will not be any race condition if we
> >>> call
> >> ixgbe_setup_link at another thread, the base code API is not assumed
> >> to be thread-safe as I know.
> >>
> >> The only user of 'ixgbe_setup_link' is 'ixgbe_dev_start', but it
> >> could be called only if device stopped. 'ixgbe_dev_stop' cancels the alarm.
> >> Race with 'link_update' avoided by 'IXGBE_FLAG_NEED_LINK_CONFIG' flag.
> >
> > I guess, it' not only about when ixgb_setup_link race with itself, but also
> when it race with other APIs.
> > Also the concern is, even in current version, we can prove there is no 
> > issue,
> how can we guarantee we are safe for future base code update? It's not
> designed as thread-safe.
> > For my option, the change is risky.
> 
> In current implementation interrupt handler already calls the
> 'ixgbe_dev_link_update' which subsequently calls 'ixgbe_setup_link'
> in our case if LSC interrupts enabled. So, my change makes the driver even
> safer by moving 'ixgbe_setup_link' to the same interrupt thread.
> Otherwise two threads (interrupts handler and the link status checking thread)
> could call 'ixgbe_setup_link' simultaneously.

Ok, you are right, seems the concern I have is already exist , your patch does 
not introduce new issue.
So I have no objection if this will fix some issue.
But let's check if any ixgbe experts will comment.

Regards
Qi

> 
> >
> > Btw, since ixgbe support LSC, it is not necessary for "single thread
> periodically checks the link statuses", right?
> 
> In current implementation it will take at least 5 seconds (4 + 1) for the 
> interrupt
> handler to detect DOWN link state for ixgbe multispeed fiber. This is too much
> for many real world cases.
> 
> >
> >>
> >>>
> >>> Regards
> >>> Qi
> >>>
> >>>>
> >>>> Fixes: c12d22f65b13 ("net/ixgbe: ensure link status is updated")
> >>>> CC: sta...@dpdk.org
> >>>>
> >>>> Signed-off-by: Ilya Maximets <i.maxim...@samsung.com>
> >>>> ---
> >>>>  drivers/net/ixgbe/ixgbe_ethdev.c | 43
> >>>> ++++++++++++++++++++++++--------
> >>>>  1 file changed, 32 insertions(+), 11 deletions(-)
> >>>>
> >>>> diff --git a/drivers/net/ixgbe/ixgbe_ethdev.c
> >>>> b/drivers/net/ixgbe/ixgbe_ethdev.c
> >>>> index 26b192737..a33b9a6e8 100644
> >>>> --- a/drivers/net/ixgbe/ixgbe_ethdev.c
> >>>> +++ b/drivers/net/ixgbe/ixgbe_ethdev.c
> >>>> @@ -221,6 +221,8 @@ static int ixgbe_dev_interrupt_action(struct
> >>>> rte_eth_dev *dev,
> >>>>                                        struct rte_intr_handle *handle);  
> >>>> static void
> >>>> ixgbe_dev_interrupt_handler(void *param);  static void
> >>>> ixgbe_dev_interrupt_delayed_handler(void *param);
> >>>> +static void ixgbe_dev_setup_link_alarm_handler(void *param);
> >>>> +
> >>>>  static int ixgbe_add_rar(struct rte_eth_dev *dev, struct
> >>>> ether_addr *mac_addr,
> >>>>                           uint32_t index, uint32_t pool);  static void
> >>>> ixgbe_remove_rar(struct rte_eth_dev *dev, uint32_t index); @@
> >>>> -2791,6 +2793,8 @@ ixgbe_dev_stop(struct rte_eth_dev *dev)
> >>>>
> >>>>          PMD_INIT_FUNC_TRACE();
> >>>>
> >>>> +        rte_eal_alarm_cancel(ixgbe_dev_setup_link_alarm_handler, dev);
> >>>> +
> >>>>          /* disable interrupts */
> >>>>          ixgbe_disable_intr(hw);
> >>>>
> >>>> @@ -3969,6 +3973,25 @@ ixgbevf_check_link(struct ixgbe_hw *hw,
> >>>> ixgbe_link_speed *speed,
> >>>>          return ret_val;
> >>>>  }
> >>>>
> >>>> +static void
> >>>> +ixgbe_dev_setup_link_alarm_handler(void *param) {
> >>>> +        struct rte_eth_dev *dev = (struct rte_eth_dev *)param;
> >>>> +        struct ixgbe_hw *hw =
> >>>> IXGBE_DEV_PRIVATE_TO_HW(dev->data->dev_private);
> >>>> +        struct ixgbe_interrupt *intr =
> >>>> +                IXGBE_DEV_PRIVATE_TO_INTR(dev->data->dev_private);
> >>>> +        u32 speed;
> >>>> +        bool autoneg = false;
> >>>> +
> >>>> +        speed = hw->phy.autoneg_advertised;
> >>>> +        if (!speed)
> >>>> +                ixgbe_get_link_capabilities(hw, &speed, &autoneg);
> >>>> +
> >>>> +        ixgbe_setup_link(hw, speed, true);
> >>>> +
> >>>> +        intr->flags &= ~IXGBE_FLAG_NEED_LINK_CONFIG; }
> >>>> +
> >>>>  /* return 0 means link status changed, -1 means not changed */
> >>>> int ixgbe_dev_link_update_share(struct rte_eth_dev *dev, @@ -3981,9
> >>>> +4004,7 @@ ixgbe_dev_link_update_share(struct rte_eth_dev *dev,
> >>>>                  IXGBE_DEV_PRIVATE_TO_INTR(dev->data->dev_private);
> >>>>          int link_up;
> >>>>          int diag;
> >>>> -        u32 speed = 0;
> >>>>          int wait = 1;
> >>>> -        bool autoneg = false;
> >>>>
> >>>>          memset(&link, 0, sizeof(link));
> >>>>          link.link_status = ETH_LINK_DOWN; @@ -3993,13 +4014,8 @@
> >>>> ixgbe_dev_link_update_share(struct
> >> rte_eth_dev
> >>>> *dev,
> >>>>
> >>>>          hw->mac.get_link_status = true;
> >>>>
> >>>> -        if ((intr->flags & IXGBE_FLAG_NEED_LINK_CONFIG) &&
> >>>> -                ixgbe_get_media_type(hw) == ixgbe_media_type_fiber) {
> >>>> -                speed = hw->phy.autoneg_advertised;
> >>>> -                if (!speed)
> >>>> -                        ixgbe_get_link_capabilities(hw, &speed, 
> >>>> &autoneg);
> >>>> -                ixgbe_setup_link(hw, speed, true);
> >>>> -        }
> >>>> +        if (intr->flags & IXGBE_FLAG_NEED_LINK_CONFIG)
> >>>> +                return rte_eth_linkstatus_set(dev, &link);
> >>>>
> >>>>          /* check if it needs to wait to complete, if lsc interrupt is 
> >>>> enabled */
> >>>>          if (wait_to_complete == 0 || dev->data->dev_conf.intr_conf.lsc 
> >>>> !=
> >>>> 0) @@
> >>>> -4017,11 +4033,14 @@ ixgbe_dev_link_update_share(struct rte_eth_dev
> >> *dev,
> >>>>          }
> >>>>
> >>>>          if (link_up == 0) {
> >>>> -                intr->flags |= IXGBE_FLAG_NEED_LINK_CONFIG;
> >>>> +                if (ixgbe_get_media_type(hw) == ixgbe_media_type_fiber) 
> >>>> {
> >>>> +                        intr->flags |= IXGBE_FLAG_NEED_LINK_CONFIG;
> >>>> +                        rte_eal_alarm_set(10,
> >>>> +                                ixgbe_dev_setup_link_alarm_handler, 
> >>>> dev);
> >>>> +                }
> >>>>                  return rte_eth_linkstatus_set(dev, &link);
> >>>>          }
> >>>>
> >>>> -        intr->flags &= ~IXGBE_FLAG_NEED_LINK_CONFIG;
> >>>>          link.link_status = ETH_LINK_UP;
> >>>>          link.link_duplex = ETH_LINK_FULL_DUPLEX;
> >>>>
> >>>> @@ -5128,6 +5147,8 @@ ixgbevf_dev_stop(struct rte_eth_dev *dev)
> >>>>
> >>>>          PMD_INIT_FUNC_TRACE();
> >>>>
> >>>> +        rte_eal_alarm_cancel(ixgbe_dev_setup_link_alarm_handler, dev);
> >>>> +
> >>>>          ixgbevf_intr_disable(dev);
> >>>>
> >>>>          hw->adapter_stopped = 1;
> >>>> --
> >>>> 2.17.1
> >>>

Reply via email to