Yes atomic load is perfect.
rte_rmb is worst :)

19/11/2024 09:52, 王颢:
> Dear Thomas,
> 
> How about the following modification? Or could I just change rte_smp_rmb to 
> rte_rmb instead?
> 
> struct rtl_tx_desc {
>       RTE_ATOMIC(u32) opts1;
>       u32 opts2;
>       u64 addr;
>       u32 reserved0;
>       u32 reserved1;
>       u32 reserved2;
>       u32 reserved3;
> };
> 
> static u32
> rtl_get_opts1(struct rtl_tx_desc *txd)
> {
>     u32 opts1 = rte_atomic_load_explicit (&txd->opts1, 
> rte_memory_order_acquire);
> 
>       return rte_le_to_cpu_32(opts1);
> }
> 
> 
> Best Wishes,
> Howard Wang
> 
> -----邮件原件-----
> 发件人: Thomas Monjalon <tho...@monjalon.net> 
> 发送时间: 2024年11月19日 15:16
> 收件人: pro_nic_d...@realtek.com; 王颢 <howard_w...@realsil.com.cn>
> 抄送: dev@dpdk.org; Ferruh Yigit <ferruh.yi...@amd.com>
> 主题: Re: 答复: [PATCH v8 12/17] net/r8169: implement Tx path
> 
> 
> External mail.
> 
> 
> 
> The change below is replacing 1 rte_smp_rmb with 2 calls, so no it is not 
> what I am asking for.
> Please could you check how to not calling this function at all?
> 
> This series is already merged in the main branch, so any new change should be 
> submitted as a new patch.
> 
> Thank you
> 
> 
> 19/11/2024 07:29, 王颢:
> > Dear Thomas,
> >
> > After our discussion, we concluded that we can make the following changes. 
> > What do you think? Additionally, I may not have been very clear in my last 
> > email. Should I submit the entire series of changes as PATCH v9, or should 
> > I respond with [PATCH v9 12/17] net/r8169: implement Tx path?
> >
> > static u32
> > rtl_get_opts1(struct rtl_tx_desc *txd) {
> > -     rte_smp_rmb();
> >
> >       return rte_le_to_cpu_32(txd->opts1); }
> >
> > static void
> > rtl_tx_clean(struct rtl_hw *hw, struct rtl_tx_queue *txq) {
> >               ...
> >               tx_left = (rte_atomic_load_explicit(&txq->tx_tail,
> >                                                   rte_memory_order_relaxed) 
> > % nb_tx_desc) - head;
> >       }
> >
> > +     rte_smp_rmb();
> >
> >       while (tx_left > 0) {
> >               txd = &txq->hw_ring[head];
> >
> >               if (!enable_tx_no_close && (rtl_get_opts1(txd) & DescOwn))
> >                       break;
> >               ...
> > }
> >
> >
> > int
> > rtl_tx_done_cleanup(void *tx_queue, uint32_t free_cnt) {
> >               ...
> >               tx_left = (rte_atomic_load_explicit(&txq->tx_tail,
> >                                                   rte_memory_order_relaxed) 
> > % nb_tx_desc) - head;
> >       }
> >
> > +     rte_smp_rmb();
> >
> >       while (tx_left > 0) {
> >               txd = &txq->hw_ring[head];
> >
> >               status = rtl_get_opts1(txd);
> >
> >               if (!enable_tx_no_close && (status & DescOwn))
> >                       break;
> >
> >               ...
> > }
> >
> > Best Regards,
> > Howard Wang
> >
> > -----邮件原件-----
> > 发件人: Thomas Monjalon <tho...@monjalon.net>
> > 发送时间: 2024年11月18日 22:59
> > 收件人: pro_nic_d...@realtek.com; 王颢 <howard_w...@realsil.com.cn>
> > 抄送: dev@dpdk.org; Ferruh Yigit <ferruh.yi...@amd.com>
> > 主题: Re: [PATCH v8 12/17] net/r8169: implement Tx path
> >
> >
> > External mail.
> >
> >
> >
> > Hello,
> >
> > 13/11/2024 10:28, Howard Wang:
> > > +static u32
> > > +rtl_get_opts1(struct rtl_tx_desc *txd) {
> > > +       rte_smp_rmb();
> > > +
> > > +       return rte_le_to_cpu_32(txd->opts1); }
> >
> > We should avoid using such heavy memory barrier.
> > Is it possible to use a lighter fence or atomic API?
> >
> >
> >
> 
> 
> 
> 
> 
> 





Reply via email to