29/09/2023 11:46, Ferruh Yigit:
> On 9/29/2023 10:21 AM, Christian Koue Muf wrote:
> > On 9/21/2023 4:05 PM, Ferruh Yigit wrote:
> >> On 9/20/2023 2:17 PM, Thomas Monjalon wrote:
> >>> Hello,
> >>>
> >>> 19/09/2023 11:06, Christian Koue Muf:
> >>>> On 9/18/23 10:34 AM, Ferruh Yigit wrote:
> >>>>> On 9/15/2023 7:37 PM, Morten Brørup wrote:
> >>>>>>> From: Ferruh Yigit [mailto:ferruh.yi...@amd.com]
> >>>>>>> Sent: Friday, 15 September 2023 17.55
> >>>>>>>
> >>>>>>> On 9/8/2023 5:07 PM, Mykola Kostenok wrote:
> >>>>>>>> From: Christian Koue Muf <c...@napatech.com>
> >>>>>>>>
> >>>>>>>> The NTNIC PMD does not rely on a kernel space Napatech driver,
> >>>>>>>> thus all defines related to the register layout is part of the
> >>>>>>>> PMD code, which will be added in later commits.
> >>>>>>>>
> >>>>>>>> Signed-off-by: Christian Koue Muf <c...@napatech.com>
> >>>>>>>> Reviewed-by: Mykola Kostenok <mko-...@napatech.com>
> >>>>>>>>
> >>>>>>>
> >>>>>>> Hi Mykola, Christiam,
> >>>>>>>
> >>>>>>> This PMD scares me, overall it is a big drop:
> >>>>>>> "249 files changed, 87128 insertions(+)"
> >>>>>>>
> >>>>>>> I think it is not possible to review all in one release cycle, and
> >>>>>>> it is not even possible to say if all code used or not.
> >>>>>>>
> >>>>>>> I can see code is already developed, and it is difficult to
> >>>>>>> restructure developed code, but restructure it into small pieces
> >>>>>>> really helps for reviews.
> >>>>>>>
> >>>>>>>
> >>>>>>> Driver supports good list of features, can it be possible to
> >>>>>>> distribute upstream effort into multiple release.
> >>>>>>> Starting from basic functionality and add features gradually.
> >>>>>>> Target for this release can be providing datapath, and add more if
> >>>>>>> we have time in the release, what do you think?
> >>>
> >>> I was expecting to get only Rx/Tx in this release, not really more.
> >>>
> >>> I agree it may be interesting to discuss some design and check whether
> >>> we need more features in ethdev as part of the driver upstreaming
> >>> process.
> >>>
> >>>
> >>>>>>> Also there are large amount of base code (HAL / FPGA code),
> >>>>>>> instead of adding them as a bulk, relevant ones with a feature can
> >>>>>>> be added with the feature patch, this eliminates dead code in the
> >>>>>>> base code layer, also helps user/review to understand the link
> >>>>>>> between driver code and base code.
> >>>
> >>> Yes it would be interesting to see what is really needed for the basic
> >>> initialization and what is linked to a specific offload or configuration 
> >>> feature.
> >>>
> >>> As a maintainer, I have to do some changes across all drivers
> >>> sometimes, and I use git blame a lot to understand why something was 
> >>> added.
> >>>
> >>>
> >>>>>> Jumping in here with an opinion about welcoming new NIC vendors to the 
> >>>>>> community:
> >>>>>>
> >>>>>> Generally, if a NIC vendor supplies a PMD for their NIC, I expect the 
> >>>>>> vendor to take responsibility for the quality of the PMD, including 
> >>>>>> providing a maintainer and support backporting of fixes to the PMD in 
> >>>>>> LTS releases. This should align with the vendor's business case for 
> >>>>>> upstreaming their driver.
> >>>>>>
> >>>>>> If the vendor provides one big patch series, which may be difficult to 
> >>>>>> understand/review, the fallout mainly hits the vendor's customers (and 
> >>>>>> thus the vendor's support organization), not the community as a whole.
> >>>>>>
> >>>>>
> >>>>> Hi Morten,
> >>>>>
> >>>>> I was thinking same before making my above comment, what happens if 
> >>>>> vendors submit as one big patch and when a problem occurs we can ask 
> >>>>> owner to fix. Probably this makes vendor happy and makes my life (or 
> >>>>> any other maintainer's life) easier, it is always easier to say yes.
> >>>>>
> >>>>>
> >>>>> But I come up with two main reasons to ask for a rework:
> >>>>>
> >>>>> 1- Technically any vendor can deliver their software to their
> >>>>> customers via a public git repository, they don't have to upstream
> >>>>> to
> >>>>> https://linkprotect.cudasvc.com/url?a=https%3a%2f%2fdpdk.org&c=E,1,N
> >>>>> poJejuuvPdOPfcFJYtsmkQF6PVrDjGsZ8x_gi5xDrTyZokK_nM11u4ZpzHgM10J9bOLl
> >>>>> nhoR6fFAzWtCzOhRCzVruYj520zZORv6-MjJeSC5TrGnIFL&typo=1,
> >>>>> but upstreaming has many benefits.
> >>>>>
> >>>>> One of those benefits is upstreaming provides a quality assurance for 
> >>>>> vendor's customers (that is why customer can be asking for this, as we 
> >>>>> are having in many cases), and this quality assurance comes from 
> >>>>> additional eyes reviewing the code and guiding vendors for the DPDK 
> >>>>> quality standards (some vendors already doing pretty good, but new ones 
> >>>>> sometimes requires hand-holding).
> >>>>>
> >>>>> If driver is one big patch series, it is practically not possible to 
> >>>>> review it, I can catch a few bits here or there, you may some others, 
> >>>>> but practically it will be merged without review, and we will fail on 
> >>>>> our quality assurance task.
> >>>>>
> >>>>> 2- Make code more accessible to the rest of the world.
> >>>>>
> >>>>> When it is a big patch, code can be functional but lots of details, 
> >>>>> reasoning, relation between components gets lost, which makes it even 
> >>>>> harder for an external developer, like me, to understand it (I am a 
> >>>>> mere guinea pig here :).
> >>>>>
> >>>>> If a customer would like to add a feature themselves, or fix something, 
> >>>>> even after vendor no more working on that product anymore, customer 
> >>>>> needs to understand the code or some reasoning in the code.
> >>>>> Or if someone wants to backport the driver to rust, or a DPDK developer 
> >>>>> wants to do a rework that requires updating all drivers, or a tester 
> >>>>> would like to analyze the code to figure out behavior difference of the 
> >>>>> devices. I think I have witness all above cases in real life.
> >>>>>
> >>>>> If driver is split into more patches, it makes patch easier to 
> >>>>> understand which makes code practically more accessible to other 
> >>>>> developers that are not expert in driver.
> >>>
> >>> I fully agree about the 2 reasons for upstreaming piece by piece.
> >>>
> >>>
> >>>>> Overall, yes splitting patch takes time and effort, and yes this is an 
> >>>>> overhead for a code that is already developed, but I think benefit is 
> >>>>> big so it worth doing the task.
> >>>
> >>> In the meantime, if some features are not yet upstreamed in a release,
> >>> a user can apply the missing patches from the mailing list to get the 
> >>> features.
> >>>
> >>>
> >>>>>> We, the community, should not make it too difficult for vendors trying 
> >>>>>> to upstream their drivers. I certainly consider it unreasonable to ask 
> >>>>>> a vendor to postpone the release of some existing features by 
> >>>>>> effectively an entire year (considering that only LTS releases are 
> >>>>>> relevant for most of us) because we want the vendor to refactor the 
> >>>>>> patch series to match our preferences within an unrealistic timeframe.
> >>>
> >>> You're right Morten, we try to be as welcoming as possible, but as
> >>> Ferruh said, we want to be able to understand how a driver is built,
> >>> even if not understanding all details.
> >>>
> >>> In Open Source, I think not only the code should be available, we must
> >>> also take care of explanations and documentation.
> >>>
> >>>
> >>>>> Agree to not make upstreaming difficult for new vendors, and indeed we 
> >>>>> are encouraging more vendors to be upstream their code, this is in best 
> >>>>> interest of both sides.
> >>>>>
> >>>>> Distributing upstreaming effort to a year was just a suggestion, it can 
> >>>>> go in earlier as it is becomes ready but I can see it will take time to 
> >>>>> split driver into features and upstream them.
> >>>
> >>> Driver features can be added until -rc2 (in one month).
> >>>
> >>>
> >>>>> As I am from a vendor too, I can understand the product/customer 
> >>>>> pressure, but I hope this approach can encourage vendors start 
> >>>>> upstreaming early or even better upstream as they develop the code.
> >>>>
> >>>> Hi Ferruh,
> >>>>
> >>>> First of all, thank you for starting the work to review our code.
> >>>>
> >>>> As Morten said Napatech plans to take all responsibility for the
> >>>> quality of the PMD source code. We expect to provide all fixes needed
> >>>> in the future. If for some reason Napatech stops maintaining the
> >>>> code, then we have been informed that the DPDK community might delete
> >>>> the PMD from the repository, and we understand that.
> >>>>
> >>>> In regards to splitting the code, I don't see this a good option.
> >>>> While I of cause agree it would be easier to review and understand,
> >>>> the code should also result in a meaningful product. Of the 87k lines
> >>>> of code, 53k lines is needed to start-up the FPGA to a state the it
> >>>> is ready to receive traffic. But at this point all packets would
> >>>> simply be discarded, and to be honest, there are better and cheaper
> >>>> options out there, if nothing more than basic functionality is
> >>>> needed. 34k lines are used to setup filters based on rte_flow. The
> >>>> thing is, that you need to initialize all modules in the FPGA TX- and
> >>>> RX-pipelines with valid data, even if you don't need the features those 
> >>>> modules provide.
> >>>> As a result, if you split up the 34k lines, then the product would
> >>>> not be functional. Of cause some of the top level logic could be
> >>>> split out, but at this point we are talking about splitting 87k lines
> >>>> into 80k and 7k, which I don't think is worth it.
> >>>
> >>> Actually I think it is worth.
> >>> There is a benefit in isolating the small basic init part from the
> >>> more complex features.
> >>>
> >>>
> >>>>>>> As far as I understand last patch opens a socket interface and an
> >>>>>>> external application can sent control commands via this interface.
> >>>>>>> I am not sure about this side control channel, what is missing in
> >>>>>>> the DPDK API? Can we try to address them in the DPDK layer instead
> >>>>>>> of a driver specific solution?
> >>>>>>
> >>>>>> That would be great.
> >>>>>>
> >>>>>> AFAIK, other vendors also has a bunch of out-of-band communication,
> >>>>>> e.g. magical EAL parameters to the MLX drivers. So let's not be too
> >>>>>> hard on the newcomers. ;-)
> >>>>>>
> >>>>>
> >>>>> I did some thinking for this one too,
> >>>>>
> >>>>> As we are in userspace, it is easy to have side control channel, and 
> >>>>> this can make users life easy, so this is a practical thing to do.
> >>>>> (Indeed there are already some ways to do this, without PMD exposing
> >>>>> a socket interface.)
> >>>>>
> >>>>> But this also reduces effort developers putting on DPDK layer solution, 
> >>>>> because it is always easier to add more support to the driver only.
> >>>>> And overall this reduces portability of the DPDK application, each
> >>>>> application becomes unique to a device (This is a bad thing, but I
> >>>>> also need some feedback how bad it is in real life.)
> >>>>>
> >>>>> To balance this, we said if a feature is too specific to a device, it 
> >>>>> can add device specific API and this is better than device specific 
> >>>>> features pollute the common, most used code. And push back to introduce 
> >>>>> more new PMD specific APIs unless it is really needed.
> >>>>>
> >>>>> But creating a socket interface directly from the driver is more than 
> >>>>> PMD specific API. Technically application control interface can rely 
> >>>>> completely to this. Even we assume this is not for control, but just 
> >>>>> for debug, I can see it can be useful for debug and again practical 
> >>>>> thing to do, I am still not sure how much it hurts if each driver has a 
> >>>>> custom socket interface for their debug needs.
> >>>>>
> >>>>> Overall it makes more sense to me to have a unified/common interface 
> >>>>> from drivers to DPDK applications, which is through the ethdev layer.
> >>>>> And improve and extend the ethdev layer to satisfy driver needs.
> >>>>>
> >>>>> In this specific example, I am for rejecting the socket interface 
> >>>>> patch, but I would like to get more feedback from @techboard.
> >>>>>
> >>>>
> >>>> The reason we have the addition control channel is not provide
> >>>> additional functionality. We have customers with use-cases that
> >>>> require multiple processes. Since Napatech adapters do not support
> >>>> configuration through VFs, then secondary applications must send
> >>>> their rte_flow to a main application, which will then setup the flow
> >>>> through it's PF. This control channel "hides" these details, and make
> >>>> the product easier for users to adapt to their existing solutions.
> >>>
> >>> I think you need to explore VF representors.
> >>> This is what is done with other drivers, and it make them compatible.
> >>>
> >>>> If you stand firm on rejecting the control channel, then we have to
> >>>> go back to the drawing board on this issue. We did look at DPDK's
> >>>> multi-process support, and actually had some support for this, but we
> >>>> determined that for our use-case it was better to have a
> >>>> communication channel, and no shared memory.
> >>>
> >>> I'm not sure your need is about secondary process.
> >>> Let's discuss this need in a meeting if needed.
> >>> Anyway, the message is that we want to be part of such design decision.
> >>>
> >>>
> >>>>> And related to not being too hard on the newcomers, unrelated to being 
> >>>>> a newcomer or not, if a process/feature/approach approved once, some 
> >>>>> others will point to it and will ask to do the same which is fair in 
> >>>>> their perspective. I had multiple instance of this in the past.
> >>>>>
> >>>>> Of course we are being easy to newcomers but not in a way to allow
> >>>>> code that we believe is not good thing to do, but going easy on
> >>>>> process may be :)
> >>>>>
> >>>>
> >>>> We are grateful for any leniency you may show us ;-)
> >>>>
> >>>> Thanks again,
> >>>> Christian
> >>>>
> >>>>>
> >>>>>>>
> >>>>>>>
> >>>>>>> Thanks,
> >>>>>>> ferruh
> >>>>>>
> >>>>>> Thank you, Ferruh, for taking good care of the community by providing 
> >>>>>> constructive feedback like this to new NIC vendors!
> >>>>>>
> >>>>>> Please note that my feedback is entirely process related. I didn’t 
> >>>>>> review the driver, so I have no technical comments to the patch series.
> >>>>>>
> >>>>>> -Morten
> >>>
> >>>
> >>> We are going to discuss the process in the technical board today.
> >>>
> >>>
> >>
> >> Hi Mykola, Christiam,
> >>
> >> As discussed, following are a few good examples from the DPDK history, 
> >> there is no "fits all, fixed guidelines", but they can serve as samples:
> >>
> >> Marvell cnxk:
> >> https://linkprotect.cudasvc.com/url?a=https%3a%2f%2fpatchwork.dpdk.org%2fproject%2fdpdk%2flist%2f%3fseries%3d17449%26state%3d%252A%26archive%3dboth&c=E,1,DmXU0iHwXoSaZ4bKn-yhX9J8XmFBispd2ut7pxLNBkK3Q4LVpG_zmOf1jnWSS-Y0Fx-TNbPnQDHyBZkDj23Gu7zjPZ5nsA7pid5CsE2vxNk,&typo=1
> >>
> >>
> >> Solarflare sfc (before patchwork series support):
> >> https://linkprotect.cudasvc.com/url?a=https%3a%2f%2fpatchwork.dpdk.org%2fproject%2fdpdk%2fpatch%2f1480436367-20749-2-git-send-email-arybchenko%40solarflare.com%2f&c=E,1,E9oUT_1WuNC2JA8x7an3rC_Pm5g1L5cxJKQ6pTwSbCWSJpiLH2GnmgfFkUqViOOwkpS2df8kgBvHjmulKaWhyr4BBizUT-sL5LJv21Hx4RtHtK3vjhcKpg,,&typo=1
> >> to
> >> https://linkprotect.cudasvc.com/url?a=https%3a%2f%2fpatchwork.dpdk.org%2fproject%2fdpdk%2fpatch%2f1480436367-20749-56-git-send-email-arybchenko%40solarflare.com%2f&c=E,1,GByF_TiC_q11iVPpiPgpCMlSge-J0XfT0zHkriK0rde1Qt1RG7uf6mETQkTSQ-1V86Z7EtRcxlvSsed1sqn8RWfN8KFSbd7NaAkfbDiehn_vSRzja45rQgv53Q,,&typo=1
> >>
> >>
> >> Intel ice:
> >> https://linkprotect.cudasvc.com/url?a=https%3a%2f%2fpatchwork.dpdk.org%2fproject%2fdpdk%2flist%2f%3fseries%3d2842%26state%3d%252A%26archive%3dboth&c=E,1,zQwvAIR3ToLIhT09bVxm_HEF-dp8eyTqhsKB3eOYgIJdd2WS_0ZlTbQKfr9KLyTA3A2A2HzBbjIlz21D_hWVgS_INmmC5eew1J0QBH-PoRNd&typo=1
> >>
> > 
> > Thank you for the links, they have been very helpful.
> > 
> > After a lot of internal discussion, Napatech has decided to implement some 
> > architectural changes to our PMD that will allow us to easier split up the 
> > code into smaller features. The work will require some time, which means 
> > that we will not be ready for the 23.11 release. The current goal is to 
> > attempt to upstream a quite basic PMD in time for 24.7, and a fully 
> > featured PMD for 24.11.
> > 
> > 
> 
> Hi Christiam,
> 
> Good to see there is a solid plan for upstreaming but also not that good
> that it is postponed,
> 
> I am aware it is all tied to your internal planning/resourcing etc, but
> since the effort already started, can it be possible to squeeze very
> basic driver in this release, which just does link up and most basic Rx/Tx?
> It gives opportunity to experiment on device to users.
> 
> We can accept it up to -rc3, which is end of October, so there is still
> some time?
> 
> This is just a suggestion though, no pressure intended.

I agree with Ferruh, better to start early and small.
It shouldn't be too hard to introduce the skeleton of the driver.




Reply via email to