On 9/29/2023 12:24 PM, Thomas Monjalon wrote: >29/09/2023 11:46, Ferruh Yigit: >> On 9/29/2023 10:21 AM, Christian Koue Muf wrote: >> > On 9/21/2023 4:05 PM, Ferruh Yigit wrote: >> >> On 9/20/2023 2:17 PM, Thomas Monjalon wrote: >> >>> Hello, >> >>> >> >>> 19/09/2023 11:06, Christian Koue Muf: >> >>>> On 9/18/23 10:34 AM, Ferruh Yigit wrote: >> >>>>> On 9/15/2023 7:37 PM, Morten Brørup wrote: >> >>>>>>> From: Ferruh Yigit [mailto:ferruh.yi...@amd.com] >> >>>>>>> Sent: Friday, 15 September 2023 17.55 >> >>>>>>> >> >>>>>>> On 9/8/2023 5:07 PM, Mykola Kostenok wrote: >> >>>>>>>> From: Christian Koue Muf <c...@napatech.com> >> >>>>>>>> >> >>>>>>>> The NTNIC PMD does not rely on a kernel space Napatech >> >>>>>>>> driver, thus all defines related to the register layout is >> >>>>>>>> part of the PMD code, which will be added in later commits. >> >>>>>>>> >> >>>>>>>> Signed-off-by: Christian Koue Muf <c...@napatech.com> >> >>>>>>>> Reviewed-by: Mykola Kostenok <mko-...@napatech.com> >> >>>>>>>> >> >>>>>>> >> >>>>>>> Hi Mykola, Christiam, >> >>>>>>> >> >>>>>>> This PMD scares me, overall it is a big drop: >> >>>>>>> "249 files changed, 87128 insertions(+)" >> >>>>>>> >> >>>>>>> I think it is not possible to review all in one release cycle, >> >>>>>>> and it is not even possible to say if all code used or not. >> >>>>>>> >> >>>>>>> I can see code is already developed, and it is difficult to >> >>>>>>> restructure developed code, but restructure it into small >> >>>>>>> pieces really helps for reviews. >> >>>>>>> >> >>>>>>> >> >>>>>>> Driver supports good list of features, can it be possible to >> >>>>>>> distribute upstream effort into multiple release. >> >>>>>>> Starting from basic functionality and add features gradually. >> >>>>>>> Target for this release can be providing datapath, and add >> >>>>>>> more if we have time in the release, what do you think? >> >>> >> >>> I was expecting to get only Rx/Tx in this release, not really more. >> >>> >> >>> I agree it may be interesting to discuss some design and check >> >>> whether we need more features in ethdev as part of the driver >> >>> upstreaming process. >> >>> >> >>> >> >>>>>>> Also there are large amount of base code (HAL / FPGA code), >> >>>>>>> instead of adding them as a bulk, relevant ones with a feature >> >>>>>>> can be added with the feature patch, this eliminates dead code >> >>>>>>> in the base code layer, also helps user/review to understand >> >>>>>>> the link between driver code and base code. >> >>> >> >>> Yes it would be interesting to see what is really needed for the >> >>> basic initialization and what is linked to a specific offload or >> >>> configuration feature. >> >>> >> >>> As a maintainer, I have to do some changes across all drivers >> >>> sometimes, and I use git blame a lot to understand why something was >> >>> added. >> >>> >> >>> >> >>>>>> Jumping in here with an opinion about welcoming new NIC vendors to >> >>>>>> the community: >> >>>>>> >> >>>>>> Generally, if a NIC vendor supplies a PMD for their NIC, I expect the >> >>>>>> vendor to take responsibility for the quality of the PMD, including >> >>>>>> providing a maintainer and support backporting of fixes to the PMD in >> >>>>>> LTS releases. This should align with the vendor's business case for >> >>>>>> upstreaming their driver. >> >>>>>> >> >>>>>> If the vendor provides one big patch series, which may be difficult >> >>>>>> to understand/review, the fallout mainly hits the vendor's customers >> >>>>>> (and thus the vendor's support organization), not the community as a >> >>>>>> whole. >> >>>>>> >> >>>>> >> >>>>> Hi Morten, >> >>>>> >> >>>>> I was thinking same before making my above comment, what happens if >> >>>>> vendors submit as one big patch and when a problem occurs we can ask >> >>>>> owner to fix. Probably this makes vendor happy and makes my life (or >> >>>>> any other maintainer's life) easier, it is always easier to say yes. >> >>>>> >> >>>>> >> >>>>> But I come up with two main reasons to ask for a rework: >> >>>>> >> >>>>> 1- Technically any vendor can deliver their software to their >> >>>>> customers via a public git repository, they don't have to >> >>>>> upstream to >> >>>>> https://linkprotect.cudasvc.com/url?a=https%3a%2f%2fdpdk.org&c=E >> >>>>> ,1,N >> >>>>> poJejuuvPdOPfcFJYtsmkQF6PVrDjGsZ8x_gi5xDrTyZokK_nM11u4ZpzHgM10J9 >> >>>>> bOLl nhoR6fFAzWtCzOhRCzVruYj520zZORv6-MjJeSC5TrGnIFL&typo=1, >> >>>>> but upstreaming has many benefits. >> >>>>> >> >>>>> One of those benefits is upstreaming provides a quality assurance for >> >>>>> vendor's customers (that is why customer can be asking for this, as we >> >>>>> are having in many cases), and this quality assurance comes from >> >>>>> additional eyes reviewing the code and guiding vendors for the DPDK >> >>>>> quality standards (some vendors already doing pretty good, but new >> >>>>> ones sometimes requires hand-holding). >> >>>>> >> >>>>> If driver is one big patch series, it is practically not possible to >> >>>>> review it, I can catch a few bits here or there, you may some others, >> >>>>> but practically it will be merged without review, and we will fail on >> >>>>> our quality assurance task. >> >>>>> >> >>>>> 2- Make code more accessible to the rest of the world. >> >>>>> >> >>>>> When it is a big patch, code can be functional but lots of details, >> >>>>> reasoning, relation between components gets lost, which makes it even >> >>>>> harder for an external developer, like me, to understand it (I am a >> >>>>> mere guinea pig here :). >> >>>>> >> >>>>> If a customer would like to add a feature themselves, or fix >> >>>>> something, even after vendor no more working on that product anymore, >> >>>>> customer needs to understand the code or some reasoning in the code. >> >>>>> Or if someone wants to backport the driver to rust, or a DPDK >> >>>>> developer wants to do a rework that requires updating all drivers, or >> >>>>> a tester would like to analyze the code to figure out behavior >> >>>>> difference of the devices. I think I have witness all above cases in >> >>>>> real life. >> >>>>> >> >>>>> If driver is split into more patches, it makes patch easier to >> >>>>> understand which makes code practically more accessible to other >> >>>>> developers that are not expert in driver. >> >>> >> >>> I fully agree about the 2 reasons for upstreaming piece by piece. >> >>> >> >>> >> >>>>> Overall, yes splitting patch takes time and effort, and yes this is an >> >>>>> overhead for a code that is already developed, but I think benefit is >> >>>>> big so it worth doing the task. >> >>> >> >>> In the meantime, if some features are not yet upstreamed in a >> >>> release, a user can apply the missing patches from the mailing list to >> >>> get the features. >> >>> >> >>> >> >>>>>> We, the community, should not make it too difficult for vendors >> >>>>>> trying to upstream their drivers. I certainly consider it >> >>>>>> unreasonable to ask a vendor to postpone the release of some existing >> >>>>>> features by effectively an entire year (considering that only LTS >> >>>>>> releases are relevant for most of us) because we want the vendor to >> >>>>>> refactor the patch series to match our preferences within an >> >>>>>> unrealistic timeframe. >> >>> >> >>> You're right Morten, we try to be as welcoming as possible, but as >> >>> Ferruh said, we want to be able to understand how a driver is >> >>> built, even if not understanding all details. >> >>> >> >>> In Open Source, I think not only the code should be available, we >> >>> must also take care of explanations and documentation. >> >>> >> >>> >> >>>>> Agree to not make upstreaming difficult for new vendors, and indeed we >> >>>>> are encouraging more vendors to be upstream their code, this is in >> >>>>> best interest of both sides. >> >>>>> >> >>>>> Distributing upstreaming effort to a year was just a suggestion, it >> >>>>> can go in earlier as it is becomes ready but I can see it will take >> >>>>> time to split driver into features and upstream them. >> >>> >> >>> Driver features can be added until -rc2 (in one month). >> >>> >> >>> >> >>>>> As I am from a vendor too, I can understand the product/customer >> >>>>> pressure, but I hope this approach can encourage vendors start >> >>>>> upstreaming early or even better upstream as they develop the code. >> >>>> >> >>>> Hi Ferruh, >> >>>> >> >>>> First of all, thank you for starting the work to review our code. >> >>>> >> >>>> As Morten said Napatech plans to take all responsibility for the >> >>>> quality of the PMD source code. We expect to provide all fixes >> >>>> needed in the future. If for some reason Napatech stops >> >>>> maintaining the code, then we have been informed that the DPDK >> >>>> community might delete the PMD from the repository, and we understand >> >>>> that. >> >>>> >> >>>> In regards to splitting the code, I don't see this a good option. >> >>>> While I of cause agree it would be easier to review and >> >>>> understand, the code should also result in a meaningful product. >> >>>> Of the 87k lines of code, 53k lines is needed to start-up the >> >>>> FPGA to a state the it is ready to receive traffic. But at this >> >>>> point all packets would simply be discarded, and to be honest, >> >>>> there are better and cheaper options out there, if nothing more >> >>>> than basic functionality is needed. 34k lines are used to setup >> >>>> filters based on rte_flow. The thing is, that you need to >> >>>> initialize all modules in the FPGA TX- and RX-pipelines with valid >> >>>> data, even if you don't need the features those modules provide. >> >>>> As a result, if you split up the 34k lines, then the product >> >>>> would not be functional. Of cause some of the top level logic >> >>>> could be split out, but at this point we are talking about >> >>>> splitting 87k lines into 80k and 7k, which I don't think is worth it. >> >>> >> >>> Actually I think it is worth. >> >>> There is a benefit in isolating the small basic init part from the >> >>> more complex features. >> >>> >> >>> >> >>>>>>> As far as I understand last patch opens a socket interface and >> >>>>>>> an external application can sent control commands via this interface. >> >>>>>>> I am not sure about this side control channel, what is missing >> >>>>>>> in the DPDK API? Can we try to address them in the DPDK layer >> >>>>>>> instead of a driver specific solution? >> >>>>>> >> >>>>>> That would be great. >> >>>>>> >> >>>>>> AFAIK, other vendors also has a bunch of out-of-band >> >>>>>> communication, e.g. magical EAL parameters to the MLX drivers. >> >>>>>> So let's not be too hard on the newcomers. ;-) >> >>>>>> >> >>>>> >> >>>>> I did some thinking for this one too, >> >>>>> >> >>>>> As we are in userspace, it is easy to have side control channel, and >> >>>>> this can make users life easy, so this is a practical thing to do. >> >>>>> (Indeed there are already some ways to do this, without PMD >> >>>>> exposing a socket interface.) >> >>>>> >> >>>>> But this also reduces effort developers putting on DPDK layer >> >>>>> solution, because it is always easier to add more support to the >> >>>>> driver only. >> >>>>> And overall this reduces portability of the DPDK application, >> >>>>> each application becomes unique to a device (This is a bad >> >>>>> thing, but I also need some feedback how bad it is in real >> >>>>> life.) >> >>>>> >> >>>>> To balance this, we said if a feature is too specific to a device, it >> >>>>> can add device specific API and this is better than device specific >> >>>>> features pollute the common, most used code. And push back to >> >>>>> introduce more new PMD specific APIs unless it is really needed. >> >>>>> >> >>>>> But creating a socket interface directly from the driver is more than >> >>>>> PMD specific API. Technically application control interface can rely >> >>>>> completely to this. Even we assume this is not for control, but just >> >>>>> for debug, I can see it can be useful for debug and again practical >> >>>>> thing to do, I am still not sure how much it hurts if each driver has >> >>>>> a custom socket interface for their debug needs. >> >>>>> >> >>>>> Overall it makes more sense to me to have a unified/common interface >> >>>>> from drivers to DPDK applications, which is through the ethdev layer. >> >>>>> And improve and extend the ethdev layer to satisfy driver needs. >> >>>>> >> >>>>> In this specific example, I am for rejecting the socket interface >> >>>>> patch, but I would like to get more feedback from @techboard. >> >>>>> >> >>>> >> >>>> The reason we have the addition control channel is not provide >> >>>> additional functionality. We have customers with use-cases that >> >>>> require multiple processes. Since Napatech adapters do not >> >>>> support configuration through VFs, then secondary applications >> >>>> must send their rte_flow to a main application, which will then >> >>>> setup the flow through it's PF. This control channel "hides" >> >>>> these details, and make the product easier for users to adapt to their >> >>>> existing solutions. >> >>> >> >>> I think you need to explore VF representors. >> >>> This is what is done with other drivers, and it make them compatible. >> >>> >> >>>> If you stand firm on rejecting the control channel, then we have >> >>>> to go back to the drawing board on this issue. We did look at >> >>>> DPDK's multi-process support, and actually had some support for >> >>>> this, but we determined that for our use-case it was better to >> >>>> have a communication channel, and no shared memory. >> >>> >> >>> I'm not sure your need is about secondary process. >> >>> Let's discuss this need in a meeting if needed. >> >>> Anyway, the message is that we want to be part of such design decision. >> >>> >> >>> >> >>>>> And related to not being too hard on the newcomers, unrelated to being >> >>>>> a newcomer or not, if a process/feature/approach approved once, some >> >>>>> others will point to it and will ask to do the same which is fair in >> >>>>> their perspective. I had multiple instance of this in the past. >> >>>>> >> >>>>> Of course we are being easy to newcomers but not in a way to >> >>>>> allow code that we believe is not good thing to do, but going >> >>>>> easy on process may be :) >> >>>>> >> >>>> >> >>>> We are grateful for any leniency you may show us ;-) >> >>>> >> >>>> Thanks again, >> >>>> Christian >> >>>> >> >>>>> >> >>>>>>> >> >>>>>>> >> >>>>>>> Thanks, >> >>>>>>> ferruh >> >>>>>> >> >>>>>> Thank you, Ferruh, for taking good care of the community by providing >> >>>>>> constructive feedback like this to new NIC vendors! >> >>>>>> >> >>>>>> Please note that my feedback is entirely process related. I didn’t >> >>>>>> review the driver, so I have no technical comments to the patch >> >>>>>> series. >> >>>>>> >> >>>>>> -Morten >> >>> >> >>> >> >>> We are going to discuss the process in the technical board today. >> >>> >> >>> >> >> >> >> Hi Mykola, Christiam, >> >> >> >> As discussed, following are a few good examples from the DPDK history, >> >> there is no "fits all, fixed guidelines", but they can serve as samples: >> >> >> >> Marvell cnxk: >> >> https://linkprotect.cudasvc.com/url?a=https%3a%2f%2fpatchwork.dpdk. >> >> org%2fproject%2fdpdk%2flist%2f%3fseries%3d17449%26state%3d%252A%26a >> >> rchive%3dboth&c=E,1,DmXU0iHwXoSaZ4bKn-yhX9J8XmFBispd2ut7pxLNBkK3Q4L >> >> VpG_zmOf1jnWSS-Y0Fx-TNbPnQDHyBZkDj23Gu7zjPZ5nsA7pid5CsE2vxNk,&typo= >> >> 1 >> >> >> >> >> >> Solarflare sfc (before patchwork series support): >> >> https://linkprotect.cudasvc.com/url?a=https%3a%2f%2fpatchwork.dpdk. >> >> org%2fproject%2fdpdk%2fpatch%2f1480436367-20749-2-git-send-email-ar >> >> ybchenko%40solarflare.com%2f&c=E,1,E9oUT_1WuNC2JA8x7an3rC_Pm5g1L5cx >> >> JKQ6pTwSbCWSJpiLH2GnmgfFkUqViOOwkpS2df8kgBvHjmulKaWhyr4BBizUT-sL5LJ >> >> v21Hx4RtHtK3vjhcKpg,,&typo=1 >> >> to >> >> https://linkprotect.cudasvc.com/url?a=https%3a%2f%2fpatchwork.dpdk. >> >> org%2fproject%2fdpdk%2fpatch%2f1480436367-20749-56-git-send-email-a >> >> rybchenko%40solarflare.com%2f&c=E,1,GByF_TiC_q11iVPpiPgpCMlSge-J0Xf >> >> T0zHkriK0rde1Qt1RG7uf6mETQkTSQ-1V86Z7EtRcxlvSsed1sqn8RWfN8KFSbd7NaA >> >> kfbDiehn_vSRzja45rQgv53Q,,&typo=1 >> >> >> >> >> >> Intel ice: >> >> https://linkprotect.cudasvc.com/url?a=https%3a%2f%2fpatchwork.dpdk. >> >> org%2fproject%2fdpdk%2flist%2f%3fseries%3d2842%26state%3d%252A%26ar >> >> chive%3dboth&c=E,1,zQwvAIR3ToLIhT09bVxm_HEF-dp8eyTqhsKB3eOYgIJdd2WS >> >> _0ZlTbQKfr9KLyTA3A2A2HzBbjIlz21D_hWVgS_INmmC5eew1J0QBH-PoRNd&typo=1 >> >> >> > >> > Thank you for the links, they have been very helpful. >> > >> > After a lot of internal discussion, Napatech has decided to implement some >> > architectural changes to our PMD that will allow us to easier split up the >> > code into smaller features. The work will require some time, which means >> > that we will not be ready for the 23.11 release. The current goal is to >> > attempt to upstream a quite basic PMD in time for 24.7, and a fully >> > featured PMD for 24.11. >> > >> > >> >> Hi Christiam, >> >> Good to see there is a solid plan for upstreaming but also not that >> good that it is postponed, >> >> I am aware it is all tied to your internal planning/resourcing etc, >> but since the effort already started, can it be possible to squeeze >> very basic driver in this release, which just does link up and most basic >> Rx/Tx? >> It gives opportunity to experiment on device to users. >> >> We can accept it up to -rc3, which is end of October, so there is >> still some time? >> >> This is just a suggestion though, no pressure intended. > > I agree with Ferruh, better to start early and small. > It shouldn't be too hard to introduce the skeleton of the driver.
Hi Ferruh and Thomas, My apologies for the late response, I have been sick the last week. We can try to create a small PMD in time. The reason I'm cautious is because Napatech plan to make quite large changes to the PMD, to achieve a more stable and modular code-base. This means that future updates will have quite large diffs, until these changes are in place.