On 6/17/2020 6:15 AM, Maxim Mikityanskiy wrote:
Hi,
I discovered the Intel ADQ feature [1], which allows boosting performance by
picking dedicated queues for application traffic. We did some research,
and I have some level of understanding of how it works, but I still have
some questions, and I hope you could answer them.
1. SO_INCOMING_NAPI_ID usage. In my understanding, every connection has
a key (sk_napi_id) that is unique to the NAPI where this connection is
handled, and the application uses that key to choose a handler thread
from the thread pool. If we have a one-to-one relationship between
application threads and NAPI IDs of connections, each application thread
will handle only traffic from a single NAPI. Is my understanding correct?
Yes. It is correct and recommended with the current implementation.
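For illustration, here is a minimal sketch of that dispatch from the
application side (pick_worker() and NUM_WORKERS are made-up names for
this example, not part of any kernel or ADQ API; any stable
napi_id -> thread mapping would do):

#include <stdio.h>
#include <sys/socket.h>

#ifndef SO_INCOMING_NAPI_ID
#define SO_INCOMING_NAPI_ID 56  /* value from asm-generic/socket.h */
#endif

#define NUM_WORKERS 8           /* illustrative thread-pool size */

/* Keeping the napi_id -> worker mapping stable (and collision-free)
 * means each worker only sees traffic from a single NAPI. */
static int pick_worker(unsigned int napi_id)
{
        return napi_id % NUM_WORKERS;
}

int dispatch_connection(int connfd)
{
        unsigned int napi_id = 0;
        socklen_t len = sizeof(napi_id);

        if (getsockopt(connfd, SOL_SOCKET, SO_INCOMING_NAPI_ID,
                       &napi_id, &len) < 0) {
                perror("getsockopt(SO_INCOMING_NAPI_ID)");
                return -1;
        }

        /* All connections arriving on the same NAPI get the same worker. */
        return pick_worker(napi_id);
}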
1.1. I wonder how the application thread gets scheduled on the same core
that the NAPI runs on. It currently only works with busy_poll, so when the
application initiates busy polling (calls epoll), does the Linux
scheduler move the thread to the right CPU? Do we have to have a strict
one-to-one relationship between threads and NAPIs, or can one thread
handle multiple NAPIs? When the data arrives, does the scheduler run the
application thread on the same CPU that NAPI ran on?
The app thread can do busy polling from any core; there is no requirement
for the scheduler to move the thread to a specific CPU.
If the NAPI processing happens via interrupts, the scheduler could move
the app thread to the same CPU that the NAPI ran on.
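As a hedged sketch, this is how an application can opt a socket into busy
polling (the helper name and the 50 usec budget in the usage comment are
just examples; epoll-driven busy polling is additionally governed by the
net.core.busy_poll sysctl and works best when all sockets in one epoll
instance come from the same NAPI, i.e. the one-to-one mapping above):

#include <stdio.h>
#include <sys/socket.h>

#ifndef SO_BUSY_POLL
#define SO_BUSY_POLL 46         /* value from asm-generic/socket.h */
#endif

/* Set the per-socket busy-read budget, in microseconds.
 * usage: enable_busy_poll(connfd, 50);  -- 50 usec is arbitrary */
int enable_busy_poll(int fd, int usec)
{
        if (setsockopt(fd, SOL_SOCKET, SO_BUSY_POLL,
                       &usec, sizeof(usec)) < 0) {
                perror("setsockopt(SO_BUSY_POLL)");
                return -1;
        }
        return 0;
}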
1.2. I see that SO_INCOMING_NAPI_ID is tightly coupled with busy_poll.
It is enabled only if CONFIG_NET_RX_BUSY_POLL is set. Is there a real
reason why it can't be used without busy_poll? In other words, if we
modify the kernel to drop this requirement, will the kernel still
schedule the application thread on the same CPU as NAPI when busy_poll
is not used?
It should be OK to remove this restriction, but it requires enabling this
in skb_mark_napi_id() and sk_mark_napi_id() too.
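For context, a simplified paraphrase (not verbatim) of where that gating
lives in include/net/busy_poll.h: the NAPI ID is only recorded on the
socket when CONFIG_NET_RX_BUSY_POLL is set, which is why
SO_INCOMING_NAPI_ID depends on that option today:

/* Simplified paraphrase of the kernel helper around this time. */
static inline void sk_mark_napi_id(struct sock *sk,
                                   const struct sk_buff *skb)
{
#ifdef CONFIG_NET_RX_BUSY_POLL
        sk->sk_napi_id = skb->napi_id;
#endif
}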
2. Can you compare ADQ to aRFS+XPS? aRFS provides a way to steer traffic
to the application's CPU in an automatic fashion, and xps_rxqs can be
used to transmit from the corresponding queues. This setup doesn't need
manual configuration of TCs and is not limited to 4 applications. The
difference with ADQ is that (in my understanding) it moves the application
to the RX CPU, while aRFS steers the traffic to the RX queue handled by
the application's CPU. Is there any advantage of ADQ over aRFS that I
failed to find?
aRFS+XPS ties an app thread to a CPU, whereas ADQ ties an app thread to a
NAPI ID, which in turn maps to a queue (or queues).
ADQ also provides two levels of filtering compared to aRFS+XPS. The first
level of filtering selects the queue set associated with the application,
and the second-level filter or RSS selects a queue within that queue set
associated with an app thread.
The current interface used to configure ADQ limits us to supporting up to
16 application-specific queue sets (TC_MAX_QUEUE).
3. At [1], you mention that ADQ can be used to create separate RSS sets.
Could you elaborate on the API used? Does the tc mqprio
configuration also affect RSS? Can it be turned on/off?
Yes. tc mqprio allows creating queue sets per application, and the
driver configures RSS per queue set.
4. How is tc flower used in the context of ADQ? Does the user need to
reflect the configuration in both the mqprio qdisc (for TX) and tc flower
(for RX)? It looks like tc flower maps incoming traffic to TCs, but what
is the mechanism of mapping TCs to RX queues?
tc mqprio is used to map TCs to RX queues.
tc flower is used to configure the first-level filter that redirects
packets to the queue set associated with an application.
I really hope you will be able to shed more light on this feature to
increase my awareness of how to use it and to compare it with aRFS.
Hope this helps; we will go over this in more detail in our netdev session.
Thanks,
Max
[1]:
https://netdevconf.info/0x14/session.html?talk-ADQ-for-system-level-network-io-performance-improvements