On 2024-05-04 00:00, Stephen Hemminger wrote:
On Fri, 3 May 2024 16:45:47 +0100
Ferruh Yigit <ferruh.yi...@amd.com> wrote:
For stats reset, use an offset instead of zeroing out the actual stats values;
get_stats() displays the difference between the stats and the offset.
This way, stats are only updated in the datapath and the offset is only
updated in the stats reset function. This makes the stats reset function more reliable.
As stats are only written by a single thread, we can remove the 'volatile' qualifier,
which should improve performance in the datapath.
While updating the code, the 'igb_stats' parameter is renamed to 'stats'.
Signed-off-by: Ferruh Yigit <ferruh.yi...@amd.com>
---
Cc: Mattias Rönnblom <mattias.ronnb...@ericsson.com>
Cc: Stephen Hemminger <step...@networkplumber.org>
Cc: Morten Brørup <m...@smartsharesystems.com>
This update was triggered by the mailing list discussion [1].
[1]
https://inbox.dpdk.org/dev/3b2cf48e-2293-4226-b6cd-5f4dd3969...@lysator.liu.se/
NAK
I did not hear a good argument why atomic or volatile was necessary in the
first place.
Why?
On the reader side, loads should be atomic.
On the writer side, stores should be atomic.
Updates (stores) should actually occur in a timely manner. The complete
read-modify-write cycle need not be atomic, since we only have a single
writer. All this for the per-lcore counter case.
If load or store tearing occurs, the counter values may occasionally
take totally bogus values. I think that should be avoided. Especially
since it will likely come at a very reasonable cost.
From what I can tell, load or store tearing may well occur. GCC may
generate two 32-bit stores for a program-level 64-bit store on 32-bit
x86. On targets with constant and immediate-data store instructions,
constant writes may also end up torn. The kernel documentation has
some examples of this. Add LTO, and it's not necessarily going to be all
that clear what is storing-a-constant and what is not.
Maybe you care a little less if statistics are occasionally broken, or in
some transient, inconsistent state, but generally they should work, and
they should never take totally bogus values. So statistics aren't
snowflakes; they're mostly just business as usual.
We can't both have a culture that promotes C11-style parallel
programming, or, at the extreme, pushes the C11 APIs as-is, and then say
"and btw you don't have to care about the standard when it comes to
statistics".
We could adopt the Linux kernel's rules, programming model, and APIs
(ignoring legal issues). That would be very old school, maybe somewhat
over-engineered for our purpose, include a fair amount of inline
assembler, and may well depend on GCC or GCC-like compilers,
just like what I believe the kernel does.
We could use something in-between, heavily inspired by C11 but still
with an opportunity to work around compiler issues, library issues, and
extend the API for our use case.
I agree we shouldn't have to mark statistics _Atomic, or RTE_ATOMIC(),
rte_atomic64_t, or rte_sometimes_atomic_and_sometimes_not64_t. Just
keeping the usual C integer types seems like a better option to me.
Why is this driver special (a snowflake) compared to all the other drivers
doing software statistics (tap, virtio, xdp, ring, memif, netvsc, vmware)?
If a broken piece of code has been copied around, one place is going to
be the first to be fixed.