PING for apply.

Patch has 2 acks.
And since it was signed off by a co-maintainer (myself), I don't think an ack 
from the other co-maintainer (Andrew) is required. Please correct me if I'm 
wrong?

-Morten

> -----Original Message-----
> From: Morten Brørup [mailto:m...@smartsharesystems.com]
> Sent: Tuesday, 18 June 2024 15.48
> To: Bruce Richardson
> Cc: honnappa.nagaraha...@arm.com; tho...@monjalon.net;
> andrew.rybche...@oktetlabs.ru; dev@dpdk.org; fengcheng...@huawei.com
> Subject: RE: [PATCH v7] mempool: test performance with larger bursts
> 
> > From: Bruce Richardson [mailto:bruce.richard...@intel.com]
> >
> > On Mon, Jun 10, 2024 at 10:56:00AM +0200, Morten Brørup wrote:
> > > PING (again) for review.
> > >
> > > Many applications use bursts of more than 32 packets,
> > > and some applications buffer more than 512 packets.
> > >
> > > This patch updates the mempool perf test accordingly.
> > >
> > > -Morten
> > >
> > > > From: Morten Brørup [mailto:m...@smartsharesystems.com]
> > > > Sent: Thursday, 4 April 2024 11.27
> > > >
> > > > PING for review. This patch is relatively trivial.
> > > >
> > > > > From: Morten Brørup [mailto:m...@smartsharesystems.com]
> > > > > Sent: Saturday, 2 March 2024 21.04
> > > > >
> > > > > Bursts of up to 64, 128 and 256 packets are not uncommon, so increase
> > the
> > > > > maximum tested get and put burst sizes from 32 to 256.
> > > > > For convenience, also test get and put burst sizes of
> > > > > RTE_MEMPOOL_CACHE_MAX_SIZE.
> > > > >
> > > > > Some applications keep more than 512 objects, so increase the maximum
> > > > > number of kept objects from 512 to 32768, still in jumps of factor
> four.
> > > > > This exceeds the typical mempool cache size of 512 objects, so the
> test
> > > > > also exercises the mempool driver.
> > > > >
> > > > > Increased the precision of rate_persec calculation by timing the
> actual
> > > > > duration of the test, instead of assuming it took exactly 5 seconds.
> > > > >
> > > > > Added cache guard to per-lcore stats structure.
> > > > >
> > > > > Signed-off-by: Morten Brørup <m...@smartsharesystems.com>
> > > > > Acked-by: Chengwen Feng <fengcheng...@huawei.com>
> > > > > ---
> > > > >
> > > > > v7:
> > > > > * Increase max burst size to 256. (Inspired by Honnappa)
> > > > > v6:
> > > > > * Do not test with more lcores than available. (Thomas)
> > > > > v5:
> > > > > * Increased N, to reduce measurement overhead with large numbers of
> kept
> > > > >   objects.
> > > > > * Increased precision of rate_persec calculation.
> > > > > * Added missing cache guard to per-lcore stats structure.
> >
> > This looks ok to me. However, the test itself takes a very long time to
> > run, with 5 seconds per iteration. One suggest I have is to reduce the
> > 5-seconds to 1-second - given we are looking at millions of iterations each
> > time, the difference in results should not be that great, I'd hope.
> 
> The test duration annoys me too.
> 
> Reducing the duration of each iteration would make the test more sensitive to
> short spikes of noise, e.g. from noisy neighbors in virtual environments.
> Someone once decided that 5 seconds was a good duration, and I didn't want to
> challenge that.
> 
> I also considered reducing the array of tested burst sizes, by jumping factor
> four here too; but I assume that both 32, 64, 128 and 256 are popular max
> burst sizes in applications, so I decided to keep them all, instead of
> omitting 32 and 128 and only keeping 64 and 256 to represent full bursts.
> 
> > A very
> > quick test of the delta on my end indicates variance in the first couple of
> > results of a couple of %, just.
> 
> Thanks for the review and suggestions, though.
> 
> >
> > With or without this suggestion.
> >
> > Acked-by: Bruce Richardson <bruce.richard...@intel.com>

Reply via email to