On Wed, Jan 19, 2022 at 09:12:27PM +0000, Dmitry Kozlyuk wrote:
> Hi Bruce,
> 
> > From: Bruce Richardson <bruce.richard...@intel.com>
> > [...]
> > this seems really interesting, but in the absense of TB of memory
> > being
> > used, is it easily possible to see the benefits of this work? I've
> > been
> > playing with adding large memory allocations to helloworld example and
> > checking the runtime. Allocating 1GB using malloc per thread seems to
> > show
> > a small (<0.5 second at most) benefit, and using a fixed 10GB
> > allocation
> > using memzone_reserve at startup shows runtimes within the margin of
> > error
> > when run with --huge-unlink=existing vs huge-unlink=never. At what
> > size of
> > memory footprint is it expected to make a clear improvement?
> 
> Sorry, there was a bug in v1 that completely broke the testing.
> I should've double-checked
> after what I considered a quick rebase before sending.
> Version 2 can be simply tested even without modifyin the code:
> 
> time sh -c 'echo quit | sudo ../_build/dpdk/app/test/dpdk-test
>       --huge-unlink=never -m 8192 --single-file-segments --no-pci
>       2>/dev/null >/dev/null'
> 
> With --huge-unlink=existing:
> real    0m1.450s
> user    0m0.574s
> sys     0m0.706s      (1)
> 
> With --huge-unlink=never, first run (no hugepage files to reuse):
> real    0m0.892s
> user    0m0.002s
> sys     0m0.718s      (2)
> 
> With --huge-unlink=never, second run (hugepage files left):
> real    0m0.210s
> user    0m0.010s
> sys     0m0.021s      (3)
> 
> Notice that (1) and (2) are close since there is no reuse,
> but (2) and (3) are differ by 0.7 seconds for 8GB,
> which correlates with 14 GB/sec memset() speed on this machine.
> Results without --single-file-segments are nearly identical.

Thanks, glad to hear it wasn't just me! I'll check again the v2 when I get
the chance.

Reply via email to