Hi Bruce, > From: Bruce Richardson <bruce.richard...@intel.com> > [...] > this seems really interesting, but in the absense of TB of memory > being > used, is it easily possible to see the benefits of this work? I've > been > playing with adding large memory allocations to helloworld example and > checking the runtime. Allocating 1GB using malloc per thread seems to > show > a small (<0.5 second at most) benefit, and using a fixed 10GB > allocation > using memzone_reserve at startup shows runtimes within the margin of > error > when run with --huge-unlink=existing vs huge-unlink=never. At what > size of > memory footprint is it expected to make a clear improvement?
Sorry, there was a bug in v1 that completely broke the testing. I should've double-checked after what I considered a quick rebase before sending. Version 2 can be simply tested even without modifyin the code: time sh -c 'echo quit | sudo ../_build/dpdk/app/test/dpdk-test --huge-unlink=never -m 8192 --single-file-segments --no-pci 2>/dev/null >/dev/null' With --huge-unlink=existing: real 0m1.450s user 0m0.574s sys 0m0.706s (1) With --huge-unlink=never, first run (no hugepage files to reuse): real 0m0.892s user 0m0.002s sys 0m0.718s (2) With --huge-unlink=never, second run (hugepage files left): real 0m0.210s user 0m0.010s sys 0m0.021s (3) Notice that (1) and (2) are close since there is no reuse, but (2) and (3) are differ by 0.7 seconds for 8GB, which correlates with 14 GB/sec memset() speed on this machine. Results without --single-file-segments are nearly identical.