Hi,

I'm experimenting with adding performance regression testing to our CI. Currently, our CI has quite extensive functional coverage but completely lacks performance testing. Given that we use pytest, pytest-benchmark (https://pytest-benchmark.readthedocs.io/en/latest/) looks like a good candidate framework.

I've prototyped things in https://github.com/OSGeo/gdal/pull/8538

Basically, we now have an autotest/benchmark directory where performance tests can be written.
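
For illustration, a minimal test in that directory could look like the sketch below (the dataset path and test name are just placeholders of mine, not something taken from the PR):

    # hypothetical example of a pytest-benchmark test
    from osgeo import gdal

    def test_read_band_checksum(benchmark):
        def read():
            ds = gdal.Open("data/utmsmall.tif")  # placeholder dataset
            return ds.GetRasterBand(1).Checksum()
        benchmark(read)

The benchmark fixture provided by the plugin runs the function several times and records timing statistics (min, mean, stddev, etc.).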

Then, in the CI, we check out a reference commit, build it, and run the performance test suite in --benchmark-save mode.

We then run the performance test suite on the PR in --benchmark-compare mode with a --benchmark-compare-fail="mean:5%" criterion (which means that a test fails if its mean runtime is more than 5% slower than the reference one).
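
Roughly, the two phases boil down to the following pytest invocations (expressed through pytest.main() purely for illustration; in the CI they are of course separate runs against the reference build and the PR build, and the "ref" baseline name is an arbitrary choice of mine):

    import pytest

    # Phase 1, on the reference build: record baseline timings
    pytest.main(["autotest/benchmark", "--benchmark-save=ref"])

    # Phase 2, on the PR build: compare against the latest saved run and
    # fail any test whose mean runtime is more than 5% slower
    pytest.main(["autotest/benchmark",
                 "--benchmark-compare",
                 "--benchmark-compare-fail=mean:5%"])

This assumes the default .benchmarks storage directory is preserved between the two runs so that the comparison can find the saved baseline.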

From what I can see, pytest-benchmark behaves correctly when tests are removed or added (that is, it does not fail, it just skips them during comparison). The only thing one should not do is modify an existing test with respect to the reference branch.

Does anyone have practical experience with pytest-benchmark, in particular in CI setups? With virtualization, it is hard to guarantee that other things happening on the host running the VM won't interfere. Even locally on my own machine, I initially saw strong variations in timings, which I could reduce to an acceptable deviation by disabling the Intel Turbo Boost feature (echo 1 | sudo tee /sys/devices/system/cpu/intel_pstate/no_turbo).

Even

--
http://www.spatialys.com
My software is free, but my time generally not.
