why is pg_upgrade's regression run so slow?

Andrew Dunstan Sat, 27 Jul 2024 06:08:39 -0700

As part of its 002_pgupgrade.pl, the pg_upgrade tests do a rerun of the normal regression tests. That in itself is sad, but what concerns me here is why it's so much slower than the regular run. This is apparent everywhere (e.g. on crake the standard run takes about 30 to 90 s, but pg_upgrade's run takes 5 minutes or more). On Windows, it's catastrophic, and only hasn't been noticed because the buildfarm client wasn't counting a timeout as a failure. That was an error on my part and I have switched a few of my machines to code that checks more robustly for failure of meson tests - specifically by looking for the absence of test.success rather than the presence of test.fail. That means that drongo and fairywren are getting timeout errors. e.g. on the latest run on fairywren, the regular regression run took 226s, but pg_upgrade's run of what should be the same set of tests took 2418s. What the heck is going on here? Is it because there are the concurrent tests running? That doesn't seem enough to make the tests run more than 10 times as long.

I have a strong suspicion this is exacerbated by "debug_parallel_query = regress", especially since the tests run much faster on REL_17_STABLE where I am not setting that, but that can't be the whole explanation, since that setting should apply to both sets of tests.



cheers


andrew

--
Andrew Dunstan
EDB: https://www.enterprisedb.com
