On Tue, Nov 19, 2024 at 09:39:21AM -0500, Greg Sabino Mullane wrote: > Oh, and a +1 in general to the patch, OP, although it would also be nice to > start finding the bottlenecks that cause such performance issues.
FWIW, I'm not eager to integrate this proposal without looking at this exact argument in depth. One piece of it would be to see how much of such "bottlenecks" we would be able to get rid of by integrating pg_stat_statements into the central pgstats with the custom APIs, without pushing the module into core. This means that we would combine the existing hash of pgss to shrink to 8 bytes for objid rather than 13 bytes now as the current code relies on (toplevel, userid, queryid) for the entry lookup (entry removal is sniped with these three values as well, or dshash seq scans). The odds of conflicts still still play in our favor even if we have a few million entries, or even ten times that. This would also get rid of the pgss text file for the queries, which is a source of one of the bottlenecks, as we could just store query strings upper-bounded based on a postmaster GUC to control the size of the entries in the pgstats dshash. More normalization for IN and ANY clauses would also help a not here, these are a cause of a lot of bloat. This integration is not something I will be able to work on for the PG18 dev cycle as I'm in full review/commit mode for the rest of this release, but I got some plans for it in PG19 except if somebody beats me to it. -- Michael
signature.asc
Description: PGP signature