Re: Multiple Query IDs for a rewritten parse tree

Andrey V. Lepikhov Mon, 31 Jan 2022 01:59:35 -0800

On 1/28/22 9:51 PM, Dmitry Dolgov wrote:

On Fri, Jan 21, 2022 at 11:33:22AM +0500, Andrey V. Lepikhov wrote:
Registration of an queryId generator implemented by analogy with extensible
methods machinery.


Why not more like suggested with stakind and slots in some data
structure? All of those generators have to be iterated anyway, so not
sure if a hash table makes sense.

Maybe. But it is not obvious. We don't really know, how many extensionscould set an queryId.For example, adaptive planning extensions definitely wants to set anunique id (for example, simplistic counter) to trace specific{query,plan} across all executions (remember plancache too). And theywould register a personal generator for such purpose.

Also, I switched queryId to int64 type and renamed to
'label'.


A name with "id" in it would be better I believe. Label could be think
of as "the query belongs to a certain category", while the purpose is
identification.

I think, it is not a full true. Current jumbling generates not uniquequeryId (i hope, intentionally) and pg_stat_statements uses queryId togroup queries into classes.For tracking specific query along execution path it performs additionalefforts (to remember nesting query level, as an example).BTW, before [1], I tried to improve queryId, that can be stable forpermutations of tables in 'FROM' section and so on. It would allow toreduce a number of pg_stat_statements entries (critical factor when youuse an ORM, like 1C for example).

So, i think queryId is an Id and a category too.

2. We need a custom queryId, that is based on a generated queryId (according
to the logic of pg_stat_statements).


Could you clarify?

pg_stat_statements uses origin queryId and changes it for a reason(sometimes zeroed it, sometimes not). So you can't use this value inanother extension and be confident that you use original value,generated by JumbleQuery(). Custom queryId allows to solve this problem.

4. We should reserve position of default in-core generator


 From the discussion above I was under the impression that the core
generator should be distinguished by a predefined kind.

Yes, but I think we should have a range of values, enough for use inthird party extensions.

5. We should add an EXPLAIN hook, to allow an extension to print this custom
queryId.


Why? It would make sense if custom generation code will be generating
some complex structure, but the queryId itself is still a hash.

Extension can print not only queryId, but an explanation of a kind,maybe additional logic.Moreover why an extension can't show some useful monitoring data,collected during an query execution, in verbose mode?

[1]https://www.postgresql.org/message-id/flat/e50c1e8f-e5d6-5988-48fa-63dd992e9565%40postgrespro.ru

--
regards,
Andrey Lepikhov
Postgres Professional

Re: Multiple Query IDs for a rewritten parse tree

Reply via email to