On 12/12/25 17:23, Sergey Soloviev wrote:
> Also, the cost calculation logic is adjusted a bit: it takes into account top-down index
> traversal, and the final external merge cost is added only if a spill is expected.

Hi,
Here is my 'aerial' review:
The patch proposes a new aggregation strategy that builds an in-memory B+tree index for grouping. This combines incremental group formation (like AGG_HASHED) with sorted output (like AGG_SORTED), which is beneficial when the query requires both grouping and ordering on (almost) the same columns. The key advantage is avoiding a separate sort step when the sorted output is needed, at the cost of additional CPU overhead.
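For intuition, here is a minimal Python sketch of the idea (my own toy, not the patch's code; all names are mine): groups are formed incrementally as tuples arrive, hash-style, but the group keys are kept in order, so emitting sorted output needs no separate sort step.

```python
import bisect

def index_agg(rows, key, init, combine):
    """Group rows incrementally while keeping group keys ordered,
    so iteration yields groups in sorted order with no final sort."""
    keys = []    # sorted list of group keys (stands in for the B+tree)
    states = {}  # per-group aggregate state
    for row in rows:
        k = key(row)
        if k not in states:
            bisect.insort(keys, k)  # O(n) here; a real B+tree is O(log n)
            states[k] = init()
        states[k] = combine(states[k], row)
    return [(k, states[k]) for k in keys]  # already in key order

# Roughly: SELECT k, sum(v) FROM t GROUP BY k ORDER BY k;
rows = [(3, 10), (1, 5), (3, 7), (2, 1)]
result = index_agg(rows, key=lambda r: r[0], init=lambda: 0,
                   combine=lambda s, r: s + r[1])
```

The extra per-tuple comparisons on the descent are exactly the CPU overhead mentioned above; the win is that the final sort disappears.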

My doubts:
1. Could you benchmark the scenario where the optimiser mispredicts numGroups? If the planner underestimates group cardinality, the btree overhead could be much higher than expected. Does the approach degrade gracefully?

2. Consider splitting the hash_* → spill_* field renaming into a separate preparatory commit; that would reduce the complexity of reviewing the core logic changes.

3. I notice AGG_INDEX requires both sortable AND hashable types. I understand this is for the hash-based spill partitioning, but is the limitation necessary? Could you use sort-based spilling (similar to tuplesort's external merge) instead? That would allow AGG_INDEX to work with sortable-only types (imagine a geometric type with B-tree operators but no hash functions).
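On point 3, the sort-based spilling I have in mind would look roughly like tuplesort's external merge: flush each overflowing in-memory batch as a sorted run, then k-way merge the runs, combining equal keys as they meet. A Python sketch (all names mine, only to show that comparisons alone suffice, no hash function needed):

```python
import heapq

def merge_spilled_runs(runs, combine):
    """k-way merge of sorted (key, state) runs; equal keys from
    different runs are combined, so only comparison support is
    required of the grouping type."""
    merged = []
    for k, v in heapq.merge(*runs):  # relies on each run being sorted
        if merged and merged[-1][0] == k:
            merged[-1] = (k, combine(merged[-1][1], v))
        else:
            merged.append((k, v))
    return merged

# Two sorted runs that were spilled separately; key 3 appears in both.
runs = [[(1, 2), (3, 4)], [(2, 1), (3, 6)]]
out = merge_spilled_runs(runs, lambda a, b: a + b)
```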

The main question for me is: can you devise a robust cost model that sets smooth boundaries between all three grouping strategies? Does it really promise frequent benefits while avoiding regressions? Remember that by enlarging the search space we also increase planning time, which may be palpable in cases with many groupings or grouping attributes; for example, an Append over a partitioned table with a pushed-down aggregate looks like a trivial case where this shows up.
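To illustrate why the boundary is sensitive to estimation error, here is a deliberately crude cost sketch. The formulas and constants are entirely mine, not the patch's: hash aggregation pays O(1) per input tuple plus an O(g log g) sort of the finished groups, while index aggregation pays O(log g) comparisons per input tuple with sorted output for free.

```python
import math

def cost_hash_plus_sort(ntuples, ngroups, cpu_tuple=1.0, cpu_cmp=1.25):
    # Toy model with made-up constants: constant hash work per input
    # tuple, then an O(g log g) sort of the ngroups results.
    return ntuples * cpu_tuple + ngroups * cpu_cmp * math.log2(max(ngroups, 2))

def cost_index_agg(ntuples, ngroups, cpu_cmp=1.25):
    # Every input tuple descends a btree of ~ngroups entries,
    # i.e. O(log g) comparisons; no final sort is needed.
    return ntuples * cpu_cmp * math.log2(max(ngroups, 2))
```

With ntuples = 1M, the hash path wins easily at ngroups = 10, while the index path edges ahead as ngroups approaches ntuples; a planner that estimates 10 groups but encounters a million picks the loser, which is exactly the misprediction concern in doubt 1.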

--
regards, Andrei Lepikhov,
pgEdge

