On 27/7/2025 00:51, Alexander Korotkov wrote:
> On Tue, Jul 22, 2025 at 2:13 PM Andrei Lepikhov <lepi...@gmail.com> wrote:
>
> I've another idea. cost_tuplesort() puts 2.0 under the logarithm to prefer tuplesort over heapsort. I think we can adjust cost_gather_merge() and cost_merge_append() to do the same. The 0001 patch implements that. I think the plan changes of 0001 might be reasonable, since most cases deal with small rowsets. One thing concerns me: 0002 still affects one of the postgres_fdw checks. Could you, please, take a look?
Thanks for the idea!
I analysed your approach a little bit.
Initially, I ran the test script I had created previously [1] and discovered that at a large scale (1e6 - 1e7 tuples), the planner still chooses MergeAppend, which contradicts the measured execution times (7190 ms for Sort+Append vs 8450 ms for MergeAppend+Sort).

Attempting to find out the reason, I combined all the costs into a single formula for each strategy:

MergeAppend+Sort:
total_cost = CO * ntuples * (1 + 2*log(ntuples)) + Ccput * 0.5 * ntuples + 2 * CO * N * log(N) + A
Sort+Append:
total_cost = CO * ntuples * (1 + 2*log(ntuples)) + Ccput * 0.5 * ntuples + A

Terms:
- A - sum of total costs of underlying subtrees
- CO - cpu_operator_cost
- Ccput - cpu_tuple_cost
- N - number of subpaths (streams)

Given the significant gap in total execution time between these strategies, I believe it would be reasonable to introduce a coefficient into the 'ntuples' component of the equation that keeps the gap between the big quicksort and MergeAppend's heapsort outside the fuzzy-factor range.

Looking into papers on the value of the comparison constant in quicksort [2] and heapsort [3], I realised that there is a difference. The constant varies over a wide range: 1.3-1.5 for quicksort and 2-3 for heapsort. Considering that we should change the current cost model as little as possible, so as not to break the balance, we may simply increase the constant for heapsort to maintain a bare-minimum gap between the strategies beyond the fuzzy factor. In that case, the merge-append constant should be around 3.8 - 4.0.

With this minor change, we see a shift in the regression tests. Most of these changes are introduced by the new append strategy. Although I haven't analysed them in depth yet, I believe they all relate to small data sets and should fade out at a larger scale.

See this minor correction in the attachment. The postgres_fdw tests are stable now.

[1] https://github.com/danolivo/conf/blob/main/Scripts/sort-vs-mergeappend-3.sql
[2] https://en.wikipedia.org/wiki/Quicksort
[3] https://arxiv.org/abs/1504.01459

--
regards, Andrei Lepikhov
From cca6ed05cf8128a1e88ea07021ba21953cbc1a6b Mon Sep 17 00:00:00 2001
From: "Andrei V. Lepikhov" <lepi...@gmail.com>
Date: Thu, 31 Jul 2025 14:53:08 +0200
Subject: [PATCH v9 1/2] Sketch

---
 src/backend/optimizer/path/costsize.c | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/src/backend/optimizer/path/costsize.c b/src/backend/optimizer/path/costsize.c
index 344a3188317..c353001c581 100644
--- a/src/backend/optimizer/path/costsize.c
+++ b/src/backend/optimizer/path/costsize.c
@@ -512,7 +512,7 @@ cost_gather_merge(GatherMergePath *path, PlannerInfo *root,
        logN = LOG2(N);
 
        /* Assumed cost per tuple comparison */
-       comparison_cost = 2.0 * cpu_operator_cost;
+       comparison_cost = 3.9 * cpu_operator_cost;
 
        /* Heap creation cost */
        startup_cost += comparison_cost * N * logN;
@@ -2474,7 +2474,7 @@ cost_merge_append(Path *path, PlannerInfo *root,
        logN = LOG2(N);
 
        /* Assumed cost per tuple comparison */
-       comparison_cost = 2.0 * cpu_operator_cost;
+       comparison_cost = 3.9 * cpu_operator_cost;
 
        /* Heap creation cost */
        startup_cost += comparison_cost * N * logN;
-- 
2.50.1
