On 07/10/17 04:19, Tom Lane wrote: > Andres Freund <and...@anarazel.de> writes: >> On 2017-10-06 21:33:16 -0400, Adam Brusselback wrote: >>> The article in question is here: >>> https://blog.jooq.org/2017/09/28/10-cool-sql-optimisations-that-do-not-depend-on-the-cost-model/ > >> That's interesting. > > The impression I have in a quick scan is that probably hardly any of these > are cases that any of the DB designers think are important in themselves. > Rather, they fall out of more general optimization attempts, or not, > depending on the optimization mechanisms in use in a particular DB. > For example, reducing "WHERE 1=1" to "WHERE TRUE" and then to nothing > comes out of a constant-subexpression-precalculation mechanism for us, > whereas "WHERE column=column" doesn't fall to that approach. ISTM it > would be really dumb to expend planner cycles looking specifically for > that case, so I guess that DB2 et al are finding it as a side-effect of > some more general optimization ... I wonder what that is? > > (edit: a few minutes later, I seem to remember that equivclass.c has > to do something special with the X=X case, so maybe it could do > something else special instead, with little new overhead.) >
What it actually does is to specifically skip the processing for X=X (the const expression will be simplified by estimate_expression_value/eval_const_expressions separately). There is comment there that specifically states that it's not worth it to process this as it's rare clause which is equal to X IS NOT NULL. I don't actually agree with the argument of the comment there, since in practice the if the "silly" equality is not there, we'll just waste equal() call and if it is there the optimization seems worth it as it will lead to orders of magnitude better estimation in many cases. So I wrote prototype of achieving this optimization and it seems to be really quite simple code-wise (see attached). I did only a limited manual testing of this but I don't see any negative impact on planning time. Thoughts? -- Petr Jelinek http://www.2ndQuadrant.com/ PostgreSQL Development, 24x7 Support, Training & Services
From a7d6f3fe0a5d42a96c711cc33e6962b9a2f6511f Mon Sep 17 00:00:00 2001 From: Petr Jelinek <pjmo...@pjmodos.net> Date: Sat, 7 Oct 2017 15:10:54 +0200 Subject: [PATCH] Transform X=X expressions into X IS NOT NULL --- src/backend/optimizer/path/equivclass.c | 12 ------------ src/backend/optimizer/plan/initsplan.c | 11 +++++++++++ 2 files changed, 11 insertions(+), 12 deletions(-) diff --git a/src/backend/optimizer/path/equivclass.c b/src/backend/optimizer/path/equivclass.c index 7997f50c18..d4ffdf66f2 100644 --- a/src/backend/optimizer/path/equivclass.c +++ b/src/backend/optimizer/path/equivclass.c @@ -154,18 +154,6 @@ process_equivalence(PlannerInfo *root, RestrictInfo *restrictinfo, collation); /* - * Reject clauses of the form X=X. These are not as redundant as they - * might seem at first glance: assuming the operator is strict, this is - * really an expensive way to write X IS NOT NULL. So we must not risk - * just losing the clause, which would be possible if there is already a - * single-element EquivalenceClass containing X. The case is not common - * enough to be worth contorting the EC machinery for, so just reject the - * clause and let it be processed as a normal restriction clause. - */ - if (equal(item1, item2)) - return false; /* X=X is not a useful equivalence */ - - /* * If below outer join, check for strictness, else reject. */ if (below_outer_join) diff --git a/src/backend/optimizer/plan/initsplan.c b/src/backend/optimizer/plan/initsplan.c index 9931dddba4..1c3672a978 100644 --- a/src/backend/optimizer/plan/initsplan.c +++ b/src/backend/optimizer/plan/initsplan.c @@ -2347,6 +2347,17 @@ process_implied_equality(PlannerInfo *root, return; } } + else if (equal(item1, item2)) + { + NullTest *ntest; + ntest = makeNode(NullTest); + ntest->arg = item1; + ntest->nulltesttype = IS_NOT_NULL; + ntest->argisrow = false; + ntest->location = exprLocation((Node *) clause); + + clause = (Expr *) ntest; + } /* * Push the new clause into all the appropriate restrictinfo lists. -- 2.11.0
-- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers