When testing commit c6b9204 with CLOBBER_CACHE_ALWAYS, of the 20 hours for
check-world, 001_rep_changes.pl took 1.8 hours.  At commit 5406513, the test
failed at a poll_query_until() timeout[1].  The slow part is the logical
replication of "DELETE FROM tab_ins WHERE a > 0", which deletes 100 records
from a table of ~1100 records, using RelationFindReplTupleSeq().
tuples_equal() called lookup_type_cache() for every comparison.  Performing
those lookups once per RelationFindReplTupleSeq(), as attached, cut the test's
runtime by an order of magnitude.  While performance for CLOBBER_CACHE_ALWAYS
is not important, this is consistent with record_eq() and is easy.  I'm
slightly inclined not to back-patch it, though.

[1] This seemed to result from the poll query being 2-3x faster at commit
5406513, not from logical replication being slower.  (poll_query_until() times
out after 1800 polls separated by 0.1s sleeps, however long that takes.)  I
had guessed that commit 1c7a0b3 greatly accelerated this test case, but it
gave about a 4% improvement under CLOBBER_CACHE_ALWAYS.
Author:     Noah Misch <n...@leadboat.com>
Commit:     Noah Misch <n...@leadboat.com>

    Optimize RelationFindReplTupleSeq() for CLOBBER_CACHE_ALWAYS.
    
    Specifically, remember lookup_type_cache() results instead of retrieving
    them once per comparison.  Under CLOBBER_CACHE_ALWAYS, this reduced
    src/test/subscription/t/001_rep_changes.pl elapsed time by an order of
    magnitude, which reduced check-world elapsed time by 9%.
    
    Reviewed by FIXME.
    
    Discussion: https://postgr.es/m/FIXME

diff --git a/src/backend/executor/execReplication.c 
b/src/backend/executor/execReplication.c
index 7194bec..58448ba 100644
--- a/src/backend/executor/execReplication.c
+++ b/src/backend/executor/execReplication.c
@@ -225,7 +225,8 @@ retry:
  * Compare the tuples in the slots by checking if they have equal values.
  */
 static bool
-tuples_equal(TupleTableSlot *slot1, TupleTableSlot *slot2)
+tuples_equal(TupleTableSlot *slot1, TupleTableSlot *slot2,
+                        TypeCacheEntry **eq)
 {
        int                     attrnum;
 
@@ -256,12 +257,18 @@ tuples_equal(TupleTableSlot *slot1, TupleTableSlot *slot2)
 
                att = TupleDescAttr(slot1->tts_tupleDescriptor, attrnum);
 
-               typentry = lookup_type_cache(att->atttypid, 
TYPECACHE_EQ_OPR_FINFO);
-               if (!OidIsValid(typentry->eq_opr_finfo.fn_oid))
-                       ereport(ERROR,
-                                       (errcode(ERRCODE_UNDEFINED_FUNCTION),
-                                        errmsg("could not identify an equality 
operator for type %s",
-                                                       
format_type_be(att->atttypid))));
+               typentry = eq[attrnum];
+               if (typentry == NULL)
+               {
+                       typentry = lookup_type_cache(att->atttypid,
+                                                                               
 TYPECACHE_EQ_OPR_FINFO);
+                       if (!OidIsValid(typentry->eq_opr_finfo.fn_oid))
+                               ereport(ERROR,
+                                               
(errcode(ERRCODE_UNDEFINED_FUNCTION),
+                                                errmsg("could not identify an 
equality operator for type %s",
+                                                               
format_type_be(att->atttypid))));
+                       eq[attrnum] = typentry;
+               }
 
                if (!DatumGetBool(FunctionCall2Coll(&typentry->eq_opr_finfo,
                                                                                
        att->attcollation,
@@ -290,12 +297,15 @@ RelationFindReplTupleSeq(Relation rel, LockTupleMode 
lockmode,
        TupleTableSlot *scanslot;
        TableScanDesc scan;
        SnapshotData snap;
+       TypeCacheEntry **eq;
        TransactionId xwait;
        bool            found;
        TupleDesc       desc PG_USED_FOR_ASSERTS_ONLY = RelationGetDescr(rel);
 
        Assert(equalTupleDescs(desc, outslot->tts_tupleDescriptor));
 
+       eq = palloc0(sizeof(*eq) * outslot->tts_tupleDescriptor->natts);
+
        /* Start a heap scan. */
        InitDirtySnapshot(snap);
        scan = table_beginscan(rel, &snap, 0, NULL);
@@ -309,7 +319,7 @@ retry:
        /* Try to find the tuple */
        while (table_scan_getnextslot(scan, ForwardScanDirection, scanslot))
        {
-               if (!tuples_equal(scanslot, searchslot))
+               if (!tuples_equal(scanslot, searchslot, eq))
                        continue;
 
                found = true;

Reply via email to