On Fri, Dec 17, 2021 at 09:41:32AM +0000, Chris Lowder wrote: > Thank you for that (speedy) explanation and patch Bruce! That all makes > perfect > sense.
There are actually two questions being addressed here: 1. In what cases are new index entries added, and for which indexes? 2. In what cases are index expressions evaluated? For 1, I explained that for updates we either add new rows for all indexes, or none of them. We create new index rows for non-HOT updates, which happens if the updated row does not change indexed columns (or columns involved in index expressions), and if the new row fits on the same 8k heap page. What I recently learned, at least for PG 10-14, is that the index expression is only called for non-HOT updates, not HOT ones. Postgres does not evaluate the index expression to determine if the indexed expression has changed, but rather relies on column value comparisons. I wrote the attached script which creates an expression index function with a pg_sleep(1) call to determine if the function is called. If you run the attached SQL script in psql, you will see that times of ~1 second happens only when there is a change in the colummn referenced by the index expression or when the new row is stored on a new heap page (ctid page number changes). The expression index function is not called if the indexed column value does not change and remains on the same heap page. I am attaching an updated patch which mentions non-HOT updates for expression indexes. -- Bruce Momjian <br...@momjian.us> https://momjian.us EDB https://enterprisedb.com If only the physical world exists, free will is an illusion.
expr.sql
Description: application/sql
diff --git a/doc/src/sgml/indices.sgml b/doc/src/sgml/indices.sgml index 56fbd45178..023157d888 100644 --- a/doc/src/sgml/indices.sgml +++ b/doc/src/sgml/indices.sgml @@ -748,8 +748,8 @@ CREATE INDEX people_names ON people ((first_name || ' ' || last_name)); <para> Index expressions are relatively expensive to maintain, because the - derived expression(s) must be computed for each row upon insertion - and whenever it is updated. However, the index expressions are + derived expression(s) must be computed for each row insertion + and non-HOT update. However, the index expressions are <emphasis>not</emphasis> recomputed during an indexed search, since they are already stored in the index. In both examples above, the system sees the query as just <literal>WHERE indexedcolumn = 'constant'</literal>