Re: Dangerous Naming Confusion

Adrian Klaver Mon, 29 Mar 2021 15:20:22 -0700

On 3/29/21 3:00 PM, Don Seiler wrote:

Good evening,
Please see my gist athttps://gist.github.com/dtseiler/9ef0a5e2b1e0efc6a13d5661436d4056<https://gist.github.com/dtseiler/9ef0a5e2b1e0efc6a13d5661436d4056> fora complete test case.
I tested this on PG 12.6 and 13.2 and observed the same on both.
We were expecting the queries that use dts_temp to only return 3 rows.However the subquery starting at line 36 returns ALL 250,000 rows fromdts_orders. Note that the "order_id" field doesn't exist in the dts_temptable, so I'm assuming PG is using the "order_id" field from thedts_orders table. If I use explicit table references like in the queryat line 48, then I get the error I would expect that the "order_id"column doesn't exist in dts_temp.
When I use the actual column name "a" for dts_temp, then I get the 3rows back as expected.
I'm wondering if this is expected behavior that PG uses thedts_orders.order_id value in the subquery "select order_id fromdts_temp" when dts_temp doesn't have its own order_id column. I wouldhave expected an error that the column doesn't exist. Seems verycounter-intuitive to think PG would use a column from a different table.


See:

https://www.postgresql.org/message-id/[email protected]

This issue was discovered today when this logic was used in an UPDATEand ended up locking all rows in a 5M row table and brought many apps toa grinding halt. Thankfully it was caught and killed before it actuallyupdated anything.
Thanks,
Don.
--
Don Seiler
www.seiler.us <http://www.seiler.us>



--
Adrian Klaver
[email protected]

Re: Dangerous Naming Confusion

Reply via email to