Re: [BUGS] BUG #2930: Hash join abyssmal with many null fields.

Maciej Babinski Fri, 26 Jan 2007 12:10:25 -0800

Tom Lane wrote:

"Maciej Babinski" <[EMAIL PROTECTED]> writes:

Hash join of columns with many null fields is very slow unless the null
fields are commented out.


I see no bug here.  AFAICT your "much faster" query gets that way by
having eliminated all the candidate join rows on the B side.

                        regards, tom lane

The additional clause eliminates no rows beyond what the existing clause
would. Any row eliminated by "b.join_id IS NOT NULL" could not possibly have
satisfied "a.join_id = b.join_id".

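A rough way to check the NULL semantics behind that claim is sketched below. It is not PostgreSQL: it assumes Python's built-in sqlite3 module as a stand-in engine, uses 100 rows rather than 10000, and the helper name count_join_rows is made up for the illustration. It only shows that both forms of the join return the same (empty) result; it says nothing about timing.

```python
import sqlite3

def count_join_rows(extra_clause=""):
    """Count rows from the join when every join_id is NULL.

    Hypothetical helper for illustration; SQLite stands in for PostgreSQL.
    """
    conn = sqlite3.connect(":memory:")
    cur = conn.cursor()
    cur.execute("CREATE TABLE a (id integer, join_id integer)")
    cur.execute("CREATE TABLE b (id integer, join_id integer)")
    # Only id is populated, so join_id is NULL in every row of both tables.
    rows = [(i,) for i in range(1, 101)]
    cur.executemany("INSERT INTO a (id) VALUES (?)", rows)
    cur.executemany("INSERT INTO b (id) VALUES (?)", rows)
    cur.execute(
        "SELECT count(*) FROM a JOIN b ON a.join_id = b.join_id " + extra_clause
    )
    n = cur.fetchone()[0]
    conn.close()
    return n

# NULL = NULL is never true, so neither form of the join yields any rows.
plain = count_join_rows()
filtered = count_join_rows("AND b.join_id IS NOT NULL")
print(plain, filtered)
```

Both counts come out equal, which is the sense in which the extra clause is implied by the equality condition.
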
Please note that if the join columns are not null, but still produce no
matches for the join, the results are fast without the need for an extra
clause in the join:


DROP TABLE a;
DROP TABLE b;

CREATE TABLE a (id integer, join_id integer);
CREATE TABLE b (id integer, join_id integer);

INSERT INTO a (id) SELECT generate_series(1,10000);
INSERT INTO b (id) SELECT generate_series(1,10000);

ANALYZE a;
ANALYZE b;

EXPLAIN ANALYZE SELECT * FROM a JOIN b ON a.join_id = b.join_id; /* 14 seconds */

EXPLAIN ANALYZE SELECT * FROM a JOIN b ON a.join_id = b.join_id AND b.join_id IS NOT NULL; /* 5ms */


UPDATE a SET join_id=1;
UPDATE b SET join_id=2;

EXPLAIN ANALYZE SELECT * FROM a JOIN b ON a.join_id = b.join_id; /* 72ms */

EXPLAIN ANALYZE SELECT * FROM a JOIN b ON a.join_id = b.join_id AND b.join_id != 2; /* 48ms */

It seems to me that such a wild disparity in performance due to the
addition of a clause that is implied by the existing clause should be
considered a bug, but if I need to submit a feature request for the
optimizer, then I'd be happy to. Thanks!

Maciej Babinski

