-------------- Original message ---------------------- From: William Garrison <[EMAIL PROTECTED]> > I am looking for records with duplicate keys, so I am running this query: > > SELECT > fileid, COUNT(*) > FROM > file > GROUP BY > fileid > HAVING > COUNT(*)>1 > > The table has an index on fileid (non-unique index) so I am surprised > that postgres is doing a table scan. This database is >15GB, and there > are a number of fairly large string columns in the table. I am very > surprised that scanning the index is not faster than scanning the > table. Any thoughts on that? Is scanning the table faster than > scanning the index? Is there a reason that it needs anything other than > the index? >
I may be missing something, but it would have to scan the entire table to get all the occurrences of each fileid in order to do the count(*). -- Adrian Klaver [EMAIL PROTECTED] -- Sent via pgsql-general mailing list (pgsql-general@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-general