Sorry for the naive question, but is there a problem with ANALYZE doing full table scans? ANALYZE will not lock anything, will it?

Peter

Greg Stark wrote:
Tom Lane <[EMAIL PROTECTED]> writes:

"Ed L." <[EMAIL PROTECTED]> writes:
So, does this sound like we just happened to get repeatedly horribly unrepresentative random samples with stats target at 10? Are we at the mercy of randomness here? Or is there a better preventive procedure we can follow to systematically identify this kind of situation?
I think the real issue is that stats target 10 is too small for large
tables: the samples are just not large enough to support a decent
numdistinct estimate, which is the critical stat for cases such as this
(i.e., estimating the number of hits on a value that's not in the
most-common-values list).
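
(As a concrete illustration of raising the target for one column and
re-sampling -- "mytable" and "mycol" are just placeholder names here:

    -- raise the per-column statistics target above the old default of 10
    ALTER TABLE mytable ALTER COLUMN mycol SET STATISTICS 100;
    -- re-sample the table so the new target takes effect
    ANALYZE mytable;
)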

There's been some discussion on -hackers about this area. Sadly the idea of
using samples to calculate numdistinct estimates is fundamentally on pretty
shaky ground.

Whereas a fixed sample size works fine for estimating the distribution of
values, to get consistent precision for numdistinct estimates the sample
has to be a constant fraction of the table -- and unfortunately a pretty
large fraction at that.
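
(One way to see how shaky the estimate is on a given table is to compare the
sampled figure with the real one; placeholder names again, and note that the
second query is a full table scan:

    -- what ANALYZE's sample produced (negative values mean a fraction of the row count)
    SELECT n_distinct FROM pg_stats
     WHERE tablename = 'mytable' AND attname = 'mycol';
    -- the true number of distinct values, at full-scan cost
    SELECT count(DISTINCT mycol) FROM mytable;
)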

So sadly I think "at the mercy of randomness" is pretty accurate. You'll have
to raise the statistics target as the table grows, and I expect you'll
eventually run into some downsides of large stats targets.
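
(If raising it column by column gets tedious, the default can be bumped as
well, e.g. per session before a manual ANALYZE; 100 here is just an
arbitrary illustrative value:

    -- use a larger default target for this session's ANALYZE runs
    SET default_statistics_target = 100;
    ANALYZE mytable;
)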

Some better algorithms were posted, but they would require full table scans
during analyze, not just samples.
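
(For comparison, the exact figures such an algorithm would compute can
already be obtained by hand, just at the cost of scanning the whole table;
table and column names are placeholders:

    -- exact per-column distinct counts in a single full scan
    SELECT count(DISTINCT col_a) AS col_a_ndistinct,
           count(DISTINCT col_b) AS col_b_ndistinct
      FROM mytable;
)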

