> On 7 Mar 2023, at 10:51, David Rowley <dgrowle...@gmail.com> wrote:
> 
> On Sun, 5 Mar 2023 at 13:21, Lukas Fittl <lu...@fittl.com> wrote:
>> Alternatively (or in addition) we could consider showing the "ndistinct" 
>> value that is calculated in cost_memoize_rescan - since that's the most 
>> significant contributor to the cache hit ratio (and you can influence that 
>> directly by improving the ndistinct statistics).
> 
> I think the ndistinct estimate plus the est_entries together would be
> useful. I think showing just the hit ratio number might often just
> raise too many questions about how that's calculated. To calculate the
> hit ratio we need to estimate the number of entries that can be kept
> in the cache at once and also the number of input rows and the number
> of distinct values.  We can see the input rows by looking at the outer
> side of the join in EXPLAIN, but we've no idea about the ndistinct or
> how many items the planner thought could be kept in the cache at once.
> 
> The plan node already has est_entries, so it should just be a matter
> of storing the ndistinct estimate in the Path and putting it into the
> Plan node so the executor has access to it during EXPLAIN.

Lukas: do you have an updated patch for this commitfest to address David's
comments?

--
Daniel Gustafsson



Reply via email to