Re: [HACKERS] WIP: Fast GiST index build

Heikki Linnakangas Thu, 11 Aug 2011 03:29:29 -0700

On 10.08.2011 22:44, Alexander Korotkov wrote:

> Manual and readme updates.


Thanks, I'm reviewing these now.

Do we want to expose the level-step and buffersize parameters to users? They've been useful during testing, but I'm thinking we should be able to guess good enough values for them automatically, and just remove the options. It's pretty much impossible for a user to tune them correctly; it would require deep knowledge of the buffering algorithm.

I'm thinking that even when you explicitly turn buffering on, we should still process the first 10000 or so tuples with simple inserts. That way we always have a sample of tuples to calculate the average tuple size from. It's plausible that if the input data is ordered, looking at the first N tuples will give a skewed sample, but I don't think there's much danger of that in practice. Even if the data is ordered, the length of GiST tuples shouldn't vary much.
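To make that concrete, here is a rough sketch of the control flow I have in mind. It is plain Python for illustration, not PostgreSQL code: BuildState, insert() and the field names are all hypothetical, and the 10000 threshold is just the "or so" figure from above.

```python
from dataclasses import dataclass

SAMPLE_TUPLES = 10000  # "10000 or so"; the exact value is an assumption


@dataclass
class BuildState:
    """Hypothetical build state; not the actual PostgreSQL structure."""
    sample_limit: int = SAMPLE_TUPLES
    tuples_seen: int = 0
    total_bytes: int = 0
    buffering: bool = False
    avg_tuple_size: float = 0.0

    def insert(self, tuple_bytes: int) -> str:
        """Route one tuple and return the name of the path that handled it."""
        if self.buffering:
            return "buffered"
        # Still sampling: take the simple insert path and record the size.
        self.tuples_seen += 1
        self.total_bytes += tuple_bytes
        if self.tuples_seen >= self.sample_limit:
            # Sample complete: fix the estimate, then switch to buffering.
            self.avg_tuple_size = self.total_bytes / self.tuples_seen
            self.buffering = True
        return "simple"
```

The point is only that the average is computed once, from tuples that went through the simple path, before any buffer sizing decision is made. How levelstep and pagesPerBuffer would then be derived from avg_tuple_size is deliberately left out.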

What happens if we get the levelstep and pagesPerBuffer estimates wrong? How sensitive is the algorithm to that? Or will we run out of memory? Would it be feasible to adjust those in the middle of the index build, if we e.g. greatly exceed the estimated memory usage?


--
  Heikki Linnakangas
  EnterpriseDB   http://www.enterprisedb.com

--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers
