Re: [HACKERS] Proposal : For Auto-Prewarm.

Konstantin Knizhnik Tue, 30 May 2017 00:06:07 -0700

On 27.10.2016 14:39, Mithun Cy wrote:

# pg_autoprewarm.
This a PostgreSQL contrib module which automatically dump all of theblocknumspresent in buffer pool at the time of server shutdown(smart and fastmode only,to be enhanced to dump at regular interval.) and load these blockswhen server restarts.
Design:
------
We have created a BG Worker Auto Pre-warmer which during shutdowndumps all the
blocknum in buffer pool in sorted order.
Format of each entry is<DatabaseId,TableSpaceId,RelationId,Forknum,BlockNum>.Auto Pre-warmer is started as soon as the postmaster is started we donot waitfor recovery to finish and database to reach a consistent state. Ifthere is a
"dump_file" to load we start loading each block entry to buffer pool until
there is a free buffer. This way we do not replace any new blockswhich wasloaded either by recovery process or querying clients. Then it waitsuntil it receives
SIGTERM to dump the block information in buffer pool.

HOW TO USE:
-----------
Build and add the pg_autoprewarm to shared_preload_libraries. AutoPre-warmerprocess automatically do dumping of buffer pool's block info and loadthem when
restarted.

TO DO:
------
Add functionality to dump based on timer at regular interval.
And some cleanups.


I wonder if you considered parallel prewarming of a table?

Right now either with pg_prewarm, either with pg_autoprewarm, preloadingtable's data is performed by one backend.It certainly makes sense if there is just one HDD and we want tominimize impact of pg_prewarm on normal DBMS activity.But sometimes we need to load data in memory as soon as possible. Andmodern systems has larger number of CPU cores and

RAID devices make it possible to efficiently load data in parallel.

I have asked this question in context of my CFS (compressed file system)for Postgres. The customer's complaint was that there are 64 cores athis system but whenhe is building index, decompression of heap data is performed by onlyone core. This is why I thought about prewarm... (parallel indexconstruction is separate story...)

pg_prewarm makes is possible to specify range of blocks, so, inprinciple, it is possible to manually preload table in parallel, byspawining pg_prewarmwith different subranges in several backends. But it is definitely notuser friendly approach.And as far as I understand pg_autoprewarm has all necessaryinfrastructure to do parallel load. We just need to spawn more than onebackground worker and specify

separate block range for each worker.

Do you think that such functionality (parallel autoprewarm) can beuseful and be easily added?


--
Konstantin Knizhnik
Postgres Professional: http://www.postgrespro.com
The Russian Postgres Company



--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] Proposal : For Auto-Prewarm.

Reply via email to