Heikki Linnakangas wrote:
I've also been working on a low-level benchmark using a C user-defined
function that exercises just the FSM, showing the raw CPU performance
vs. the current implementation. More on that later, but at the moment it
looks like the new implementation can be faster or slower than the
current one, depending on the table size.
Let me describe this test case first:
- The test program calls RecordAndGetPageWithFreeSpace in a tight loop,
with random values (see the sketch after this list). There's no activity
on the heap. In normal usage, the time spent in
RecordAndGetPageWithFreeSpace is minuscule compared to the heap and
index updates that cause it to be called.
- WAL was placed on a RAM drive. This is of course not how people set up
their database servers, but the point of this test was to measure CPU
speed and scalability. The impact of writing extra WAL is significant
and needs to be taken into account, but that's a separate test and
discussion, and needs to be considered in comparison to the WAL written
by heap and index updates.
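For concreteness, here is a minimal sketch of the kind of C user-defined
function described above. This is not Heikki's actual test code; the
function name fsm_stress, its arguments, and the way the random values are
generated are all assumptions. It simply opens the given relation and calls
RecordAndGetPageWithFreeSpace() in a tight loop (8.3-era backend API):

    /* Hypothetical FSM stress function -- a sketch, not the real test code. */
    #include "postgres.h"
    #include "fmgr.h"
    #include "access/heapam.h"
    #include "storage/bufmgr.h"
    #include "storage/freespace.h"
    #include "utils/rel.h"

    PG_MODULE_MAGIC;

    PG_FUNCTION_INFO_V1(fsm_stress);

    Datum
    fsm_stress(PG_FUNCTION_ARGS)
    {
        Oid         relid = PG_GETARG_OID(0);
        int32       loops = PG_GETARG_INT32(1);
        Relation    rel;
        BlockNumber nblocks;
        int32       i;

        rel = heap_open(relid, AccessShareLock);
        nblocks = RelationGetNumberOfBlocks(rel);

        for (i = 0; i < loops; i++)
        {
            /* Random "old page", "space left on it" and "space needed"
             * values; no heap pages are actually read or written. */
            BlockNumber oldPage = random() % Max(nblocks, 1);
            Size        oldAvail = random() % BLCKSZ;
            Size        needed = random() % BLCKSZ;

            (void) RecordAndGetPageWithFreeSpace(rel, oldPage,
                                                 oldAvail, needed);
        }

        heap_close(rel, AccessShareLock);
        PG_RETURN_INT32(loops);
    }

A matching pgbench custom script would then just be a one-line SELECT of
this function against the test table, run with -c 1 and -c 2 to get the
one- and two-client numbers.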
That said, the test results are pretty interesting.
I ran the test using a custom script with pgbench. I ran it with
different table sizes, and with 1 or 2 clients, on CVS HEAD and a
patched version. The unit is "thousands of RecordAndGetPageWithFreeSpace
calls per second":
Table size    Patched               CVS HEAD
              1 client  2 clients   1 client  2 clients
8 kB              4.59       3.45      62.83      26.85
336 kB           13.85       6.43      41.8       16.55
3336 kB          14.96       6.3       22.45      10.55
33336 kB         14.85       6.56       5.44       4.08
333336 kB        14.48      11.04       0.79       0.74
3333336 kB       12.68      11.5        0.07       0.07
33333336 kB       7.67       5.37       0.05       0.05
The big surprise to me was that performance on CVS HEAD tanks as the
table size increases. One possible explanation is that searches for X
bytes of free space, for a very high X, will not find any matches, and
the current FSM implementation ends up scanning through the whole FSM
list for that relation.
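That explanation fits the shape of the numbers: the current FSM keeps a
per-relation list of page entries in shared memory, and a request that no
page can satisfy degenerates into a scan of the whole list, whose length
grows with the table. A simplified illustration of that worst case
(made-up types and names, not the real freespace.c):

    #include "postgres.h"
    #include "storage/block.h"

    /* Simplified stand-in for the per-relation FSM page list. */
    typedef struct
    {
        BlockNumber page;       /* heap page number */
        Size        avail;      /* free bytes recorded for it */
    } FsmEntry;

    static BlockNumber
    find_page_with_space(FsmEntry *entries, int nentries, Size spaceNeeded)
    {
        int     i;

        for (i = 0; i < nentries; i++)
        {
            if (entries[i].avail >= spaceNeeded)
                return entries[i].page;     /* a hit ends the scan early */
        }
        return InvalidBlockNumber;          /* no hit: every entry visited */
    }

With nentries proportional to the table size and spaceNeeded too large to
ever match, every call pays for the full scan, which is consistent with the
roughly linear slowdown in the CVS HEAD columns.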
Another surprise was how badly both implementations scale. On CVS HEAD,
I expected the performance to be roughly the same with 1 and 2 clients,
because all access to the FSM is serialized on the FreeSpaceLock. But
adding a 2nd client not only failed to help, it actually made performance
much worse than with a single client. Context switching or cache line
contention, perhaps?
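To make the serialization point concrete: the current FSM entry points take
a single exclusive lightweight lock around the shared-memory search. The
following is only a rough paraphrase of that shape, not the real
freespace.c, and the two helper functions are invented:

    #include "postgres.h"
    #include "storage/freespace.h"
    #include "storage/lwlock.h"
    #include "utils/rel.h"

    /* Invented helpers standing in for the shared-memory FSM operations. */
    extern void record_free_space(Relation rel, BlockNumber page, Size avail);
    extern BlockNumber find_free_space_for_rel(Relation rel, Size spaceNeeded);

    BlockNumber
    RecordAndGetPageWithFreeSpace(Relation rel, BlockNumber oldPage,
                                  Size oldSpaceAvail, Size spaceNeeded)
    {
        BlockNumber newPage;

        /* One system-wide lock: a 2nd client can only wait, never overlap. */
        LWLockAcquire(FreeSpaceLock, LW_EXCLUSIVE);
        record_free_space(rel, oldPage, oldSpaceAvail);
        newPage = find_free_space_for_rel(rel, spaceNeeded);
        LWLockRelease(FreeSpaceLock);

        return newPage;
    }

With the lock held exclusively for every call, two clients should at best
match one client; that they do noticeably worse points at something outside
the lock itself, such as the context switching or cache line effects
mentioned above.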
The new FSM implementation shows the same effect, which was an even
bigger surprise. At table sizes > 32 MB, the FSM no longer fits on a
single FSM page, so I expected a close to linear speedup from multiple
clients at the bigger table sizes. That's not happening, and I don't know
why. Going from 33 MB to 333 MB, the performance with 2 clients almost
doubles, but it still doesn't exceed that with 1 client.
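As a rough sanity check on that 32 MB threshold: with the new FSM storing
one byte of free-space information per heap page, a single 8 kB FSM page
holds on the order of 4000 leaf slots (the rest of the page goes to the
internal tree nodes and headers), and 4000 heap pages times 8 kB is about
32 MB of heap. Beyond that the FSM spills onto a second page, which is
where concurrent access could in principle start to spread out.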
It looks like there is a lot of lock contention on the FSM pages. When the
number of FSM pages increases, the number of collisions goes down, which is
probably why 2 clients speed up significantly between 33 MB and 333 MB. I
think it is time to break out DTrace ;-).
Do you have a machine with DTrace support? If not, send me your test suite
and I will try to run it on my machine.
Zdenek
--
Zdenek Kotala Sun Microsystems
Prague, Czech Republic http://sun.com/postgresql