Re: [HACKERS] Patch: add timing of buffer I/O requests

Greg Smith Sun, 27 Nov 2011 23:55:15 -0800

On 11/27/2011 04:39 PM, Ants Aasma wrote:

On the AMD I saw about 3% performance drop with timing enabled. On the
Intel machine I couldn't measure any statistically significant change.

Oh no, it's party pooper time again. Sorry I have to be the one to doit this round. The real problem with this whole area is that we knowthere are systems floating around where the amount of time taken to grabtimestamps like this is just terrible. I've been annoyed enough by thatproblem to spend some time digging into why that is--seems to be a bunchof trivia around the multiple ways to collect time info on x86systems--and after this CommitFest is over I was already hoping to digthrough my notes and start quantifying that more. So you can't reallyprove the overhead of this approach is acceptable just by showing twoexamples; we need to find one of the really terrible clocks and testthere to get a real feel for the worst-case.

I recall a patch similar to this one was submitted by Greg Stark sometime ago. It used the info for different reasons--to try and figure outwhether reads were cached or not--but I believe it withered rather thanbeing implemented mainly because it ran into the same fundamentalroadblocks here. My memory could be wrong here, there were alsoconcerns about what the data would be used for.

I've been thinking about a few ways to try and cope with this wholeclass of timing problem:

-Document the underlying problem and known workarounds, provide a way totest how bad the overhead is, and just throw our hands up and say"sorry, you just can't instrument like this" if someone has a slow system.

-Have one of the PostgreSQL background processes keep track of a timeestimate on its own, only periodically pausing to sync against the realtime. Then most calls to gettimeofday() can use that value instead. Iwas thinking of that idea for slightly longer running things though; Idoubt that can be made accurate enough to test instrument buffer

And while I hate to kick off massive bike-shedding in your direction,I'm also afraid this area--collecting stats about how long individualoperations take--will need a much wider ranging approach than justlooking at the buffer cache ones. If you step back and ask "what dopeople expect here?", there's a pretty large number who really wantsomething like Oracle's v$session_wait and v$system_event interface forfinding the underlying source of slow things. There's enough demand forthat that EnterpriseDB has even done some work in this area too; whatI've been told about it suggests the code isn't a great fit forcontribution to community PostgreSQL though. Like I said, this area isreally messy and hard to get right.

Something more ambitious like the v$ stuff would also take care of whatyou're doing here; I'm not sure that what you've done helps built itthough. Please don't take that personally. Part of one of my owninstrumentation patches recently was rejected out of hand for the samereason, just not being general enough.


--
Greg Smith   2ndQuadrant US    [email protected]   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support  www.2ndQuadrant.us


--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] Patch: add timing of buffer I/O requests

Reply via email to