Re: [HACKERS] COMMIT NOWAIT Performance Option

Heikki Linnakangas Thu, 01 Mar 2007 05:55:26 -0800

Zeugswetter Andreas ADI SD wrote:

Maybe a suitable replacement for full-page would be to sync the
first
WAL record for a page change before writing the buffer
We *always* sync WAL records for page changes before writingthe buffer for the page.
Um, is that so ? And how is that done ? (e.g. bgwriter would need to
wait for or even initiate a WAL record sync)
Seems this is not clear to others eighter.

Yes. The bgwriter or backend that flushes the buffer indeed initiates aWAL sync if the WAL record corresponding the LSN of the buffer hasn'tbeen flushed yet.

Why else would people think we need a full data scan after crash without
full-pages ?


They're confused. ;-)

If this is true, all pages with possibly pending IO during crash will be
touched and could thus be checked during startup recovery.


That's right. Torn page detection wouldn't require any extra I/O, just CPU.

I can see that we need to sync wal before write for certain changes, but
I don't see why we would currently need it for e.g. a heap insert, that
gets a free slot.

The corresponding index insert might hit the disk but the heap insertmight not. Now if someone after the crash uses the same heap slot, you'dend up with the index tuple of the aborted transaction pointing to wrongheap tuple. There might be other error scenarios, that's just the firstexample I could think of.

In practice, it's extremely rare for a writer to need to do the WALflush. When a buffer is chosen as a victim, it's been unused for sometime, and the WAL has already been flushed. It does happen when you'revacuuming, see Simon's recent thread, but we should fix that anyway. Italso happens on checkpoints, but it's not a big performance issue.


--
  Heikki Linnakangas
  EnterpriseDB   http://www.enterprisedb.com

---------------------------(end of broadcast)---------------------------
TIP 5: don't forget to increase your free space map settings

Re: [HACKERS] COMMIT NOWAIT Performance Option

Reply via email to