Re: [HACKERS] Hard limit on WAL space used (because PANIC sucks)

MauMau Sun, 09 Jun 2013 15:41:15 -0700

From: "Craig Ringer" <cr...@2ndquadrant.com>

On 06/09/2013 08:32 AM, MauMau wrote:


- Failure of a disk containing data directory or tablespace
If checkpoint can't write buffers to disk because of disk failure,
checkpoint cannot complete, thus WAL files accumulate in pg_xlog/.
This means that one disk failure will lead to postgres shutdown.


I've seen a couple of people bitten by the misunderstanding that
tablespaces are a way to split up your data based on different
reliability requirements, and I really need to write a docs patch for
http://www.postgresql.org/docs/current/static/manage-ag-tablespaces.html
<http://www.postgresql.org/docs/9.2/static/manage-ag-tablespaces.html>
that adds a prominent warning like:

WARNING: Every tablespace must be present before the database can be
started. There is no easy way to recover the database if a tablespace is
lost to disk failure, deletion, use of volatile storage, etc. <b>Do not
put a tablespace on a RAM disk</b>; instead just use UNLOGGED tables.

(Opinions on the above?)

Yes, I'm sure this is useful for DBAs to know how postgres behaves and
take some preparations. However, this does not apply to my case, because
I'm using tablespaces for I/O distribution across multiple disks and
simply for database capacity.

The problem is that the reliability of the database system decreases
with more disks, because failure of any one of those disks would result
in a database PANIC shutdown.

I'd rather like to be able to recover from this by treating the
tablespace as dead, so any attempt to get a lock on any table within it
fails with an error and already-in-WAL writes to it just get discarded.
It's the sort of thing that'd only be reasonable to do as a recovery
option (like zero_damaged_pages) since if applied by default it'd lead
to potentially severe and unexpected data loss.

I'm in favor of taking a tablespace offline when an I/O failure is
encountered, and continuing to run the database server. But WAL must not
be discarded, because committed transactions must be preserved for the
durability of ACID.


Postgres needs to take these steps when it encounters an I/O error:

1. Take the tablespace offline, so that subsequent read/write against it
returns an error without actually issuing read/write against data files.


2. Discard shared buffers containing data in the tablespace.

WAL is not affected by the offlining of tablespaces. WAL records already
written on the WAL buffer will be written to pg_xlog/ and archived as
usual. Those WAL records will be used to recover committed transactions
during archive recovery.
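
To make the proposal concrete, here is a toy model of the two steps in
Python. It is only a sketch of the idea, not PostgreSQL code: the names
ToyServer, TablespaceOfflineError and handle_io_error are hypothetical,
and the "WAL" and "shared buffers" are plain in-memory structures that
assume one WAL record per page write.

```python
class TablespaceOfflineError(Exception):
    """Raised when a read or write targets an offline tablespace."""


class ToyServer:
    """Toy model of the proposal; not how PostgreSQL is implemented."""

    def __init__(self, tablespaces):
        self.online = set(tablespaces)
        # (tablespace, page) -> data for pages held in memory
        self.shared_buffers = {}
        # Append-only log of (tablespace, page, data) records
        self.wal = []

    def _check_online(self, tablespace):
        if tablespace not in self.online:
            raise TablespaceOfflineError(f"tablespace {tablespace!r} is offline")

    def write(self, tablespace, page, data):
        self._check_online(tablespace)
        # Log the change first, then modify the buffered page.
        self.wal.append((tablespace, page, data))
        self.shared_buffers[(tablespace, page)] = data

    def read(self, tablespace, page):
        self._check_online(tablespace)
        return self.shared_buffers.get((tablespace, page))

    def handle_io_error(self, tablespace):
        # Step 1: take the tablespace offline, so later reads and writes
        # fail without touching the data files.
        self.online.discard(tablespace)
        # Step 2: discard shared buffers that belong to the tablespace.
        self.shared_buffers = {
            key: data
            for key, data in self.shared_buffers.items()
            if key[0] != tablespace
        }
        # The WAL is deliberately left untouched, so committed changes
        # can still be replayed during archive recovery.


server = ToyServer(["ts_a", "ts_b"])
server.write("ts_a", 1, "row in a")
server.write("ts_b", 1, "row in b")
server.handle_io_error("ts_b")  # pretend the disk under ts_b failed
```

After handle_io_error("ts_b"), access to ts_a keeps working, access to
ts_b raises TablespaceOfflineError, and server.wal still holds both
records, including the one for ts_b.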


Regards
MauMau



--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers
