[HACKERS] Configuring synchronous replication

Heikki Linnakangas Fri, 17 Sep 2010 01:10:29 -0700

(changed subject again.)

On 17/09/10 10:06, Simon Riggs wrote:

I don't think we can determine how far to implement without considering
both approaches in detail. With regard to your points below, I don't
think any of those points could be committed first.

Yeah, I think we need to decide on the desired feature set first, beforewe dig deeper into the the patches. The design and implementation willfall out of that.

That said, there's a few small things that can be progressed regardlessof the details of synchronous replication. There's the changes totrigger failover with a signal, and it seems that we'll need some libpqchanges to allow acknowledgments to be sent back to the masterregardless of the rest of the design. We can discuss those in separatethreads in parallel.

So the big question is what the user interface looks like. How does oneconfigure synchronous replication, and what options are available.Here's a list of features that have been discussed. We don't necessarilyneed all of them in the first phase, but let's avoid painting ourselvesin the corner.


* Support multiple standbys with various synchronization levels.

* What happens if a synchronous standby isn't connected at the moment?Return immediately vs. wait forever.


* Per-transaction control. Some transactions are important, others are not.

* Quorum commit. Wait until n standbys acknowledge. n=1 and n=allservers can be seen as important special cases of this.


* async, recv, fsync and replay levels of synchronization.

So what should the user interface be like? Given the 1st and 2ndrequirement, we need standby registration. If some standbys areimportant and others are not, the master needs to distinguish betweenthem to be able to determine that a transaction is safely delivered tothe important standbys.

For per-transaction control, ISTM it would be enough to have a simpleuser-settable GUC like synchronous_commit. Let's call it"synchronous_replication_commit" for now. For non-critical transactions,you can turn it off. That's very simple for developers to understand anduse. I don't think we need more fine-grained control than that attransaction level, in all the use cases I can think of you have a streamof important transactions, mixed with non-important ones like logmessages that you want to finish fast in a best-effort fashion. I'mactually tempted to tie that to the existing synchronous_commit GUC, theuse case seems exactly the same.

OTOH, if we do want fine-grained per-transaction control, a simpleboolean or even an enum GUC doesn't really cut it. For trulyfine-grained control you want to be able to specify exceptions like"wait until this is replayed in slave named 'reporting'" or 'don't waitfor acknowledgment from slave named 'uk-server'". With standbyregistration, we can invent a syntax for specifying overriding rules inthe transaction. Something like SET replication_exceptions ='reporting=replay, uk-server=async'.

For the control between async/recv/fsync/replay, I like to think interms of

a) asynchronous vs synchronous
b) if it's synchronous, how synchronous is it? recv, fsync or replay?

I think it makes most sense to set sync vs. async in the master, and thelevel of synchronicity in the slave. Although I have sympathy for theargument that it's simpler if you configure it all from the master sideas well.

Putting all of that together. I think Fujii-san's standby.conf is prettyclose. What it needs is the additional GUC for transaction-level control.


--
  Heikki Linnakangas
  EnterpriseDB   http://www.enterprisedb.com

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

[HACKERS] Configuring synchronous replication

Reply via email to