Re: sequences vs. synchronous replication

Tomas Vondra Wed, 22 Dec 2021 11:00:32 -0800

On 12/22/21 18:50, Fujii Masao wrote:

On 2021/12/22 21:11, Tomas Vondra wrote:
Interesting idea, but I think it has a couple of issues :-(
Thanks for the review!
1) We'd need to know the LSN of the last WAL record for any givensequence, and we'd need to communicate that between backends somehow.Which seems rather tricky to do without affecting performance.
How about using the page lsn for the sequence? nextval_internal()already uses that to check whether it's less than or equal to checkpointredo location.


Hmm, maybe.

2) SyncRepWaitForLSN() is used only in commit-like situations, andit's a simple wait, not a decision to write more WAL. Environmentswithout sync replicas are affected by this too - yes, the data lossissue is not there, but the amount of WAL is still increased.
How about reusing only a part of code in SyncRepWaitForLSN()? Attachedis the PoC patch that implemented what I'm thinking.
IIRC sync_standby_names can change while a transaction is running,even just right before commit, at which point we can't just go back intime and generate WAL for sequences accessed earlier. But we stillneed to ensure the sequence is properly replicated.
Yes. In the PoC patch, SyncRepNeedsWait() still checkssync_standbys_defined and uses SyncRepWaitMode. But they should not bechecked nor used because their values can be changed on the fly, as youpointed out. Probably SyncRepNeedsWait() will need to be changed so thatit doesn't use them.

Right. I think the data loss with sync standby is merely a symptom, notthe root cause. We'd need to deduce the LSN for which to wait at commit.

3) I don't think it'd actually reduce the amount of WAL records inenvironments with many sessions (incrementing the same sequence). Inthose cases the WAL (generated by in-progress xact from anothersession) is likely to not be flushed, so we'd generate the extra WALrecord. (And if the other backends would need flush LSN of this newWAL record, which would make it more likely they have to generate WALtoo.)
With the PoC patch, only when previous transaction that executednextval() and caused WAL record is aborted, subsequent nextval()generates additional WAL record. So this approach can reduce WAL volumethan other approach?
 > In the PoC patch, to reduce WAL volume more, it might be better to make
nextval_internal() update XactLastRecEnd and assign XID rather thanemitting new WAL record, when SyncRepNeedsWait() returns true.

Yes, but I think there are other cases. For example the WAL might havebeen generated by another backend, in a transaction that might be stillrunning. In which case I don't see how updating XactLastRecEnd innextval_internal would fix this, right?

I did some experiments with increasing CACHE for the sequence, and thatmostly eliminates the overhead. See the message I sent a couple minutesago. IMHO that's a reasonable solution for the tiny number of peopleusing nextval() in a way that'd be affected by this (i.e. withoutwriting anything else in the xact).



regards

--
Tomas Vondra
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

Re: sequences vs. synchronous replication

Reply via email to