Re: Global snapshots

Alexey Kondratov Mon, 21 Sep 2020 07:24:43 -0700

On 2020-09-18 00:54, Bruce Momjian wrote:

On Tue, Sep  8, 2020 at 01:36:16PM +0300, Alexey Kondratov wrote:
Thank you for the link!
After a quick look on the Sawada-san's patch set I think that thereare two
major differences:
1. There is a built-in foreign xacts resolver in the [1], which shouldbemuch more convenient from the end-user perspective. It involves hugein-core
changes and additional complexity that is of course worth of.
However, it's still not clear for me that it is possible to resolveallforeign prepared xacts on the Postgres' own side with a 100%guarantee.Imagine a situation when the coordinator node is actually a HA clustergroup(primary + sync + async replica) and it failed just after PREPAREstage of
after local COMMIT. In that case all foreign xacts will be left in the
prepared state. After failover process complete synchronous replicawillbecome a new primary. Would it have all required info to properlyresolve
orphan prepared xacts?
Probably, this situation is handled properly in the [1], but I've notyetfinished a thorough reading of the patch set, though it has a greatdoc!
On the other hand, previous 0003 and my proposed patch rely on eithermanualresolution of hung prepared xacts or usage of externalmonitor/resolver.This approach is much simpler from the in-core perspective, butdoesn't look
as complete as [1] though.
Have we considered how someone would clean up foreign transactions ifthecoordinating server dies? Could it be done manually? Would anexternal
resolver, rather than an internal one, make this easier?

Both Sawada-san's patch [1] and in this thread (e.g. mine [2]) use 2PCwith a special gid format including a xid + server identification info.Thus, one can select from pg_prepared_xacts, get xid and coordinatorinfo, then use txid_status() on the coordinator (or ex-coordinator) toget transaction status and finally either commit or abort these staleprepared xacts. Of course this could be wrapped into some user-levelsupport routines as it is done in the [1].

As for the benefits of using an external resolver, I think that thereare some of them from the whole system perspective:

1) If one follows the logic above, then this resolver could bestateless, it takes all the required info from the Postgres nodesthemselves.

2) Then you can easily put it into container, which make it easier dodeploy to all these 'cloud' stuff like kubernetes.


3) Also you can scale resolvers independently from Postgres nodes.

I do not think that either of these points is a game changer, but we usea very simple external resolver altogether with [2] in our shardingprototype and it works just fine so far.

[1]https://www.postgresql.org/message-id/CA%2Bfd4k4HOVqqC5QR4H984qvD0Ca9g%3D1oLYdrJT_18zP9t%2BUsJg%40mail.gmail.com

[2]https://www.postgresql.org/message-id/3ef7877bfed0582019eab3d462a43275%40postgrespro.ru


--
Alexey Kondratov

Postgres Professional https://www.postgrespro.com
Russian Postgres Company

Re: Global snapshots

Reply via email to