Re: Notes on physical replica failover with logical publisher or subscriber

Alexey Kondratov Mon, 30 Nov 2020 09:34:38 -0800

Hi Craig,

On 2020-11-30 06:59, Craig Ringer wrote:


https://wiki.postgresql.org/wiki/Logical_replication_and_physical_standby_failover

Thank you for sharing these notes. I have not dealt a lot withphysical/logical replication interoperability, so those were mostly newproblems for me to know.


One point from the wiki page, which seems clear enough to me:

```

Logical slots can fill pg_wal and can't benefit from archiving. Teachthe logical decoding page read callback how to use the restore_commandto retrieve WAL segs temporarily if they're not found in pg_wal...

```

It does not look like a big deal to teach logical decoding process touse restore_command, but I have some doubts about how everything willperform in the case when we started getting WAL from archive fordecoding purposes. If we started using restore_command, then subscriberlagged long enough to exceed max_slot_wal_keep_size. Taking into accountthat getting WAL files from the archive has an additional overhead andthat primary continues generating (and archiving) new segments, there isa possibility for primary to start doing this double duty forever ---archive WAL file at first and get it back for decoding when requested.

Another problem is that there are maybe several active decoders, IIRC,so they would have better to communicate in order to avoid fetching thesame segment twice.


I tried to address many of these issues with failover slots, but I am
not trying to beat that dead horse now. I know that at least some
people here are of the opinion that effort shouldn't go into
logical/physical replication interoperation anyway - that we should
instead address the remaining limitations in logical replication so
that it can provide complete HA capabilities without use of physical
replication. So for now I'm just trying to save others who go looking
into these issues some time and warn them about some of the less
obvious booby-traps.

Another point to add regarding logical replication capabilities to buildlogical-only HA system --- logical equivalent of pg_rewind. At least Ihave not noticed anything after brief reading of the wiki page. IIUC,currently there is no way to quickly return ex-primary (ex-logicalpublisher) into HA-cluster without doing a pg_basebackup, isn't it? Itseems that we should have the same problem here as with physicalreplication --- ex-primary may accept some xacts after promotion of newprimary, so their history diverges and old primary should be rewoundbefore being returned as standby (subscriber).



Regards
--
Alexey Kondratov

Postgres Professional https://www.postgrespro.com
Russian Postgres Company

Re: Notes on physical replica failover with logical publisher or subscriber

Reply via email to