Re: very high replay_lag on 3-node cluster

2019-07-22 Thread Jehan-Guillaume (ioguix) de Rorthais
On Mon, 22 Jul 2019 12:58:47 +0200 Tiemen Ruiten wrote:
[...]
> I've attached a graph of network IO on all servers. The network config is
> identical for all three nodes: 2x bonded gigabit connection to the same
> stacked switch pair.
AFAICS, the network doesn't look saturated.
> Currently I don't

Re: very high replay_lag on 3-node cluster

2019-07-22 Thread Tiemen Ruiten
On Mon, Jul 22, 2019 at 11:28 AM Jehan-Guillaume (ioguix) de Rorthais <iog...@free.fr> wrote:
> Hi,
>
> On Mon, 22 Jul 2019 11:05:57 +0200
> Tiemen Ruiten wrote:
> [...]
> > > Now to my current issue: I took the advice to add more monitoring on
> > > replay lag (using pg_last_xact_replay_timesta

Re: very high replay_lag on 3-node cluster

2019-07-22 Thread Jehan-Guillaume (ioguix) de Rorthais
Hi,
On Mon, 22 Jul 2019 11:05:57 +0200 Tiemen Ruiten wrote:
[...]
> > Now to my current issue: I took the advice to add more monitoring on
> > replay lag (using pg_last_xact_replay_timestamp) and things are not looking
> > good. Last night replication lagged by almost 6 hours on one of the
> > no

Re: very high replay_lag on 3-node cluster

2019-07-22 Thread Tiemen Ruiten
Anyone have an idea? Thanks very much in advance for any reply.
On Fri, Jul 19, 2019 at 1:46 PM Tiemen Ruiten wrote:
> Hello,
>
> In my previous post[1] on this list I brought up an issue with long
> running checkpoints. I reduced checkpoint_timeout to a more reasonable
> value (15m down from 60