Re: [GENERAL] Replication fell out of sync

David Kerr Mon, 02 Mar 2015 16:13:31 -0800

On Mon, Mar 02, 2015 at 04:06:02PM PDT, Adrian Klaver wrote:
> On 03/02/2015 03:25 PM, David Kerr wrote:
> >Howdy,
> >
> >I had an instance where a replica fell out of sync with the master.
> >
> >Now it's in a state where it's unable to catch up because the master has 
> >already removed the WAL segment.
> >
> >(logs)
> >Mar  2 23:10:13 db13 postgres[11099]: [3-1] user=,db=,host= LOG:  streaming 
> >replication successfully connected to primary
> >Mar  2 23:10:13 db13 postgres[11099]: [4-1] user=,db=,host= FATAL:  could 
> >not receive data from WAL stream: FATAL:  requested WAL segment 
> >000000060000047C0000001F has already been removed
> >
> >
> >I was under the impression that when you set up streaming replication, if you 
> >specify a restore command like: restore_command = 'cp /arch/%f %p'
> >
> >Then even if the slave falls out of sync, and the master removes the WAL 
> >segment, as long as you can still retrieve the WAL files, then it can bring 
> >itself back into sync.
> >
> >
> >But that doesn't seem to be happening.
> >
> >The restore_command is working
> ># Slave's $PGDATA/pg_xlog/
> >-rw------- 1 postgres postgres 16777216 Mar  2 21:29 000000060000047C0000001F
> >-rwx------ 1 postgres postgres 16777216 Mar  2 23:13 RECOVERYXLOG
> 
> Trying to figure out why the error occurred at Mar  2 23:10:13 and
> the file shows a time stamp of Mar  2 21:29, especially since you
> say the WAL segments flew past?
> 
> Are there any other WAL files in the slave ~/pg_xlog?


Turns out just that one file had gotten corrupted on copy. When I re-pulled it 
from the source, life was good.

(phew!)
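
For anyone who hits the same thing, here is a rough sketch of how one might
check whether a copied segment differs from the archived one. It is not what I
actually ran. The function names (sha256_of, segments_match) are made up for
illustration, and it assumes both copies are readable as ordinary files. It
only tells you that the two copies differ, not which one is the bad one.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1024 * 1024):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read fixed-size chunks so a 16 MB segment is never held whole in memory.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def segments_match(archive_copy, local_copy):
    """Return True if the two files have the same size and the same SHA-256."""
    a, b = Path(archive_copy), Path(local_copy)
    # A size mismatch is a cheap early sign of a truncated copy.
    if a.stat().st_size != b.stat().st_size:
        return False
    return sha256_of(a) == sha256_of(b)
```

With the paths from this thread, the call would look something like
segments_match("/arch/000000060000047C0000001F",
"pg_xlog/000000060000047C0000001F"), run from the slave's $PGDATA. A False
result means the copy is worth re-pulling.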

Thanks all for looking.


-- 
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general
