There is also http://bucardo.org/wiki/Check_postgres but I haven't been able
to get it to work for monitoring replication.  I am using a similar custom
script as Mahlon, but written in perl.  Looking at Mahlon's code has shown
me an error in how I have been thinking about calculating the replication
lag.  Thanks :)



On Wed, Oct 12, 2011 at 3:28 PM, Mahlon E. Smith <mah...@martini.nu> wrote:

> On Wed, Oct 12, 2011, Brandon Phelps wrote:
>
> > I use Nagios to monitor various things on a few servers and have
> > recently set up a hot-standby server and would obviously like to
> > include the state of streaming replication in my monitoring.
> >
> > [...]
> >
> > The confusion I have is how exactly can I determine just how far
> > behind the replication is during loads?  Currently with no traffic
> > (servers not in production yet) sent_location on the master is
> > "A/10018560" and pg_last_xlog_receive_location() on the standby also
> > returns "A/10018560"... How far apart can these be for me to start
> > worrying?  I could make a bit more sense of all this if they were
> > simple timestamps or something, but the hex values returned boggle my
> > mind.
> >
> > Any advice on these issues or other tips on monitoring the replication
> > would be greatly appreciated.
>
>
> Brandon:  I'm using this script for Mon, you should be able to adapt it
> to whatever language and monitoring system you please.
>
> http://www.martini.nu/misc/db_replication.monitor.txt
>
> --
> Mahlon E. Smith
> http://www.martini.nu/contact.html
>

Reply via email to