Excellent! Thanks. One other quick question... What would happen if I didn't delete the recovery.conf file? Is that step just to prevent accidentally restarting the server with it there?
On Tue, Apr 14, 2009 at 6:26 PM, Erik Jones <ejo...@engineyard.com> wrote: > > On Apr 14, 2009, at 3:47 PM, Dan Hayes wrote: > > I've followed the implementation instructions at 24.4.2: >> http://www.postgresql.org/docs/current/static/warm-standby.html >> >> And I've used the archive/restore commands from the example in F23.2: >> http://www.postgresql.org/docs/current/static/pgstandby.html >> >> This all works great. The primary backs up the WAL archives to a mutually >> accessible folder and the backup system is continuously reading in the logs. >> Monitoring the "standby.log" file shows that the system is working as it >> should. >> >> My question is, if the main system goes down, what do I do to the backup >> machine to make it come alive? Currently, when I attempt to login to the >> database, I get the error: "psql: FATAL: the database is starting up" I'm >> assuming this means it can't be accessed while its in continuous recovery >> mode. I attempted to turn off postgresql on the main server and try >> connecting again, same error (was obvious, but I do things one step at a >> time...) So then I removed the "recovery.conf" file on the backup server >> and attempted to restart postgresql. It stopped fine, but the restart >> failed. >> >> The pg_log/postgresql-XXXX-XX-XX.log file says: >> LOG: database system was interrupted while in recovery at log time >> 2009-04-14 17:36:14 CDT >> HINT: If this has occurred more than once some data might be corrupted >> and you might need to choose an earlier recovery target. >> LOG: could not open file "pg_xlog/0000000100000002000000DD" (log file 2, >> segment 221): No such file or directory >> LOG: invalid primary checkpoint record >> LOG: could not open file "pg_xlog/0000000100000002000000DC" (log file 2, >> segment 220): No such file or directory >> LOG: invalid secondary checkpoint record >> PANIC: could not locate a valid checkpoint record >> LOG: startup process (PID 3756) was terminated by signal 6: Aborted >> LOG: aborting startup due to startup process failure >> >> Any ideas what I'm doing wrong? How do I bring the standby server out of >> standby do act as the primary? >> > > You should use pg_standby's -t flag to specify a stop file. Then all you > need to do is touch (create) that file and pg_standby will let the server > come out of recovery mode into normal operation mode. Be sure to rm or mv > the recovery.conf once that is complete. > > Erik Jones, Database Administrator > Engine Yard > Support, Scalability, Reliability > 866.518.9273 x 260 > Location: US/Pacific > IRC: mage2k > > > > > >