Re: [HACKERS] Simplifying replication

2010-11-12 Thread Bruce Momjian
Robert Haas wrote: > On Thu, Nov 11, 2010 at 10:13 PM, Bruce Momjian wrote: > > Robert Haas wrote: > >> On Thu, Oct 28, 2010 at 1:13 AM, Josh Berkus wrote: > >> > > >> >> I sort of agree with you that the current checkpoint_segments > >> >> parameter is a bit hard to tune, at least if your goal i

Re: [HACKERS] Simplifying replication

2010-11-11 Thread Robert Haas
On Thu, Nov 11, 2010 at 10:13 PM, Bruce Momjian wrote: > Robert Haas wrote: >> On Thu, Oct 28, 2010 at 1:13 AM, Josh Berkus wrote: >> > >> >> I sort of agree with you that the current checkpoint_segments >> >> parameter is a bit hard to tune, at least if your goal is to control >> >> the amount o

Re: [HACKERS] Simplifying replication

2010-11-11 Thread Bruce Momjian
Robert Haas wrote: > On Wed, Oct 27, 2010 at 3:53 AM, Simon Riggs wrote: > > On Tue, 2010-10-26 at 22:03 -0400, Robert Haas wrote: > >> On Tue, Oct 26, 2010 at 9:59 PM, Josh Berkus wrote: > >> > > >> >> If you set wal_keep_segments=0, archive_mode=on, and > >> >> archive_command=, you might run o

Re: [HACKERS] Simplifying replication

2010-11-11 Thread Bruce Momjian
Robert Haas wrote: > On Thu, Oct 28, 2010 at 1:13 AM, Josh Berkus wrote: > > > >> I sort of agree with you that the current checkpoint_segments > >> parameter is a bit hard to tune, at least if your goal is to control > >> the amount of disk space that will be used by WAL files. ?But I'm not > >>

Re: [HACKERS] Simplifying replication

2010-11-06 Thread Dimitri Fontaine
Hannu Krosing writes: >> > To make pg_basebackup.py self-sufficient it should also open 2nd >> > connection to the same master and make sure that all WAL files are >> > copied for the duration of base copy. Done now, please have a look and try it if possible: https://github.com/dimitri/pg_base

Re: [HACKERS] Simplifying replication

2010-11-06 Thread Hannu Krosing
On Sat, 2010-11-06 at 18:02 +0100, Dimitri Fontaine wrote: > Hannu Krosing writes: > > To make pg_basebackup.py self-sufficient it should also open 2nd > > connection to the same master and make sure that all WAL files are > > copied for the duration of base copy. > > Excellent idea, will make th

Re: [HACKERS] Simplifying replication

2010-11-06 Thread Dimitri Fontaine
Hannu Krosing writes: > To make pg_basebackup.py self-sufficient it should also open 2nd > connection to the same master and make sure that all WAL files are > copied for the duration of base copy. Excellent idea, will make that happen soon'ish. Regards, -- Dimitri Fontaine http://2ndQuadrant.f

Re: [HACKERS] Simplifying replication

2010-11-05 Thread Hannu Krosing
On Tue, 2010-10-19 at 10:31 +0200, Dimitri Fontaine wrote: > > 4. I can start a new replica off the master by running a single command-line > > utility on the standby and giving it connection information to the master. > > Using this connection, it should be able to start a backup snapshot, copy >

Re: [HACKERS] Simplifying replication

2010-10-31 Thread Dimitri Fontaine
Josh Berkus writes: >> You have to put the WAL files *somewhere* while you do the base backup. > > Agreed. That's why I like the idea of having a > max_wal_size/min_wal_time instead of keep_wal_segments or > checkpoint_segments. It's relatively simple for a DBA to know how much > disk space s/he

Re: [HACKERS] Simplifying replication

2010-10-28 Thread Robert Haas
On Thu, Oct 28, 2010 at 1:13 AM, Josh Berkus wrote: > >> I sort of agree with you that the current checkpoint_segments >> parameter is a bit hard to tune, at least if your goal is to control >> the amount of disk space that will be used by WAL files.  But I'm not >> sure your proposal is better.  

Re: [HACKERS] Simplifying replication

2010-10-27 Thread Josh Berkus
I sort of agree with you that the current checkpoint_segments parameter is a bit hard to tune, at least if your goal is to control the amount of disk space that will be used by WAL files. But I'm not sure your proposal is better. Instead of having a complicated formula for predicting how much

Re: [HACKERS] Simplifying replication

2010-10-27 Thread Robert Haas
On Wed, Oct 27, 2010 at 5:01 PM, Josh Berkus wrote: > >> You have to put the WAL files *somewhere* while you do the base backup. >> PostgreSQL can't itself work out where that is, nor can it work out >> ahead of time how big it will need to be, since it is up to you how you >> do your base backup.

Re: [HACKERS] Simplifying replication

2010-10-27 Thread Robert Haas
On Wed, Oct 27, 2010 at 3:53 AM, Simon Riggs wrote: > On Tue, 2010-10-26 at 22:03 -0400, Robert Haas wrote: >> On Tue, Oct 26, 2010 at 9:59 PM, Josh Berkus wrote: >> > >> >> If you set wal_keep_segments=0, archive_mode=on, and >> >> archive_command=, you might run out of disk space. >> >> >> >> I

Re: [HACKERS] Simplifying replication

2010-10-27 Thread Josh Berkus
> You have to put the WAL files *somewhere* while you do the base backup. > PostgreSQL can't itself work out where that is, nor can it work out > ahead of time how big it will need to be, since it is up to you how you > do your base backup. Setting a parameter to -1 doesn't make the problem > go a

Re: [HACKERS] Simplifying replication

2010-10-27 Thread Tom Lane
"Kevin Grittner" writes: > Josh Berkus wrote: >>> Except that changing wal_keep_segments doesn't require restarting >>> the master. >> >> Our docs say that it does: >> This parameter can only be set in the postgresql.conf file or on >> the server command line. > That sounds as though a reload

Re: [HACKERS] Simplifying replication

2010-10-27 Thread Kevin Grittner
Josh Berkus wrote: >> Except that changing wal_keep_segments doesn't require restarting >> the master. > > Our docs say that it does: > This parameter can only be set in the postgresql.conf file or on > the server command line. That sounds as though a reload would do it; I don't see that indi

Re: [HACKERS] Simplifying replication

2010-10-27 Thread Josh Berkus
It is the same to the user either way. In either case you have to change some settings and restart the master. Except that changing wal_keep_segments doesn't require restarting the master. Our docs say that it does: This parameter can only be set in the postgresql.conf file or on the serve

Re: [HACKERS] Simplifying replication

2010-10-27 Thread Simon Riggs
On Tue, 2010-10-26 at 22:03 -0400, Robert Haas wrote: > On Tue, Oct 26, 2010 at 9:59 PM, Josh Berkus wrote: > > > >> If you set wal_keep_segments=0, archive_mode=on, and > >> archive_command=, you might run out of disk space. > >> > >> If you set wal_keep_segments=-1, you might run out of disk spa

Re: [HACKERS] Simplifying replication

2010-10-26 Thread Robert Haas
On Tue, Oct 26, 2010 at 9:59 PM, Josh Berkus wrote: > >> If you set wal_keep_segments=0, archive_mode=on, and >> archive_command=, you might run out of disk space. >> >> If you set wal_keep_segments=-1, you might run out of disk space. >> >> Are you any more screwed in the second case than you are

Re: [HACKERS] Simplifying replication

2010-10-26 Thread Josh Berkus
> If you set wal_keep_segments=0, archive_mode=on, and > archive_command=, you might run out of disk space. > > If you set wal_keep_segments=-1, you might run out of disk space. > > Are you any more screwed in the second case than you are in the first > case? It is the same to the user either w

Re: [HACKERS] Simplifying replication

2010-10-26 Thread Robert Haas
On Tue, Oct 26, 2010 at 8:27 AM, Simon Riggs wrote: > On Thu, 2010-10-21 at 20:57 -0400, Robert Haas wrote: > >> Very true.  But the lack of a -1 setting for wal_keep_segments means >> that if you would like to take a backup without archiving, you must >> set wal_keep_segments to a value greater t

Re: [HACKERS] Simplifying replication

2010-10-26 Thread Simon Riggs
On Thu, 2010-10-21 at 20:57 -0400, Robert Haas wrote: > Very true. But the lack of a -1 setting for wal_keep_segments means > that if you would like to take a backup without archiving, you must > set wal_keep_segments to a value greater than or equal to the rate at > which you generate WAL segmen

Re: [HACKERS] Simplifying replication

2010-10-26 Thread Josh Berkus
> What happens if max_wal_size is less than checkpoint_segments? > Currently a checkpoint tries to leave WAL files which were generated > from the prior ckpt start to current ckpt end. Because those WAL files > are required for crash recovery. But we should delete some of them > according to max_w

Re: [HACKERS] Simplifying replication

2010-10-24 Thread Fujii Masao
On Sat, Oct 23, 2010 at 2:34 AM, Josh Berkus wrote: > When we get close enough to max_wal_size (we'll need a couple segments of > leeway, I think), we start recycling WAL segments even if they are less that > min_wal_time old. What happens if max_wal_size is less than checkpoint_segments? Current

Re: [HACKERS] Simplifying replication

2010-10-22 Thread Josh Berkus
max_wal_size min_wal_size [ scratches head... ] What's the functional effect of min_wal_size, exactly? Replaces wal_keep_segments. The rename is to make the GUCs obviously symmetrical, and to make it clear that the *meaning* of the variable has changed. Even better would be to replace

Re: [HACKERS] Simplifying replication

2010-10-22 Thread Tom Lane
Josh Berkus writes: > What would be sensible for DBAs is to have two settings: > max_wal_size > min_wal_size [ scratches head... ] What's the functional effect of min_wal_size, exactly? > Even better would be to replace min_wal_size with min_wal_time, which > would set a time span for the old

Re: [HACKERS] Simplifying replication

2010-10-22 Thread Josh Berkus
Please see http://archives.postgresql.org/pgsql-docs/2010-10/msg00038.php Ye gods and little fishes! You really want to talk arcane formulas. I've re-read that three times, and am still not sure that I could tell someone definitively how much disk space WAL needs for a given group of setti

Re: [HACKERS] Simplifying replication

2010-10-21 Thread Fujii Masao
On Fri, Oct 22, 2010 at 11:03 AM, Josh Berkus wrote: > >> I think it's pretty well explained in the fine manual. >> >> http://www.postgresql.org/docs/current/static/runtime-config-wal.html#GUC-WAL-KEEP-SEGMENTS > > Nope.  No relationship to checkpoint_segments is explained there.  Try > again? Pl

Re: [HACKERS] Simplifying replication

2010-10-21 Thread Robert Haas
On Thu, Oct 21, 2010 at 10:03 PM, Josh Berkus wrote: > >> I think it's pretty well explained in the fine manual. >> >> http://www.postgresql.org/docs/current/static/runtime-config-wal.html#GUC-WAL-KEEP-SEGMENTS > > Nope.  No relationship to checkpoint_segments is explained there.  Try > again? We

Re: [HACKERS] Simplifying replication

2010-10-21 Thread Josh Berkus
> I think it's pretty well explained in the fine manual. > > http://www.postgresql.org/docs/current/static/runtime-config-wal.html#GUC-WAL-KEEP-SEGMENTS Nope. No relationship to checkpoint_segments is explained there. Try again? >> If checkpoint_segments were a hard limit, then we could let a

Re: [HACKERS] Simplifying replication

2010-10-21 Thread Robert Haas
On Thu, Oct 21, 2010 at 9:09 PM, Josh Berkus wrote: > >>> Very true.  But the lack of a -1 setting for wal_keep_segments means >>> that if you would like to take a backup without archiving, you must >>> set wal_keep_segments to a value greater than or equal to the rate at >>> which you generate WA

Re: [HACKERS] Simplifying replication

2010-10-21 Thread Josh Berkus
>> Very true. But the lack of a -1 setting for wal_keep_segments means >> that if you would like to take a backup without archiving, you must >> set wal_keep_segments to a value greater than or equal to the rate at >> which you generate WAL segments multiplied by the time it takes you to >> run a

Re: [HACKERS] Simplifying replication

2010-10-21 Thread Bruce Momjian
Robert Haas wrote: > > However the user-unfriendliness isn't the fact that administrators > > need to determine how much disk they're willing to dedicate to > > Postgres. The user-unfriendliness is that they then have to specify > > this in terms of WAL log files and also have to know that we somet

Re: [HACKERS] Simplifying replication

2010-10-21 Thread Robert Haas
On Thu, Oct 21, 2010 at 8:52 PM, Greg Stark wrote: > On Thu, Oct 21, 2010 at 5:46 PM, Josh Berkus wrote: >> >>> Agreed, but as a reality check:  when I proposed that wal_keep_segments >>> = -1 would keep all WAL segments (for use while the file system was >>> being backed up), I was told administ

Re: [HACKERS] Simplifying replication

2010-10-21 Thread Greg Stark
On Thu, Oct 21, 2010 at 5:46 PM, Josh Berkus wrote: > >> Agreed, but as a reality check:  when I proposed that wal_keep_segments >> = -1 would keep all WAL segments (for use while the file system was >> being backed up), I was told administrators shoud compute how much free >> disk space they had.

Re: [HACKERS] Simplifying replication

2010-10-21 Thread Josh Berkus
> Agreed, but as a reality check: when I proposed that wal_keep_segments > = -1 would keep all WAL segments (for use while the file system was > being backed up), I was told administrators shoud compute how much free > disk space they had. Obviously easy of use is not our #1 priority. Depends.

Re: [HACKERS] Simplifying replication

2010-10-21 Thread Robert Haas
On Thu, Oct 21, 2010 at 8:22 PM, Bruce Momjian wrote: > Josh Berkus wrote: >> Greg, >> >> > The way things stand you *always* need archived logs. Even if you have >> > streaming set up it might try to use archived logs if it falls too far >> > behind. >> >> Actually, you don't.  If you're willing

Re: [HACKERS] Simplifying replication

2010-10-21 Thread Bruce Momjian
Josh Berkus wrote: > Greg, > > > The way things stand you *always* need archived logs. Even if you have > > streaming set up it might try to use archived logs if it falls too far > > behind. > > Actually, you don't. If you're willing to accept possible > desynchronization and recloning of the st

Re: [HACKERS] Simplifying replication

2010-10-21 Thread Mark Kirkwood
On 19/10/10 13:16, Josh Berkus wrote: Robert asked me to write this up, so here it is. It is critical that we make replication easier to set up, administrate and monitor than it currently is. In my conversations with people, this is more important to our users and the adoption of PostgreSQL

Re: [HACKERS] Simplifying replication

2010-10-20 Thread Robert Treat
On Tue, Oct 19, 2010 at 11:16 AM, Greg Smith wrote: > Josh Berkus wrote: > >> It is critical that we make replication easier to set up, administrate and >> monitor than it currently is. In my conversations with people, this is more >> important to our users and the adoption of PostgreSQL than sy

Re: [HACKERS] Simplifying replication

2010-10-20 Thread Dimitri Fontaine
Josh Berkus writes: > Well, one thing to be addressed is separating the PITR functionality from > replication. PITR needs a lot of features -- timelines, recovery stop > points, etc. -- which replication doesn't need or want. I think that > focussing on streaming replication functionality and ig

Re: [HACKERS] Simplifying replication

2010-10-19 Thread Josh Berkus
Greg, > The way things stand you *always* need archived logs. Even if you have > streaming set up it might try to use archived logs if it falls too far > behind. Actually, you don't. If you're willing to accept possible desynchronization and recloning of the standbys, then you can skip the archi

Re: [HACKERS] Simplifying replication

2010-10-19 Thread Greg Stark
On Tue, Oct 19, 2010 at 9:16 AM, Josh Berkus wrote: > Well, one thing to be addressed is separating the PITR functionality from > replication.  PITR needs a lot of features -- timelines, recovery stop > points, etc. -- which replication doesn't need or want.  I think that > focussing on streaming

Re: [HACKERS] Simplifying replication

2010-10-19 Thread Josh Berkus
Dimitri, Greg, I want to say a big big +1 here. The way replication and PITR setup are implemented now are a very good prototype, it's time to consolidate and get to something usable by normal people, as opposed to PostgreSQL full time geeks. Well, one thing to be addressed is separating the P

Re: [HACKERS] Simplifying replication

2010-10-19 Thread Greg Smith
Josh Berkus wrote: *shrug*. Robert asked me to write it up for the list based on the discussions around synch rep. Now you're going to bash me for doing so? Sorry, next time I'll make sure to bash Robert too. I don't have any problems with the basic ideas you're proposing, just concerns abo

Re: [HACKERS] Simplifying replication

2010-10-19 Thread Josh Berkus
You should enjoy one of the patches we're furiously working on then, which is aiming at some of the administration and monitoring pieces here. Great, glad to hear it! Would you be willing to go into detail? I have my own grand vision of how easy replication should be to setup too. So, sha

Re: [HACKERS] Simplifying replication

2010-10-19 Thread Greg Smith
Josh Berkus wrote: It is critical that we make replication easier to set up, administrate and monitor than it currently is. In my conversations with people, this is more important to our users and the adoption of PostgreSQL than synchronous replication is. You should enjoy one of the patches

Re: [HACKERS] Simplifying replication

2010-10-19 Thread Dimitri Fontaine
Hi, Josh Berkus writes: > It is critical that we make replication easier to set up, administrate and > monitor than it currently is. In my conversations with people, this is more > important to our users and the adoption of PostgreSQL than synchronous > replication is. I want to say a big big +

Re: [HACKERS] Simplifying replication

2010-10-18 Thread Brendan Jurd
On 19 October 2010 11:16, Josh Berkus wrote: > 4. I can start a new replica off the master by running a single command-line > utility on the standby and giving it connection information to the master. >  Using this connection, it should be able to start a backup snapshot, copy > the entire databas

[HACKERS] Simplifying replication

2010-10-18 Thread Josh Berkus
Robert asked me to write this up, so here it is. It is critical that we make replication easier to set up, administrate and monitor than it currently is. In my conversations with people, this is more important to our users and the adoption of PostgreSQL than synchronous replication is. Firs