Re: [HACKERS] buildfarm failures on smew and anole

2013-10-17 Thread Andres Freund
On 2013-10-16 09:35:46 -0400, Robert Haas wrote: > Gah. I fixed one instance of that problem in test_config_settings(), > but missed the other. Maybe it'd be better to default to none, just as max_connections defaults to 1 and shared_buffers to 16? As we write out the value in the config file, ev

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-16 Thread Andres Freund
On 2013-10-16 09:44:32 -0400, Robert Haas wrote: > On Wed, Oct 16, 2013 at 9:37 AM, Andres Freund wrote: > > On 2013-10-16 09:35:46 -0400, Robert Haas wrote: > >> Gah. I fixed one instance of that problem in test_config_settings(), > >> but missed the other. > > > > Maybe it'd be better to defaul

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-16 Thread Robert Haas
On Wed, Oct 16, 2013 at 9:37 AM, Andres Freund wrote: > On 2013-10-16 09:35:46 -0400, Robert Haas wrote: >> Gah. I fixed one instance of that problem in test_config_settings(), >> but missed the other. > > Maybe it'd be better to default to none, just as max_connections > defaults to 1 and shared

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-16 Thread Robert Haas
On Wed, Oct 16, 2013 at 8:54 AM, Andres Freund wrote: > On 2013-10-16 08:39:10 -0400, Robert Haas wrote: >> On Tue, Oct 15, 2013 at 11:17 PM, Peter Eisentraut wrote: >> > On Mon, 2013-10-14 at 18:14 -0400, Robert Haas wrote: >> >> > I cleaned the semaphores on smew, but they came back. Whatever

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-16 Thread Andres Freund
On 2013-10-16 08:39:10 -0400, Robert Haas wrote: > On Tue, Oct 15, 2013 at 11:17 PM, Peter Eisentraut wrote: > > On Mon, 2013-10-14 at 18:14 -0400, Robert Haas wrote: > >> > I cleaned the semaphores on smew, but they came back. Whatever is > >> > crashing is leaving the semaphores lying around. >

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-16 Thread Robert Haas
On Tue, Oct 15, 2013 at 11:17 PM, Peter Eisentraut wrote: > On Mon, 2013-10-14 at 18:14 -0400, Robert Haas wrote: >> > I cleaned the semaphores on smew, but they came back. Whatever is >> > crashing is leaving the semaphores lying around. >> >> Ugh. When did you do that exactly? I thought I fix

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-15 Thread Peter Eisentraut
On Mon, 2013-10-14 at 18:14 -0400, Robert Haas wrote: > > I cleaned the semaphores on smew, but they came back. Whatever is > > crashing is leaving the semaphores lying around. > > Ugh. When did you do that exactly? I thought I fixed the problem > that was causing that days ago, and the last 4

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-14 Thread Robert Haas
On Mon, Oct 14, 2013 at 4:29 PM, Peter Eisentraut wrote: > On Fri, 2013-10-11 at 15:33 -0400, Robert Haas wrote: >> Can the owners of these buildfarm machines please check whether there >> are extra semaphores allocated and if so free them? Or at least >> reboot, to see if that unbreaks the build

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-14 Thread Peter Eisentraut
On Fri, 2013-10-11 at 15:33 -0400, Robert Haas wrote: > Can the owners of these buildfarm machines please check whether there > are extra semaphores allocated and if so free them? Or at least > reboot, to see if that unbreaks the build? I cleaned the semaphores on smew, but they came back. What

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-14 Thread Robert Haas
On Mon, Oct 14, 2013 at 1:33 PM, Tom Lane wrote: > Robert Haas writes: >> Anyway, as Andres said, the machines were working fine until recently, >> so I think we just need to get them un-broken. > > I think you're talking past each other. What would be useful here is > to find out *why* these ma

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-14 Thread Tom Lane
Robert Haas writes: > Anyway, as Andres said, the machines were working fine until recently, > so I think we just need to get them un-broken. I think you're talking past each other. What would be useful here is to find out *why* these machines are now failing, when they didn't before. There migh

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-14 Thread Andres Freund
On 2013-10-14 09:28:04 -0400, Robert Haas wrote: > # By convention, we put no more than twenty tests in any one parallel group; > # this limits the number of connections needed to run the tests. > > If it's not supposed to matter how many connections are available, > then that comment is misleadin
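
For reference, the convention quoted above (no more than twenty tests in any one parallel group) can be checked mechanically. The following is a minimal sketch, assuming a schedule file in which each parallel group is one line of the form "test: name1 name2 ..." and "#" starts a comment. The function name and the sample excerpt are hypothetical and are not taken from the thread or from the real schedule file.

```python
def max_parallel_group_size(schedule_text):
    """Return the largest number of tests listed on any single 'test:' line."""
    largest = 0
    for raw_line in schedule_text.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = raw_line.split("#", 1)[0].strip()
        if not line.startswith("test:"):
            continue
        tests = line[len("test:"):].split()
        largest = max(largest, len(tests))
    return largest


# Hypothetical excerpt for illustration only; not the real schedule.
SAMPLE_SCHEDULE = """\
# By convention, we put no more than twenty tests in any one parallel group
test: alpha beta gamma
test: delta
test: one two three four five
"""

if __name__ == "__main__":
    print(max_parallel_group_size(SAMPLE_SCHEDULE))
```

If each test in a group holds one connection while it runs, the value returned is a lower bound on the max_connections the test run needs; any extra connections used by the test driver itself are not modeled here.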

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-14 Thread Robert Haas
On Mon, Oct 14, 2013 at 9:22 AM, Andrew Dunstan wrote: >> Maybe I didn't explain that well. The problem is that the regression >> tests require at least 20 connections to run, and those two machines >> are currently auto-selecting 10 connections, so make check is failing. > > Why do they need 20

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-14 Thread Andrew Dunstan
On 10/14/2013 09:12 AM, Robert Haas wrote: > On Fri, Oct 11, 2013 at 4:03 PM, Andrew Dunstan wrote: > >> Can the owners of these buildfarm machines please check whether there > >> are extra semaphores allocated and if so free them? Or at least > >> reboot, to see if that unbreaks the build? > > It is possible to

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-14 Thread Andres Freund
On 2013-10-14 09:12:09 -0400, Robert Haas wrote: > On Fri, Oct 11, 2013 at 4:03 PM, Andrew Dunstan wrote: > >> Can the owners of these buildfarm machines please check whether there > >> are extra semaphores allocated and if so free them? Or at least > >> reboot, to see if that unbreaks the build?

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-14 Thread Robert Haas
On Fri, Oct 11, 2013 at 4:03 PM, Andrew Dunstan wrote: >> Can the owners of these buildfarm machines please check whether there >> are extra semaphores allocated and if so free them? Or at least >> reboot, to see if that unbreaks the build? > > It is possible to set the buildfarm config > > b

Re: [HACKERS] buildfarm failures on smew and anole

2013-10-11 Thread Andrew Dunstan
On 10/11/2013 03:33 PM, Robert Haas wrote: The build is continuing to fail on smew and anole. The reason it's failing is because those machines are choosing max_connections = 10, which is not enough to run the regression tests. I think this is probably because of System V semaphore exhaustion.
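
To illustrate how semaphore exhaustion could lead to max_connections = 10 being chosen, here is a minimal sketch. It assumes the auto-selection tries candidate values from largest to smallest and keeps the first one with which a test server starts. The candidate list, the function names, and the one-semaphore-per-connection model are all assumptions made for illustration; they are not taken from the thread or from the actual initdb source.

```python
# Assumed candidate values, tried in descending order (illustrative only).
TRIAL_CONNS = (100, 50, 40, 30, 20, 10)


def choose_max_connections(can_start, trial_conns=TRIAL_CONNS):
    """Return the first candidate for which can_start(n) is true.

    If no candidate works, fall back to the smallest one.
    """
    for n in trial_conns:
        if can_start(n):
            return n
    return trial_conns[-1]


def make_semaphore_limited_probe(free_semaphores, overhead=0):
    """Build a probe that succeeds while enough semaphores are free.

    Simplified model: one semaphore per connection plus a fixed overhead.
    A real server's requirements are more involved.
    """
    def can_start(n):
        return n + overhead <= free_semaphores
    return can_start


if __name__ == "__main__":
    # Plenty of semaphores: the largest candidate is accepted.
    print(choose_max_connections(make_semaphore_limited_probe(1000)))
    # Leaked semaphores leave only a few free: selection drops to 10.
    print(choose_max_connections(make_semaphore_limited_probe(15)))
```

Under this model, semaphores left behind by crashed processes shrink the free pool, so the probe fails for every larger candidate and the selection ends up below the 20 connections that the regression tests are said to need.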