Re: [BUGS] Completely broken replica after PANIC: WAL contains references to invalid pages

2013-10-10 Thread Sergey Konoplev
On Thu, Oct 10, 2013 at 2:48 PM, Alvaro Herrera wrote: >> Could you please give me a hint of how to check if this patch was >> included in 9.2.5 or not? > > Yes, this was committed in June: > > commit 99ee15b315c187045a95db7b27fd9d866aea93e0 > Author: Simon Riggs > Date: Sun Jun 23 11:05:02 201

Re: [BUGS] Completely broken replica after PANIC: WAL contains references to invalid pages

2013-10-10 Thread Alvaro Herrera
Sergey Konoplev escribió: > On Tue, Jun 11, 2013 at 6:50 AM, Tom Lane wrote: > > Sergey Konoplev writes: > >> Just curious, what is the planned date for the next minor release, and > >> BTW where is it possible to see the roadmap for minor releases? > > > > There is no planned date, and certainly

Re: [BUGS] Completely broken replica after PANIC: WAL contains references to invalid pages

2013-10-10 Thread Sergey Konoplev
On Tue, Jun 11, 2013 at 6:50 AM, Tom Lane wrote: > Sergey Konoplev writes: >> Just curious, what is the planned date for the next minor release, and >> BTW where is it possible to see the roadmap for minor releases? > > There is no planned date, and certainly no "roadmap". We make minor > releas

Re: [BUGS] Completely broken replica after PANIC: WAL contains references to invalid pages

2013-08-19 Thread Sergey Konoplev
Hi all, I recently noticed that I continue to receive "allocation failure" messages from postmaster in /var/log/messages on the server where the problem occurred. Aug 17 23:00:51 tms2 kernel: : postmaster: page allocation failure. order:5, mode:0xd0 Aug 17 23:00:51 tms2 kernel: : Pid: 21223, comm

Re: [BUGS] Completely broken replica after PANIC: WAL contains references to invalid pages

2013-06-11 Thread Tom Lane
Sergey Konoplev writes: > Just curious, what is the planned date for the next minor release, and > BTW where is it possible to see the roadmap for minor releases? There is no planned date, and certainly no "roadmap". We make minor releases when the core team judges that enough (or severe enough)

Re: [BUGS] Completely broken replica after PANIC: WAL contains references to invalid pages

2013-06-11 Thread Sergey Konoplev
On Mon, Jun 10, 2013 at 11:43 PM, Simon Riggs wrote: > On 11 June 2013 04:36, Sergey Konoplev wrote: >> Are there any plans to commit this patch and what version it is going >> to be done to? > > I'll be committing this soon, since we're likely coming up to the next > point release soon. I see,

Re: [BUGS] Completely broken replica after PANIC: WAL contains references to invalid pages

2013-06-10 Thread Simon Riggs
On 11 June 2013 04:36, Sergey Konoplev wrote: > Hi, > > On Thu, May 9, 2013 at 7:28 PM, Sergey Konoplev wrote: >> On Tue, Apr 2, 2013 at 11:26 AM, Andres Freund >> wrote: >>> The attached patch fixes this although I don't like the way it knowledge of >>> the >>> point up to which StartupSUBTRA

Re: [BUGS] Completely broken replica after PANIC: WAL contains references to invalid pages

2013-06-10 Thread Sergey Konoplev
Hi, On Thu, May 9, 2013 at 7:28 PM, Sergey Konoplev wrote: > On Tue, Apr 2, 2013 at 11:26 AM, Andres Freund wrote: >> The attached patch fixes this although I don't like the way it knowledge of >> the >> point up to which StartupSUBTRANS zeroes pages is handled. > > One month has passed since t

Re: [BUGS] Completely broken replica after PANIC: WAL contains references to invalid pages

2013-05-09 Thread Sergey Konoplev
On Tue, Apr 2, 2013 at 11:26 AM, Andres Freund wrote: > The attached patch fixes this although I don't like the way it knowledge of > the > point up to which StartupSUBTRANS zeroes pages is handled. One month has passed since the patched version was installed in our production environment and ca

Re: [BUGS] Completely broken replica after PANIC: WAL contains references to invalid pages

2013-04-05 Thread Sergey Konoplev
On Fri, Apr 5, 2013 at 7:33 AM, Andres Freund wrote: > Looks like it could be fixed by the patch. But that seems to imply that > you restarted not long before that? Could you check if theres a > different error before those? Yes it had happened straight after restart this time. There are no any e

Re: [BUGS] Completely broken replica after PANIC: WAL contains references to invalid pages

2013-04-05 Thread Andres Freund
On 2013-04-05 07:22:08 -0700, Sergey Konoplev wrote: > On Fri, Apr 5, 2013 at 7:15 AM, Andres Freund wrote: > > Cool. It would be very helpful if you could apply the patch and verify > > that it works, it has been written somewhat blindly. Also I am afraid > > that at least last time there was a s

Re: [BUGS] Completely broken replica after PANIC: WAL contains references to invalid pages

2013-04-05 Thread Sergey Konoplev
On Fri, Apr 5, 2013 at 7:15 AM, Andres Freund wrote: > Cool. It would be very helpful if you could apply the patch and verify > that it works, it has been written somewhat blindly. Also I am afraid > that at least last time there was a second bug involved. Okay, I will try to talk to admins but I

Re: [BUGS] Completely broken replica after PANIC: WAL contains references to invalid pages

2013-04-05 Thread Andres Freund
On 2013-04-05 07:10:12 -0700, Sergey Konoplev wrote: > On Tue, Apr 2, 2013 at 11:26 AM, Andres Freund wrote: > > The attached patch fixes this although I don't like the way it knowledge of > > the > > point up to which StartupSUBTRANS zeroes pages is handled. > > Thank you for the patch, Andres.

Re: [BUGS] Completely broken replica after PANIC: WAL contains references to invalid pages

2013-04-05 Thread Sergey Konoplev
On Tue, Apr 2, 2013 at 11:26 AM, Andres Freund wrote: > The attached patch fixes this although I don't like the way it knowledge of > the > point up to which StartupSUBTRANS zeroes pages is handled. Thank you for the patch, Andres. Is it included in 9.2.4? BTW, it has happened again and I am g

Re: [BUGS] Completely broken replica after PANIC: WAL contains references to invalid pages

2013-04-02 Thread Andres Freund
On 2013-04-02 12:10:12 +0200, Andres Freund wrote: > On 2013-04-01 08:49:16 +0100, Simon Riggs wrote: > > On 30 March 2013 17:21, Andres Freund wrote: > > > > > So if the xid is later than latestObservedXid we extend subtrans one by > > > one. So far so good. But we initialize it in > > > ProcArr

Re: [BUGS] Completely broken replica after PANIC: WAL contains references to invalid pages

2013-04-02 Thread Andres Freund
On 2013-04-01 08:49:16 +0100, Simon Riggs wrote: > On 30 March 2013 17:21, Andres Freund wrote: > > > So if the xid is later than latestObservedXid we extend subtrans one by > > one. So far so good. But we initialize it in > > ProcArrayApplyRecoveryInfo() when consistency is initially reached: >

Re: [BUGS] Completely broken replica after PANIC: WAL contains references to invalid pages

2013-04-01 Thread Simon Riggs
On 30 March 2013 17:21, Andres Freund wrote: > So if the xid is later than latestObservedXid we extend subtrans one by > one. So far so good. But we initialize it in > ProcArrayApplyRecoveryInfo() when consistency is initially reached: > latestObservedXid = running->n

Re: [BUGS] Completely broken replica after PANIC: WAL contains references to invalid pages

2013-03-30 Thread ajmcello
unsubscribe On Thu, Mar 28, 2013 at 10:30 PM, Sergey Konoplev wrote: > Hi all, > > A couple of days ago I found the replica stopped after the PANIC message: > > PANIC: WAL contains references to invalid pages > > When I tried to restart it I got this FATAL: > > FATAL: could not access status

Re: [BUGS] Completely broken replica after PANIC: WAL contains references to invalid pages

2013-03-30 Thread Andres Freund
On 2013-03-29 14:53:26 -0700, Sergey Konoplev wrote: > On Fri, Mar 29, 2013 at 2:38 PM, Andres Freund wrote: > > I have to admit, I find it a bit confusing that so many people report a > > bug and then immediately destroy all evidence of the bug. Just seems to > > a happen a bit too frequently. >

Re: [BUGS] Completely broken replica after PANIC: WAL contains references to invalid pages

2013-03-29 Thread Sergey Konoplev
On Fri, Mar 29, 2013 at 2:38 PM, Andres Freund wrote: > I have to admit, I find it a bit confusing that so many people report a > bug and then immediately destroy all evidence of the bug. Just seems to > a happen a bit too frequently. You see, businesses usually need it up ASAP again. Sorry, I mu

Re: [BUGS] Completely broken replica after PANIC: WAL contains references to invalid pages

2013-03-29 Thread Andres Freund
On 2013-03-29 14:19:59 -0700, Sergey Konoplev wrote: > On Fri, Mar 29, 2013 at 1:52 PM, anara...@anarazel.de > wrote: > > I think I See whats going on. Do you still have the datadir available? If > > so, could you send the pg_controldata output? > > I have already rebuilt the replica, however be

Re: [BUGS] Completely broken replica after PANIC: WAL contains references to invalid pages

2013-03-29 Thread Sergey Konoplev
On Fri, Mar 29, 2013 at 1:52 PM, anara...@anarazel.de wrote: > I think I See whats going on. Do you still have the datadir available? If so, > could you send the pg_controldata output? I have already rebuilt the replica, however below is the output if it is useful: pg_control version number:

Re: [BUGS] Completely broken replica after PANIC: WAL contains references to invalid pages

2013-03-29 Thread anara...@anarazel.de
Hi Sergey Konoplev schrieb: >Hi all, > >A couple of days ago I found the replica stopped after the PANIC >message: > >PANIC: WAL contains references to invalid pages > >When I tried to restart it I got this FATAL: > >FATAL: could not access status of transaction 280557568 > >Below is the descr

[BUGS] Completely broken replica after PANIC: WAL contains references to invalid pages

2013-03-28 Thread Sergey Konoplev
Hi all, A couple of days ago I found the replica stopped after the PANIC message: PANIC: WAL contains references to invalid pages When I tried to restart it I got this FATAL: FATAL: could not access status of transaction 280557568 Below is the description of the server and information from P