Re: [HACKERS] Hot standby and b-tree killed items

Heikki Linnakangas Mon, 29 Dec 2008 02:46:14 -0800

marcin mank wrote:

Perhaps we should listen to the people that have said they don't want
queries cancelled, even if the alternative is inconsistent answers.

I don't like that much. PostgreSQL has traditionally avoided that very hard. It's hard to tell what kind of inconsistencies you'd get, as it'd depend on what plan is created, when a vacuum happens to run on master etc.

I think an alternative to that would be "if the wal backlog is too
big, let current queries finish and let incoming queries wait till the
backlog gets smaller".


Yeah, that makes sense too.

Many approaches have been proposed, and they all have different tradeoffs and therefore fit different use cases. I'm not sure which ones are/will be included in the patch. We don't need them all in 8.4; the one or two simplest ones will do just fine, and we can extend later.

Let me summarize. Whenever a WAL record conflicts with a query-in-progress, we can:


1. kill the query, or
2. wait for the query to finish, or
3. let the query proceed, producing invalid results.

There are some combinations of those as well. Your proposal is a variation of 2, to avoid the problem of WAL application falling behind indefinitely. There's also the max_standby_delay option in the patch, to wait a while, and then kill the query.
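
To make the three options and the max_standby_delay combination concrete, here is a minimal sketch in Python. It is not code from the patch. The function name, the Action enum, the allow_inconsistent flag, and the meaning given to None (wait forever) are assumptions made for illustration; only the max_standby_delay name comes from the discussion above.

```python
from enum import Enum


class Action(Enum):
    CANCEL = "cancel"  # option 1: kill the conflicting query
    WAIT = "wait"      # option 2: pause WAL replay and let the query run
    IGNORE = "ignore"  # option 3: replay anyway, query may see wrong data


def resolve_conflict(waited_s, max_standby_delay_s, allow_inconsistent=False):
    """Decide what to do about one query that conflicts with a WAL record.

    waited_s: how long replay has already been held up by this conflict.
    max_standby_delay_s: grace period before cancelling. None means
        "wait indefinitely" (an assumption of this sketch).
    allow_inconsistent: hypothetical switch selecting option 3.
    """
    if allow_inconsistent:
        return Action.IGNORE
    if max_standby_delay_s is None:
        return Action.WAIT
    if waited_s < max_standby_delay_s:
        return Action.WAIT
    return Action.CANCEL
```

With a 30 second limit, resolve_conflict(5, 30) keeps waiting and resolve_conflict(30, 30) cancels. A limit of 0 degenerates to plain option 1, and None to plain option 2.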

There are some additional optimizations that can be made to make those options less painful. Instead of killing all queries that might be affected by a vacuum record, only kill them when they actually hit a block that was vacuumed (Simon's idea of a latestRemovedLSN field in the page header).
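
One way the per-block check could work is sketched below in Python. This is a guess at the mechanism, not Simon's design: the Page and Query classes, the conflict_lsn field, and both function names are hypothetical. The assumption is that replaying a cleanup record stamps the page with the record's LSN and marks each conflicting query with the first LSN at which it became unsafe, so the query is cancelled only when it later reads a page stamped at or after that point.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Page:
    # LSN of the most recent cleanup record replayed on this page
    latest_removed_lsn: int = 0


@dataclass
class Query:
    # LSN of the first cleanup record that conflicted with this query's
    # snapshot; None while nothing has conflicted (hypothetical field)
    conflict_lsn: Optional[int] = None


def replay_cleanup(page, record_lsn, conflicting_queries):
    """Replay a vacuum record without cancelling anything yet."""
    page.latest_removed_lsn = max(page.latest_removed_lsn, record_lsn)
    for query in conflicting_queries:
        if query.conflict_lsn is None:
            query.conflict_lsn = record_lsn


def must_cancel(query, page):
    """Checked when the query reads a page: cancel only if this page has
    lost tuples at or after the point the query became unsafe."""
    return (query.conflict_lsn is not None
            and page.latest_removed_lsn >= query.conflict_lsn)
```

A query that never touches a vacuumed block runs to completion under this scheme, even though a conflicting record was replayed while it was running.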

Another line of attack is to avoid getting into the situation in the first place, by affecting behavior on the master. If the standby has an online connection to the master (per the synch rep patch), it can tell master what the slave's OldestXmin is, and master can take that into account and not remove tuples still needed by the slave. That's not good from a high availability point of view, you don't want a hung query in the slave to cause a long-running-transaction situation in the master, but for other use cases it would be fine. Or we can just add a constant # of transactions to OldestXmin in master, to get some breathing room in the server.
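
Both master-side ideas amount to holding back the horizon below which vacuum may remove tuples. The Python sketch below illustrates this under stated assumptions: the function and parameter names are made up, the "constant # of transactions" is read as a margin that keeps the horizon further back (subtracted here), and transaction IDs are treated as plain integers with no wraparound handling.

```python
def cleanup_horizon(master_oldest_xmin, standby_oldest_xmin=None, defer_xids=0):
    """Return the xid below which dead tuples may be removed on the master.

    standby_oldest_xmin: value reported by the standby, or None if the
        standby is not connected (hypothetical feedback channel).
    defer_xids: constant safety margin, in transactions (hypothetical knob).
    """
    horizon = master_oldest_xmin
    if standby_oldest_xmin is not None:
        # Never remove tuples a standby query might still need.
        horizon = min(horizon, standby_oldest_xmin)
    # Hold the horizon back by the margin; clamp at 0 since this sketch
    # ignores xid wraparound.
    return max(horizon - defer_xids, 0)
```

For example, cleanup_horizon(1000, 900) gives 900, so a standby query that is stuck at 900 holds back cleanup on the master, which is the high availability concern mentioned above. cleanup_horizon(1000, None, 50) gives 950 with no feedback from the standby at all.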

The bottom line is that we have enough options to make everyone happy. Some understanding of the issue is required to tune it properly, however, so documentation is important.


--
  Heikki Linnakangas
  EnterpriseDB   http://www.enterprisedb.com

