On May 24, 2012, at 3:35 PM, Tom Lane wrote:

> Jeff Frost <j...@pgexperts.com> writes:
>> On May 24, 2012, at 3:13 PM, Tom Lane wrote:
>>> Huh.  A bit bigger, but not by that much.  It doesn't seem like this
>>> would be enough to make seqscan performance fall off a cliff, as it
>>> apparently did.  Unless maybe the slightly-bloated catalogs were just a
>>> bit too large to fit in RAM, and so the repeated seqscans went from
>>> finding all their data in kernel disk buffers to finding none of it?
> 
>> Seems unlikely.
>> Server has 128GB of RAM.
> 
> Hm ... sure seems like that ought to be enough ;-), even with a near-2GB
> pg_attribute table.  What's your shared_buffers setting?

It's 8GB.

> 
>> BTW, when I connected to it this time, I had a really long time before my 
>> psql was able to send a query, so it seems to be still broken at least.
> 
> Yeah, I was afraid that re-initdb would be at best a temporary solution.

Oh, sorry, I wasn't clear on that.  The currently running system is still 
happy, but the old data directory got stuck in 'startup' for a while when I 
connected via psql.

> 
> It would probably be useful to confirm the theory that relcache rebuild
> is what makes it stall.  You should be able to manually remove the
> pg_internal.init file in the database's directory; then connect with
> psql, and time how long it takes before the pg_internal.init file
> reappears.


So, you're thinking autovac invalidates the cache and causes it to be rebuilt, 
then a bunch of new connections get stalled as they all wait for the rebuild?

I'll see if I can get the customer to move the data directory to a test system 
so I can futz with it on a non production system.
-- 
Sent via pgsql-bugs mailing list (pgsql-bugs@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-bugs

Reply via email to