On 05.08.2019 22:35, Daniel Migowski wrote:
I think that including information in pg_prepared_statements about the
memory used by a statement is very useful.
A CachedPlanMemoryUsage function may be useful not only for this view;
for example, it is also needed in my autoprepare patch.
I would love to use your work once it's done, and would also love to
work together here. I am quite a novice in C though, so I might take my
time to get things right.
Right now I reused your implementation of the CachedPlanMemoryUsage function :)
Before, I took into account only the memory used by plan->context, but
not plan->query_context and plan->gplan->context (although the
query_context for the raw parse tree seems to be much smaller).
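For reference, a rough sketch of what I mean by summing the three
contexts (this is not the actual patch; MemoryContextTreeSize() is a
hypothetical helper, one possible implementation of which is sketched
at the end of this mail):

    #include "utils/plancache.h"

    /* hypothetical helper, sketched at the end of this mail */
    extern Size MemoryContextTreeSize(MemoryContext context);

    static Size
    CachedPlanMemoryUsage(CachedPlanSource *plansource)
    {
        /* context holds the CachedPlanSource itself and the raw parse tree */
        Size        size = MemoryContextTreeSize(plansource->context);

        /* query_context holds the analyzed-and-rewritten query list */
        if (plansource->query_context != NULL)
            size += MemoryContextTreeSize(plansource->query_context);

        /* gplan->context holds the generic plan, once one has been built */
        if (plansource->gplan != NULL)
            size += MemoryContextTreeSize(plansource->gplan->context);

        return size;
    }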
I wonder if you would consider going further and not only reporting but
also controlling the memory used by prepared statements?
For example, implementing some LRU replacement discipline on top of the
prepared statement cache, which could evict rarely used prepared
statements to avoid memory overflow.
THIS! Having some kind of safety net here would finally make sure that
my precious processes will not grow endlessly until all memory is eaten
up, even with prepared statement count limits.
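To make the idea concrete, a minimal sketch of such an LRU discipline
over the existing prepared-statement hash table (prepared_queries and
DropPreparedStatement() exist in commands/prepare.c; the last_use field,
which would have to be bumped in FetchPreparedStatement(), and
TotalPreparedStatementMemory() are hypothetical):

    static void
    EvictPreparedStatementsLRU(Size mem_limit)
    {
        while (TotalPreparedStatementMemory() > mem_limit)
        {
            HASH_SEQ_STATUS seq;
            PreparedStatement *entry;
            PreparedStatement *victim = NULL;

            /* find the least recently used statement */
            hash_seq_init(&seq, prepared_queries);
            while ((entry = hash_seq_search(&seq)) != NULL)
            {
                if (victim == NULL || entry->last_use < victim->last_use)
                    victim = entry;
            }

            if (victim == NULL)
                break;          /* cache is empty, nothing to evict */

            DropPreparedStatement(victim->stmt_name, false);
        }
    }

The full scan per eviction is O(n), but for the typical number of
prepared statements per backend that should be negligible compared to
replanning costs.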
While working on this I noticed there are three things stored in a
CachedPlanSource: the raw query tree (a relatively small thing), the
query list (the analyzed-and-rewritten query tree), which takes up the
most memory (at least here, maybe different with your use cases), and
(often after the 6th call) the CachedPlan, which is more optimized than
the query list and often needs less memory (half of the query list here).
The query list seems to take the most time to create here, because I
hit the GEQO engine, but it could be recreated easily (up to 500ms for
some queries). Creating the CachedPlan afterwards takes 60ms in some
use cases. If we could just invalidate them from time to time, that
would be great.
Also, invalidating just the query list or the CachedPlan would not
invalidate the whole prepared statement, which would break clients'
expectations; it would just make them slower, adding much to the
stability of the system. I would pay that price, because I don't use
manually named prepared statements anyway and just autogenerate them as
performance sugar, without thinking about what really needs to be
prepared. There is an option in the JDBC driver to switch to
server-side prepared statements automatically after you have used them
a few times.
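A hedged sketch of what that partial invalidation could look like,
loosely modeled on plancache.c's internal ReleaseGenericPlan();
DiscardCachedPlans() itself is not an existing API:

    static void
    DiscardCachedPlans(CachedPlanSource *plansource)
    {
        /* drop the generic plan, as ReleaseGenericPlan() does */
        if (plansource->gplan)
        {
            ReleaseCachedPlan(plansource->gplan, false);
            plansource->gplan = NULL;
        }

        /*
         * Mark the source invalid: RevalidateCachedQuery() then rebuilds
         * the analyzed-and-rewritten query list from the raw parse tree
         * on the next use, freeing the old query_context.  The statement
         * stays usable, just slower on its next execution.
         */
        plansource->is_valid = false;
    }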
I have noticed that cached plans for implicitly prepared statements in
stored procedures are not shown in the pg_prepared_statements view.
It may not be a problem in your case (if you are accessing Postgres
through JDBC and not using stored procedures), but it can cause memory
overflow in applications that actively use stored procedures, because
unlike explicitly created prepared statements, statements implicitly
prepared by plpgsql are very difficult to estimate and control.
I am not sure what the best solution would be in this case. Adding yet
another view for implicitly prepared statements? Or including them in
the pg_prepared_statements view?
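Whichever way we choose, the implicitly prepared plans are at least
reachable: plpgsql saves its plans through SPI into the same plan
cache, and plancache.c links every saved plan into saved_plan_list. A
sketch of totalling them (saved_plan_list is static in plancache.c
today, so this would have to live there or the list would need to be
exposed; CachedPlanMemoryUsage() is the function discussed above):

    static Size
    TotalSavedPlanMemory(void)
    {
        Size        total = 0;
        dlist_iter  iter;

        dlist_foreach(iter, &saved_plan_list)
        {
            CachedPlanSource *plansource =
                dlist_container(CachedPlanSource, node, iter.cur);

            total += CachedPlanMemoryUsage(plansource);
        }

        return total;
    }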
We have such a patch for PgPro-EE, but it limits only the number of
prepared statements, not taking into account the amount of memory used
by them.
I think that a memory-based limit will be more accurate (although it
adds more overhead).
Limiting them by number is already done automatically here and would
really not be of much value, but having a memory limit would be great.
We could have a combined memory limit for your autoprepared statements
as well as the manually prepared ones, so clients can know for sure
that the server processes won't eat up more than e.g. 800MB for
prepared statements. I would also like to have this value spread across
all client processes, e.g. specifying
max_prepared_statement_total_mem=5G for the server, and maybe
max_prepared_statement_mem=1G for client processes. Of course we would
have to implement cross-process invalidation here, and I don't know
whether communicating client processes are even intended.
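If the per-backend limit were a plain GUC, the entry in guc.c could
look roughly like this (the name is the one proposed above; default,
minimum and unit are just placeholders):

    /* static int max_prepared_statement_mem; -- in kilobytes */

    /* in guc.c, ConfigureNamesInt[] */
    {
        {"max_prepared_statement_mem", PGC_USERSET, RESOURCES_MEM,
            gettext_noop("Sets the maximum memory used by prepared "
                         "statements in a single backend."),
            NULL,
            GUC_UNIT_KB
        },
        &max_prepared_statement_mem,
        1048576, 64, MAX_KILOBYTES,
        NULL, NULL, NULL
    },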
Anyway, a memory limit won't really add that much more overhead. At
least not more than having no prepared statements at all for fear of
server OOMs, or having just a small number of them. I was even thinking
about a prepared statement reaper that checks pg_prepared_statements
every few minutes to clean things up manually, but having this in the
server would be of great value to me.
Right now a memory context has no field containing the amount of
currently used memory.
This is why the context->methods->stats implementation has to traverse
all blocks to calculate the size of memory used by the context.
That may not be so fast for large contexts. I do not expect contexts of
prepared statements to be very large, although I have dealt with
customers who issued queries whose text was larger than a few
megabytes; I am afraid to estimate the size of the plan for such
queries.
This is the reason for my concern that calculating memory context size
may have a negative effect on performance. But it has to be done only
once, when the statement is prepared, so maybe it is not a problem at
all.
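For completeness, the kind of traversal I mean, written against the
v12-style stats callback (the callback signature differs between
versions, so treat this strictly as a sketch):

    #include "nodes/memnodes.h"

    Size
    MemoryContextTreeSize(MemoryContext context)
    {
        MemoryContextCounters totals;
        MemoryContext child;
        Size        size;

        memset(&totals, 0, sizeof(totals));

        /* walks all blocks of this context -- the overhead discussed above */
        context->methods->stats(context, NULL, NULL, &totals);
        size = totals.totalspace;

        /* recurse into child contexts hanging off this one */
        for (child = context->firstchild; child != NULL; child = child->nextchild)
            size += MemoryContextTreeSize(child);

        return size;
    }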
--
Konstantin Knizhnik
Postgres Professional: http://www.postgrespro.com
The Russian Postgres Company