Re: RemoveAbandoned Problems

Phil Steitz Wed, 08 Dec 2021 12:23:57 -0800



On 12/8/21 6:36 AM, Christopher Schultz wrote:

Jerry,

On 12/7/21 20:59, Jerry Malcolm wrote:
Chris, The way I thought it worked was if I configured'RemoveAbandonedOnBorrow' and RemoveAbandonedTimeout="15" was thateach time I requested a new connection from the pool, any connectionsthat had been idle for >15 minutes and had not been returned by mycode to the pool would be recovered, returned to the pool and logged(assuming logAbandoned was set).
Nope. "removeAbandoned" causes any connection that isn't returned tothe pool to be *removed from the pool*, and replaced with a new(presumably working) connection. The connection that was neverreturned ... stays out there, doing whatever it was doing.

Not exactly. Abandoned connection cleanup does try to physically closeconnections [1] that are deemed abandoned. It creates capacity tocreate new ones but it does not create them immediately.

"logAbandoned" just lets you know when the pool gives up. It doesn't"do" anything (other than the logging).
The alternative would be for the pool to forcibly terminate theconnection, which could cause all kinds of chaos, so it does the onlything it can reasonably do: forget the connection ever existed in thefirst place. If your code never closes it, and the Connection objectnever gets GC'd (and, presumably, closed in the process), then it justlived forever, wasting an open-connection to your db. Since you havelimited your total connections (per user? per host?) you eventuallyrun out due to the leak.

This is why you need to be careful to set the abandoned timeout longenough so that "chaos" does not ensue. The pool tries to physicallyclose abandoned connections when this is configured to happen. Ifclients retain handles to them and later try to use them, they will getexceptions. You can see this confirmed in DBCP'sTestAbandonedBasicDataSource unit tests.

Until a few days ago I had a code error that was bypassing theclosing of the connection in certain situations, and after 12-24hours the pool had worked its way up to maxing out. My problem isfixed now, and the numActive count is staying fairly flat duringnormal activity. But the way I understood removeAbandonedOnBorrowwas that TC connection pooling code would not allow errantconnections to remain in use forever.
I'm sure I'm just misunderstanding how it works. Again, not criticalat this moment. But I'd like to figure out where my understanding iswrong for future situations.

I think that the reason that you did not see connections closed by thepool may have been that you did not get close enough to the maxTotalsetting (see other response above) or you had idle connections in thepool as it was leaking and you did not havetimeBetweenEvictionRunsMillis set. If you settimeBetweenEvictionRunsMillis to a positive value, that will triggerunconditional removal when it runs (i.e., it does not check how closethe pool is to maxTotal or how many idle connections there are). TheremoveAbandonedOnBorrow setting is really more of a liveness than acleanup feature - basically trying to keep the pool ahead of demand bycleaning up when it is exhausted or close to it.


Phil

[1] As of DBCP 2.9.0, abort is used in place of close. Seehttps://issues.apache.org/jira/browse/DBCP-567

You thought the pool would "clean-up" the mess. IT doesn't. What it*does* do is allow the pool to continue to function and provide itsservice to the application, even when the application is leakingconnections.
So, rather than starving clients when connections leak, thoseconnections are simply allowed to leak.
I always recommend running with maxActive="1" in development, withremoveAbandoned="false" and logAbandoned="true". You'll find any leaksVERY quickly. ;)
-chris
On 12/7/2021 2:31 PM, Christopher Schultz wrote:
Jerry,

On 12/4/21 23:06, Jerry Malcolm wrote:
I had a db connection leak in my code where an error conditionwould throw an exception and bypass the connection cleanup code. Ifound that and fixed it. But before I found the problem, myprogram was overrunning the max connections and locking out. Itwould take sometimes 12 hours after a reboot to go from 0connections to max. Normal steady state connections shouldcurrently be under 50. The ramp over several hours to max was veryobvious in my numActive log. What I'm confused about is whyremoveAbandoned didn't recover those connections.
When you say "recover"... what exactly do you mean?
Granted, if I write my code correctly, removeAbandoned shouldn't benecessary. The coding problem is solved now. But apparently myunderstanding/configuration of removeAbandoned is not correct.
Possibly, but you didn't state your expectations.
I'd like to have that figured out in case there's a next time(which sadly there probably will be....). Basically, with theconfiguration below, I'm not getting any idle connections detectedand returned. This is TC 8.5.73. And the leak was happening on abasic request/response (no threads involved). I requested theconnection, encountered an error, and returned without closing theconnection. Ideas? Thx.
-chris

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscr...@tomcat.apache.org
For additional commands, e-mail: users-h...@tomcat.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscr...@tomcat.apache.org
For additional commands, e-mail: users-h...@tomcat.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscr...@tomcat.apache.org
For additional commands, e-mail: users-h...@tomcat.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscr...@tomcat.apache.org
For additional commands, e-mail: users-h...@tomcat.apache.org

Re: RemoveAbandoned Problems

Reply via email to