Thanks Cezary. As I feared, there are many potential culprits to sniff out.

Geoff

> On 16 May 2019, at 10:05 pm, Cezary Biernacki <cezary...@gmail.com> wrote:
> 
> Obviously it would help to learn what CPU is doing. Does the application
> become unstuck and start working normally after initial spike of activity?
> 
> I have not experienced exactly the same behaviour, and I have relatively
> similar technology - a Tapestry application running inside Docker on AWS
> EC2, thought it is a standalone process with Dropwizard/Jetty. But years
> ago I had related problems with a different WAR-based application that
> crashed after not being used some time. It was caused by the "tmpwatch"
> daemon on Linux deleting files in /tmp that had not been accessed after
> some time.
> 
> I recommend checking if anything in your application can be described as
> "it can be removed if not used after predetermined amount of time" or "it
> needs refreshing from time to time and this determined on access".
> Potential culprits:
> 
>   - in-application caches, e.g. one built with Guava's Cache;
>   - database connections or other TCP connections where an external
>   service can terminate idle sessions;
>   - an auto-scalling mechanism;
>   - the application server determining that last day user sessions are
>   expired and killing them with a listener doing something expensive on
>   session's termination.
> 
> Another option is weird garbage collection interactions, e.g. caused by
> non-trivial work in the "finalize" method somewhere in the application.
> 
> Cezary
> 
> 
> On Thu, May 16, 2019 at 3:33 AM JumpStart <
> geoff.callender.jumpst...@gmail.com> wrote:
> 
>> Hi all,
>> 
>> My app is working brilliantly under load, but after a quiet time it can be
>> very slow to respond, leading our first user of the day to tap the same
>> thing multiple times, and the next thing you know is CPU hits 100% and is
>> stuck there, and none of those requests returns a response. Nor do any new
>> requests return a response. Apache logs show that all the requests time out
>> after 60 secs, unanswered, and the health checkers start messaging the
>> support staff.
>> 
>> Has anyone else experienced this kind of thing?
>> 
>> Perhaps it’s something to do with our infrastructure? We’re running
>> Tapestry from an EAR in Wildfly in Docker in an AWS EC2 instance. Also in
>> that EC2 instance is Apache HTTPD in Docker.
>> 
>> Any thoughts, please! It’s a crazy problem.
>> 
>> Cheers,
>> 
>> Geoff
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: users-unsubscr...@tapestry.apache.org
>> For additional commands, e-mail: users-h...@tapestry.apache.org
>> 
>> 


---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscr...@tapestry.apache.org
For additional commands, e-mail: users-h...@tapestry.apache.org

Reply via email to