this could be ARTEMIS-4084: this commit, which is not part of a release yet. We should make 2.27.1 next week:
I'm in vacation mode this week due to the US Thanksgiving, if nobody sends a release this week I will cut a release on monday. commit 03b82142eb0844b9de02ca3d7ed365d849e3ac02 Author: Clebert Suconic <clebertsuco...@apache.org> Date: Tue Nov 8 09:42:16 2022 -0500 ARTEMIS-4084 Fixing addSorted with large transactions when cancelling a large number of messages, the addSorted could be holding a lock for too long causing the server to crash under CriticalAnalyzer co-authored: AntonRoskvist <anton.roskv...@volvo.com> (discovering the issue and providing the test ClientCrashMassiveRollbackTest.java) On Wed, Nov 23, 2022 at 9:16 AM Clebert Suconic <clebert.suco...@gmail.com> wrote: > > It was said your issue was fixed in 2.20. But it was not said to move to > that version. > > > When you upgrade always go to the latest. > > > On Wed, Nov 23, 2022 at 8:53 AM Thomas Wahlberg <thomas.wahlb...@nextup.fi> > wrote: >> >> Hi, >> it was first recommendation here. Second, I found some troubling posts >> about missing messages in newer versions, not sure if they're real problems >> or not. Third, 2.19 to 2.20 didn't require us to do any other configuration >> updates, which newer version might (?) require >> >> On Wed, Nov 23, 2022 at 3:44 PM Clebert Suconic <clebert.suco...@gmail.com> >> wrote: >> >> > Why did you move to 2.20 and not the latest ? >> > >> > >> > On Wed, Nov 23, 2022 at 2:37 AM Thomas Wahlberg <thomas.wahlb...@nextup.fi >> > > >> > wrote: >> > >> > > We have now upgraded one environment to 2.20.0 and notice some difference >> > > in the logs. It might be pointing to a certain queue that receives over >> > 100 >> > > kb sized messages. 2.20.0 is still crashing in similar fashion. There's a >> > > huge IOWAIT reported from our monitoring tools during the crash/restart >> > and >> > > wrapper.log is filled with hundreds, if not thousands, of errors like >> > this, >> > > all during 1-2 second time period: >> > > >> > > INFO | jvm 2 | 2022/11/23 08:02:12 | WARN >> > > [org.apache.activemq.artemis.core.server] AMQ222061: Client connection >> > > failed, clearing up resources for session >> > > 90a07ca5-6a2e-11ed-9aa1-767acc7119a2 >> > > INFO | jvm 2 | 2022/11/23 08:02:12 | WARN >> > > [org.apache.activemq.artemis.core.server] AMQ222107: Cleared up >> > resources >> > > for session 90a07ca5-6a2e-11ed-9aa1-767acc7119a2 >> > > INFO | jvm 2 | 2022/11/23 08:02:12 | WARN >> > > [org.apache.activemq.artemis.core.server] AMQ222061: Client connection >> > > failed, clearing up resources for session >> > > 90ce4367-6a2e-11ed-9aa1-767acc7119a2 >> > > INFO | jvm 2 | 2022/11/23 08:02:12 | WARN >> > > [org.apache.activemq.artemis.core.server] AMQ222107: Cleared up >> > resources >> > > for session 90ce4367-6a2e-11ed-9aa1-767acc7119a2 >> > > >> > > >> > > >> > > >> > > On Tue, Nov 8, 2022 at 2:27 PM Thomas Wahlberg < >> > thomas.wahlb...@nextup.fi> >> > > wrote: >> > > >> > > > Thanks for your reply. We can indeed update to 2.20.0 and we're moving >> > > > forward with this now. Newer versions seemed to have new possible >> > issues >> > > so >> > > > we'll try with 2.20.0 for starters. >> > > > >> > > > On Tue, Nov 8, 2022 at 12:52 PM Roskvist Anton < >> > anton.roskv...@volvo.com >> > > > >> > > > wrote: >> > > > >> > > >> Hello Thomas, >> > > >> >> > > >> I might be mistaken but I believe this has been fixed in broker >> > version >> > > >> 2.20.0... it might be a good idea to update the broker version >> > > regardless >> > > >> since there has been quite a few bug fixes, added features and >> > > improvements >> > > >> made since 2.19.0. Be sure to follow the "Versions" documentation if >> > > >> upgrading: >> > > >> >> > > >> > https://activemq.apache.org/components/artemis/documentation/latest/versions.html >> > > >> >> > > >> If that's not an option you could try to set the >> > > >> "critical-analyzer-policy" to "LOG" instead to keep it from killing >> > the >> > > >> broker. This has the drawback that if there are any actual critical >> > > errors >> > > >> in the broker it will not self-terminate. Also note that I am just >> > > another >> > > >> user of the broker so any advise regarding this should be taken with a >> > > >> spoonful of "read the docs ". >> > > >> >> > > >> Br, >> > > >> Anton >> > > >> >> > > >> -----Original Message----- >> > > >> From: Thomas Wahlberg <thomas.wahlb...@nextup.fi> >> > > >> Sent: den 8 november 2022 11:27 >> > > >> To: users@activemq.apache.org >> > > >> Subject: Artemis: Artemis crashing and restarting randomly due to IO >> > > >> timeout >> > > >> >> > > >> [You don't often get email from thomas.wahlb...@nextup.fi. Learn why >> > > >> this is important at https://aka.ms/LearnAboutSenderIdentification ] >> > > >> >> > > >> CAUTION: This email originated from outside of the organization. If >> > > >> suspicious, please report it. >> > > >> >> > > >> Hi, >> > > >> we're using artemis 2.19.0 and we've noticed this is happening >> > randomly. >> > > >> Sometimes it happens twice per day, sometimes once a month. It doesn't >> > > >> seem to be related to any cron jobs or any server load. Sometimes >> > during >> > > >> the night, sometimes during the day. We're using Python with Artemis. >> > > The >> > > >> server monitoring tools don't report anything special regarding IO or >> > > >> IOWAIT during the time of the crash. Can you help us in any way? Any >> > > >> pointers on what to look for? >> > > >> >> > > >> Thank you in advance! >> > > >> >> > > >> Blow are some details: >> > > >> Artemis: 2.19.0 >> > > >> OS: Centos 7 >> > > >> openjdk version "1.8.0_131" >> > > >> OpenJDK Runtime Environment (build 1.8.0_131-b11) OpenJDK 64-Bit >> > Server >> > > >> VM (build 25.131-b11, mixed mode) >> > > >> >> > > >> INFO | jvm 6 | ***************** | WARN >> > > >> [org.apache.activemq.artemis.utils.critical.CriticalMeasure] >> > Component >> > > >> org.apache.activemq.artemis.core.io.buffer.TimedBuffer is expired on >> > > path 4 >> > > >> INFO | jvm 6 | ***************** | ERROR >> > > >> [org.apache.activemq.artemis.core.server] AMQ224079: The process for >> > the >> > > >> virtual machine will be killed, as component >> > > >> org.apache.activemq.artemis.core.io.buffer.TimedBuffer@726de316 is >> > not >> > > >> responsive >> > > >> INFO | jvm 6 | ***************** | WARN >> > > >> [org.apache.activemq.artemis.core.server] AMQ222199: Thread dump: >> > > >> >> > > >> >> > > >> > ******************************************************************************* >> > > >> INFO | jvm 6 | ***************** | Complete Thread dump >> > > >> INFO | jvm 6 | ***************** | "Reference Handler" Id=2 >> > > RUNNABLE >> > > >> INFO | jvm 6 | ***************** | at >> > > >> java.base@11.0.14 >> > > >> /java.lang.ref.Reference.waitForReferencePendingList(Native >> > > >> Method) >> > > >> INFO | jvm 6 | ***************** | at java.base@11.0.14 >> > > >> /java.lang.ref.Reference.processPendingReferences(Reference.java:241) >> > > >> INFO | jvm 6 | ***************** | at java.base@11.0.14 >> > > >> /java.lang.ref.Reference$ReferenceHandler.run(Reference.java:213) >> > > >> INFO | jvm 6 | ***************** | >> > > >> >> > > >> ... >> > > >> >> > > >> INFO | jvm 6 | ***************** | >> > > >> INFO | jvm 6 | ***************** | >> > > >> >> > > >> >> > > >> > =============================================================================== >> > > >> INFO | jvm 6 | ***************** | End Thread dump >> > > >> INFO | jvm 6 | ***************** | >> > > >> >> > > >> >> > > >> > ******************************************************************************* >> > > >> INFO | jvm 6 | ***************** | >> > > >> ERROR | wrapper | ***************** | JVM exited unexpectedly. >> > > >> STATUS | wrapper | ***************** | Reloading Wrapper >> > > configuration... >> > > >> STATUS | wrapper | ***************** | Launching a JVM... >> > > >> >> > > >> -- >> > > >> >> > > >> Thomas Wahlberg >> > > >> Chief Technology Officer >> > > >> Quriiri / NextUp Oy >> > > >> Puh. +358407031990 >> > > >> thomas.wahlb...@quriiri.fi >> > > >> >> > > >> This email message (including its attachments) is confidential and may >> > > >> contain privileged information and is intended solely for the use of >> > the >> > > >> individual and/or entity to whom it is addressed. If you are not the >> > > >> intended recipient of this e-mail you may not disseminate, distribute >> > or >> > > >> copy this e-mail (including its attachments), or any part thereof. If >> > > this >> > > >> e-mail is received in error, please notify the sender immediately by >> > > return >> > > >> e-mail and make sure that this e-mail (including its attachments), and >> > > all >> > > >> copies thereof, are immediately deleted from your system. Please >> > further >> > > >> note that when you communicate with us via email or visit our website >> > we >> > > >> process your personal data. See our privacy policy for more >> > information >> > > >> about how we process it: >> > https://www.volvogroup.com/en-en/privacy.html >> > > >> >> > > > >> > > > >> > > > -- >> > > > >> > > > Thomas Wahlberg >> > > > Chief Technology Officer >> > > > Quriiri / NextUp Oy >> > > > Puh. +358407031990 >> > > > thomas.wahlb...@quriiri.fi >> > > > >> > > >> > > >> > > -- >> > > >> > > Thomas Wahlberg >> > > Chief Technology Officer >> > > Quriiri / NextUp Oy >> > > Puh. +358407031990 >> > > thomas.wahlb...@quriiri.fi >> > > >> > -- >> > Clebert Suconic >> > >> >> >> -- >> >> Thomas Wahlberg >> Chief Technology Officer >> Quriiri / NextUp Oy >> Puh. +358407031990 >> thomas.wahlb...@quriiri.fi > > -- > Clebert Suconic -- Clebert Suconic