I just had a Koha OOM killed on a small server the other day, so I feel your 
pain.

When you startup Starman, it won't be using very much memory, but the memory 
usage will grow with the more requests it serves. Part of that has to do with 
the loading of Perl modules needed by those new requests, but I suspect that 
there are memory leaks as well. Koha/Perl can't deallocate memory once it's 
been allocated, but it can re-use memory if the data has been "deleted" within 
Koha/Perl. So since it's just growing and growing with each request, it's a bit 
suspicious.

I actually just discovered a bug with /cgi-bin/koha/patroncards/create-pdf.pl 
which I'll be working on fixing today which could cause resource exhaustion. I 
had some Starman workers using obscene amounts of memory and CPU, and it's 
because they got trapped in infinite loops trying to create labels/cards. Fun 
times...

But in your case... I'm guessing it's probably due to bots. I've noticed 
Starman instances that get a lot of bot hits balloon in memory usage. You might 
want to look at your "plack_max_requests" in koha-conf.xml. Once the Starman 
worker reaches its max requests, it'll be killed off, and the memory released. 
But then there's some CPU overhead to starting up a new Starman worker process. 
So there's a cost/benefit to do there.

It can be tough troubleshooting these things after the fact, so I'd suggest 
putting in some monitoring, which alerts you once your memory usage starts 
getting high. That way you can troubleshoot it more in real time. That said, 
troubleshooting memory use can be tricky... 

David Cook
Senior Software Engineer
Prosentient Systems
Suite 7.03
6a Glen St
Milsons Point NSW 2061
Australia

Office: 02 9212 0899

-----Original Message-----
From: Koha-devel <koha-devel-boun...@lists.koha-community.org> On Behalf Of 
Michael Kuhn via Koha-devel
Sent: Tuesday, 11 March 2025 3:27 AM
To: Koha-devel <koha-devel@lists.koha-community.org>
Cc: Hallmann, Maximilian <hallm...@landesarchiv.berlin.de>; Erdmann, Dominique 
<erdm...@landesarchiv.berlin.de>
Subject: [Koha-devel] oom-killer / Out of memory: Killed process 1026641 
(/usr/share/koha)

Hi

Our host "bib" owns 4 virtual CPU, 8 GB RAM and 18 GB swap space. It runs on 
Debian GNU/Linux 12 with Koha 24.05.01 and around 78'000 bibliographic records.

Since some time Koha sporadically just stops (today already twice), in such 
cases log file "/var/log/syslog" contains messages as attached at the end of 
this e-mail (the "oom-killer" is not always invoked by memcached). The messages 
say the host is out of memory so the Koha process was killed... I have already 
increased the swap space from 2 GB to 18 GB but to no avail.

How can we find out what is consuming so much memory causing Koha to get 
killed? How can we avoid this?

Best wishes: Michael
--
Geschäftsführer · Diplombibliothekar BBS, Informatiker eidg. Fachausweis Admin 
Kuhn GmbH · Pappelstrasse 20 · 4123 Allschwil · Schweiz T 0041 (0)61 261 55 61 
· E m...@adminkuhn.ch · W www.adminkuhn.ch




2025-03-10T12:48:00.668764+01:00 bib kernel: [769596.762201] memcached 
invoked oom-killer: gfp_mask=0x140cca(GFP_HIGHUSER_MOVABLE|__GFP_COMP), 
order=0, oom_score_adj=0
2025-03-10T12:48:00.670036+01:00 bib kernel: [769596.762885] CPU: 3 PID: 
2332685 Comm: memcached Not tainted 6.1.0-31-amd64 #1  Debian 6.1.128-1
2025-03-10T12:48:00.670038+01:00 bib kernel: [769596.763487] Hardware 
name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.14.0-2 04/01/2014
2025-03-10T12:48:00.670039+01:00 bib kernel: [769596.764019] Call Trace:
2025-03-10T12:48:00.670039+01:00 bib kernel: [769596.764528]  <TASK>
2025-03-10T12:48:00.670040+01:00 bib kernel: [769596.765027] 
dump_stack_lvl+0x44/0x5c
2025-03-10T12:48:00.670040+01:00 bib kernel: [769596.765539] 
dump_header+0x4a/0x211
2025-03-10T12:48:00.670041+01:00 bib kernel: [769596.766045] 
oom_kill_process.cold+0xb/0x10
2025-03-10T12:48:00.670041+01:00 bib kernel: [769596.766535] 
out_of_memory+0x1fd/0x4c0
2025-03-10T12:48:00.670042+01:00 bib kernel: [769596.767046] 
__alloc_pages_slowpath.constprop.0+0xc83/0xe40
2025-03-10T12:48:00.670042+01:00 bib kernel: [769596.767537] 
__alloc_pages+0x305/0x330
2025-03-10T12:48:00.670043+01:00 bib kernel: [769596.768014] 
folio_alloc+0x17/0x50
2025-03-10T12:48:00.670043+01:00 bib kernel: [769596.768499] 
__filemap_get_folio+0x155/0x340
2025-03-10T12:48:00.670044+01:00 bib kernel: [769596.768967] 
filemap_fault+0x139/0x910
2025-03-10T12:48:00.670044+01:00 bib kernel: [769596.769395]  ? 
filemap_map_pages+0x153/0x700
2025-03-10T12:48:00.670044+01:00 bib kernel: [769596.769820] 
__do_fault+0x33/0x110
2025-03-10T12:48:00.670045+01:00 bib kernel: [769596.770303] 
do_fault+0x1b9/0x410
2025-03-10T12:48:00.670157+01:00 bib kernel: [769596.770734] 
__handle_mm_fault+0x660/0xfa0
2025-03-10T12:48:00.670298+01:00 bib kernel: [769596.771189] 
handle_mm_fault+0xdb/0x2d0
2025-03-10T12:48:00.670299+01:00 bib kernel: [769596.771594] 
do_user_addr_fault+0x191/0x550
2025-03-10T12:48:00.670300+01:00 bib kernel: [769596.771982] 
exc_page_fault+0x70/0x170
2025-03-10T12:48:00.670300+01:00 bib kernel: [769596.772411] 
asm_exc_page_fault+0x22/0x30
2025-03-10T12:48:00.670301+01:00 bib kernel: [769596.772829] RIP: 
0033:0x557b12827140
2025-03-10T12:48:00.670302+01:00 bib kernel: [769596.773207] Code: 
Unable to access opcode bytes at 0x557b12827116.
2025-03-10T12:48:00.670302+01:00 bib kernel: [769596.773589] RSP: 
002b:00007ffc7f74a9c8 EFLAGS: 00010246
2025-03-10T12:48:00.670303+01:00 bib kernel: [769596.773991] RAX: 
00007fc1802bd440 RBX: 0000557b44749ac0 RCX: 0000000000000669
2025-03-10T12:48:00.670303+01:00 bib kernel: [769596.774387] RDX: 
0000000000000000 RSI: 0000000000000001 RDI: 00000000ffffffff
2025-03-10T12:48:00.670303+01:00 bib kernel: [769596.774723] RBP: 
0000000000000001 R08: 0000557b12827140 R09: 00007fc1802dd000
2025-03-10T12:48:00.670304+01:00 bib kernel: [769596.775111] R10: 
0000000000000000 R11: 0000000000000000 R12: 0000000000000001
2025-03-10T12:48:00.670305+01:00 bib kernel: [769596.775506] R13: 
0000000000000000 R14: 000000007fffffff R15: 0000557b1285cb80
2025-03-10T12:48:00.670305+01:00 bib kernel: [769596.775831]  </TASK>
2025-03-10T12:48:00.670305+01:00 bib kernel: [769596.776226] Mem-Info:
2025-03-10T12:48:00.670365+01:00 bib kernel: [769596.776592] 
active_anon:550829 inactive_anon:1391840 isolated_anon:0
2025-03-10T12:48:00.670366+01:00 bib kernel: [769596.776592] 
active_file:0 inactive_file:94 isolated_file:0
2025-03-10T12:48:00.670366+01:00 bib kernel: [769596.776592] 
unevictable:0 dirty:0 writeback:0
2025-03-10T12:48:00.670367+01:00 bib kernel: [769596.776592] 
slab_reclaimable:9935 slab_unreclaimable:13188
2025-03-10T12:48:00.670367+01:00 bib kernel: [769596.776592] 
mapped:1134 shmem:5697 pagetables:5740
2025-03-10T12:48:00.670367+01:00 bib kernel: [769596.776592] 
sec_pagetables:0 bounce:0
2025-03-10T12:48:00.670368+01:00 bib kernel: [769596.776592] 
kernel_misc_reclaimable:0
2025-03-10T12:48:00.670368+01:00 bib kernel: [769596.776592]  free:25298 
free_pcp:0 free_cma:0
2025-03-10T12:48:00.670368+01:00 bib kernel: [769596.789186] Node 0 
active_anon:2203316kB inactive_anon:5567360kB active_file:80kB 
inactive_file:156kB unevictable:0kB isolated(anon):0kB 
isolated(file):0kB mapped:4536kB dirty:0kB writeback:0kB shmem:22788kB 
shmem_thp: 0kB shmem_pmdmapped: 0kB anon_thp: 876544kB writeback_tmp:0kB 
kernel_stack:9056kB pagetables:22960kB sec_pagetables:0kB 
all_unreclaimable? no
2025-03-10T12:48:00.670369+01:00 bib kernel: [769596.790069] Node 0 DMA 
free:14336kB boost:0kB min:128kB low:160kB high:192kB 
reserved_highatomic:0KB active_anon:0kB inactive_anon:0kB 
active_file:0kB inactive_file:0kB unevictable:0kB writepending:0kB 
present:15992kB managed:15360kB mlocked:0kB bounce:0kB free_pcp:0kB 
local_pcp:0kB free_cma:0kB
2025-03-10T12:48:00.670370+01:00 bib kernel: [769596.790924] 
lowmem_reserve[]: 0 2899 7843 7843 7843
2025-03-10T12:48:00.670370+01:00 bib kernel: [769596.791252] Node 0 
DMA32 free:44376kB boost:0kB min:24936kB low:31168kB high:37400kB 
reserved_highatomic:0KB active_anon:842432kB inactive_anon:2119316kB 
active_file:0kB inactive_file:84kB unevictable:0kB writepending:0kB 
present:3129196kB managed:3018408kB mlocked:0kB bounce:0kB free_pcp:0kB 
local_pcp:0kB free_cma:0kB
2025-03-10T12:48:00.670370+01:00 bib kernel: [769596.792175] 
lowmem_reserve[]: 0 0 4943 4943 4943
2025-03-10T12:48:00.670371+01:00 bib kernel: [769596.792455] Node 0 
Normal free:42480kB boost:0kB min:42516kB low:53144kB high:63772kB 
reserved_highatomic:0KB active_anon:1360884kB inactive_anon:3447664kB 
active_file:120kB inactive_file:0kB unevictable:0kB writepending:0kB 
present:5242880kB managed:5062604kB mlocked:0kB bounce:0kB free_pcp:0kB 
local_pcp:0kB free_cma:0kB
2025-03-10T12:48:00.670371+01:00 bib kernel: [769596.793398] 
lowmem_reserve[]: 0 0 0 0 0
2025-03-10T12:48:00.670372+01:00 bib kernel: [769596.793716] Node 0 DMA: 
0*4kB 0*8kB 0*16kB 0*32kB 0*64kB 0*128kB 0*256kB 0*512kB 0*1024kB 
1*2048kB (M) 3*4096kB (M) = 14336kB
2025-03-10T12:48:00.670372+01:00 bib kernel: [769596.794404] Node 0 
DMA32: 115*4kB (UME) 68*8kB (UME) 110*16kB (UME) 94*32kB (UME) 78*64kB 
(UME) 56*128kB (UE) 40*256kB (UME) 15*512kB (E) 3*1024kB (UE) 3*2048kB 
(E) 0*4096kB = 45068kB
2025-03-10T12:48:00.670373+01:00 bib kernel: [769596.795118] Node 0 
Normal: 412*4kB (UME) 307*8kB (UME) 166*16kB (UME) 167*32kB (UME) 
83*64kB (UME) 50*128kB (UME) 23*256kB (E) 13*512kB (ME) 7*1024kB (ME) 
0*2048kB 0*4096kB = 43528kB
2025-03-10T12:48:00.670373+01:00 bib kernel: [769596.795916] Node 0 
hugepages_total=0 hugepages_free=0 hugepages_surp=0 hugepages_size=1048576kB
2025-03-10T12:48:00.670373+01:00 bib kernel: [769596.796271] Node 0 
hugepages_total=0 hugepages_free=0 hugepages_surp=0 hugepages_size=2048kB
2025-03-10T12:48:00.670374+01:00 bib kernel: [769596.796713] 155111 
total pagecache pages
2025-03-10T12:48:00.670374+01:00 bib kernel: [769596.797122] 149255 
pages in swap cache
2025-03-10T12:48:00.670374+01:00 bib kernel: [769596.797463] Free swap 
= 0kB
2025-03-10T12:48:00.670375+01:00 bib kernel: [769596.797848] Total swap 
= 1998844kB
2025-03-10T12:48:00.670922+01:00 bib kernel: [769596.798238] 2097017 
pages RAM
2025-03-10T12:48:00.670926+01:00 bib kernel: [769596.798577] 0 pages 
HighMem/MovableOnly
2025-03-10T12:48:00.670926+01:00 bib kernel: [769596.798954] 72924 pages 
reserved
2025-03-10T12:48:00.671386+01:00 bib kernel: [769596.799356] 0 pages 
hwpoisoned
2025-03-10T12:48:00.671388+01:00 bib kernel: [769596.799693] Tasks state 
(memory values in pages):
2025-03-10T12:48:00.671388+01:00 bib kernel: [769596.800070] [  pid  ] 
uid  tgid total_vm      rss pgtables_bytes swapents oom_score_adj name
2025-03-10T12:48:00.671389+01:00 bib kernel: [769596.800463] [    274] 
   0   274    30075       22   241664      274          -250 systemd-journal
2025-03-10T12:48:00.671389+01:00 bib kernel: [769596.800839] [    295] 
   0   295     6770      278    86016      221         -1000 systemd-udevd
2025-03-10T12:48:00.671390+01:00 bib kernel: [769596.801224] [    524] 
997   524    22521      154    77824       82             0 systemd-timesyn
2025-03-10T12:48:00.671390+01:00 bib kernel: [769596.801603] [    609] 
   0   609     2372       35    53248       28             0 cron
2025-03-10T12:48:00.671390+01:00 bib kernel: [769596.801997] [    610] 
100   610     2011      143    61440        8          -900 dbus-daemon
2025-03-10T12:48:00.671391+01:00 bib kernel: [769596.802387] [    612] 
   0   612    55447      184    81920      116             0 rsyslogd
2025-03-10T12:48:00.671391+01:00 bib kernel: [769596.802749] [    613] 
   0   613     4185      203    73728       73             0 systemd-logind
2025-03-10T12:48:00.671391+01:00 bib kernel: [769596.803147] [    629] 
   0   629      895       40    49152        3             0 atd
2025-03-10T12:48:00.671392+01:00 bib kernel: [769596.803558] [    666] 
   0   666     1499      202    53248       45             0 dhclient
2025-03-10T12:48:00.671392+01:00 bib kernel: [769596.803913] [    715] 
   0   715     1435      100    49152      105             0 dhclient
2025-03-10T12:48:00.671393+01:00 bib kernel: [769596.804313] [    747] 
104   747     2208      550    61440       34             0 epmd
2025-03-10T12:48:00.671393+01:00 bib kernel: [769596.804716] [    753] 
   0   753   173229     3280   430080      880             0 fail2ban-server
2025-03-10T12:48:00.671393+01:00 bib kernel: [769596.805078] [    774] 
106   774   920414    28673  1294336    23431          -500 beam.smp
2025-03-10T12:48:00.671393+01:00 bib kernel: [769596.805494] [    797] 
   0   797     2188       20    53248        2             0 agetty
2025-03-10T12:48:00.671394+01:00 bib kernel: [769596.806006] [    812] 
   0   812     3859      213    65536      120         -1000 sshd
2025-03-10T12:48:00.671394+01:00 bib kernel: [769596.806567] [    841] 
106   841      616       19    53248        6          -500 erl_child_setup
2025-03-10T12:48:00.671395+01:00 bib kernel: [769596.807045] [   1774] 
   0  1774    10665      108    69632       53             0 master
2025-03-10T12:48:00.671395+01:00 bib kernel: [769596.807442] [   1779] 
102  1779    10700      127    81920       48             0 qmgr
2025-03-10T12:48:00.671396+01:00 bib kernel: [769596.807848] [   2023] 
106  2023      848       11    45056       14          -500 inet_gethost
2025-03-10T12:48:00.671396+01:00 bib kernel: [769596.808192] [   2026] 
106  2026      848       24    45056        8          -500 inet_gethost
2025-03-10T12:48:00.671397+01:00 bib kernel: [769596.808560] [   2034] 
106  2034      644       14    40960       14          -500 sh
2025-03-10T12:48:00.671397+01:00 bib kernel: [769596.808953] [   3262] 
103  3262  2058793   518769  7716864   392357             0 mariadbd
2025-03-10T12:48:00.671397+01:00 bib kernel: [769596.809287] [ 392666] 
102 392666    12109      128    90112      225             0 tlsmgr
2025-03-10T12:48:00.671398+01:00 bib kernel: [769596.809647] [2332677] 
   0 2332677     3456      434    61440      303             0 apache2
2025-03-10T12:48:00.671398+01:00 bib kernel: [769596.810031] [2332685] 
105 2332685   107470     2677   159744     2787             0 memcached
2025-03-10T12:48:00.671399+01:00 bib kernel: [769596.810358] [2332735] 
1003 2332735     2128       21    49152       15             0 daemon
2025-03-10T12:48:00.671399+01:00 bib kernel: [769596.810703] [2332737] 
1003 2332737    14826      161   110592      256             0 zebrasrv
2025-03-10T12:48:00.671399+01:00 bib kernel: [769596.811075] [2332802] 
1003 2332802     6211     2612    94208      287             0 starman 
master
2025-03-10T12:48:00.671400+01:00 bib kernel: [769596.811400] [2332846] 
1003 2332846     2128       37    49152        0             0 daemon
2025-03-10T12:48:00.671400+01:00 bib kernel: [769596.811734] [2332847] 
1003 2332847    29751    13047   221184      414             0 
background_jobs
2025-03-10T12:48:00.671401+01:00 bib kernel: [769596.812076] [2332875] 
1003 2332875     2128       21    45056       14             0 daemon
2025-03-10T12:48:00.671401+01:00 bib kernel: [769596.812396] [2332877] 
1003 2332877    41320    19112   323584     4526             0 
background_jobs
2025-03-10T12:48:00.671401+01:00 bib kernel: [769596.812718] [2332902] 
1003 2332902     2128       13    49152       22             0 daemon
2025-03-10T12:48:00.671402+01:00 bib kernel: [769596.813044] [2332904] 
1003 2332904    73759    36200   540672     6622             0 
rebuild_zebra.p
2025-03-10T12:48:00.671402+01:00 bib kernel: [769596.813369] [1011996] 
102 1011996    10671      154    73728        0             0 pickup
2025-03-10T12:48:00.671402+01:00 bib kernel: [769596.813651] [1014020] 
  33 1014020     3517      422    61440      316             0 apache2
2025-03-10T12:48:00.671403+01:00 bib kernel: [769596.813976] [1015250] 
  33 1015250     3517      426    61440      311             0 apache2
2025-03-10T12:48:00.671403+01:00 bib kernel: [769596.814307] [1018044] 
  33 1018044     3517      472    61440      265             0 apache2
2025-03-10T12:48:00.671403+01:00 bib kernel: [769596.814583] [1018109] 
  33 1018109     3517      465    61440      272             0 apache2
2025-03-10T12:48:00.671404+01:00 bib kernel: [769596.814882] [1018113] 
  33 1018113     3517      473    61440      265             0 apache2
2025-03-10T12:48:00.671404+01:00 bib kernel: [769596.815171] [1020113] 
  33 1020113     3517      473    61440      264             0 apache2
2025-03-10T12:48:00.671404+01:00 bib kernel: [769596.815506] [1024635] 
  33 1024635     3517      474    61440      264             0 apache2
2025-03-10T12:48:00.671405+01:00 bib kernel: [769596.815796] [1026139] 
  33 1026139     3517      474    61440      264             0 apache2
2025-03-10T12:48:00.671405+01:00 bib kernel: [769596.816089] [1026170] 
  33 1026170     3517      473    61440      264             0 apache2
2025-03-10T12:48:00.671405+01:00 bib kernel: [769596.816378] [1026641] 
1003 1026641  1185289  1122584  9445376    29621             0 
/usr/share/koha
2025-03-10T12:48:00.671406+01:00 bib kernel: [769596.816688] [1026652] 
  33 1026652     3517      473    61440      264             0 apache2
2025-03-10T12:48:00.671406+01:00 bib kernel: [769596.816972] [1027075] 
1003 1027075    99496    34733   749568    33424             0 
/usr/share/koha
2025-03-10T12:48:00.671407+01:00 bib kernel: [769596.817289] [1027426] 
1003 1027426     3594      624    65536      241             0 apache2
2025-03-10T12:48:00.671416+01:00 bib kernel: [769596.817615] [1027453] 
1003 1027453     3592      669    61440      240             0 apache2
2025-03-10T12:48:00.671416+01:00 bib kernel: [769596.817890] [1027749] 
1003 1027749    16877      179   114688     2268             0 zebrasrv
2025-03-10T12:48:00.671416+01:00 bib kernel: [769596.818218] [1027761] 
1003 1027761     3538      563    61440      244             0 apache2
2025-03-10T12:48:00.671417+01:00 bib kernel: [769596.818501] [1027769] 
1003 1027769    16966     2354   114688      229             0 zebrasrv
2025-03-10T12:48:00.671417+01:00 bib kernel: [769596.818812] [1027770] 
1003 1027770     3523      559    61440      241             0 apache2
2025-03-10T12:48:00.671417+01:00 bib kernel: [769596.819085] 
oom-kill:constraint=CONSTRAINT_NONE,nodemask=(null),cpuset=/,mems_allowed=0,global_oom,task_memcg=/system.slice/koha-common.service,task=/usr/share/koha,pid=1026641,uid=1003
2025-03-10T12:48:00.671418+01:00 bib kernel: [769596.819717] Out of 
memory: Killed process 1026641 (/usr/share/koha) total-vm:4741156kB, 
anon-rss:4490332kB, file-rss:4kB, shmem-rss:0kB, UID:1003 
pgtables:9224kB oom_score_adj:0
2025-03-10T12:48:00.721396+01:00 bib systemd[1]: koha-common.service: A 
process of this unit has been killed by the OOM killer.
2025-03-10T12:48:03.702481+01:00 bib koha-common[1027819]: Stopping 
Plack daemon for bib:.
2025-03-10T12:48:03.755927+01:00 bib koha-common[1027861]: Stopping Koha 
worker daemon for bib (default):.
2025-03-10T12:48:03.789372+01:00 bib koha-common[1027886]: Stopping Koha 
worker daemon for bib (long_tasks):.
2025-03-10T12:48:03.825612+01:00 bib koha-common[1027911]: Stopping Koha 
indexing daemon for bib:.
2025-03-10T12:48:05.057975+01:00 bib systemd[1]: koha-common.service: 
Failed with result 'oom-kill'.
2025-03-10T12:48:05.058217+01:00 bib systemd[1]: koha-common.service: 
Consumed 2d 22h 25min 610ms CPU time.
_______________________________________________
Koha-devel mailing list
Koha-devel@lists.koha-community.org
https://lists.koha-community.org/cgi-bin/mailman/listinfo/koha-devel
website : https://www.koha-community.org/
git : https://git.koha-community.org/
bugs : https://bugs.koha-community.org/

_______________________________________________
Koha-devel mailing list
Koha-devel@lists.koha-community.org
https://lists.koha-community.org/cgi-bin/mailman/listinfo/koha-devel
website : https://www.koha-community.org/
git : https://git.koha-community.org/
bugs : https://bugs.koha-community.org/

Reply via email to