Today the problem occured again after 4 days of uptime. System not responding to ssh, login on the console and also bind did not seem to respond to anything at all.
I don't know if it is of any help, but here's the alt+sysreg+m output: [49824.174292] SysRq : Show Memory [49824.174367] Mem-info: [49824.174403] DMA per-cpu: [49824.174438] CPU 0: Hot: hi: 0, btch: 1 usd: 0 Cold: hi: 0, btch: 1 usd: 0 [49824.174478] CPU 1: Hot: hi: 0, btch: 1 usd: 0 Cold: hi: 0, btch: 1 usd: 0 [49824.174520] Normal per-cpu: [49824.174556] CPU 0: Hot: hi: 186, btch: 31 usd: 9 Cold: hi: 62, btch: 15 usd: 12 [49824.174598] CPU 1: Hot: hi: 186, btch: 31 usd: 27 Cold: hi: 62, btch: 15 usd: 2 [49824.174638] HighMem per-cpu: [49824.174673] CPU 0: Hot: hi: 186, btch: 31 usd: 12 Cold: hi: 62, btch: 15 usd: 2 [49824.174713] CPU 1: Hot: hi: 186, btch: 31 usd: 168 Cold: hi: 62, btch: 15 usd: 11 [49824.174755] Active:149895 inactive:82114 dirty:189 writeback:3 unstable:0 [49824.174756] free:220745 slab:60772 mapped:11447 pagetables:1007 bounce:9 [49824.174837] DMA free:15968kB min:68kB low:84kB high:100kB active:0kB inactive:0kB present:16256kB pages_scanned:0 all_unreclaimable? no [49824.174888] lowmem_reserve[]: 0 873 2015 [49824.175058] Normal free:606000kB min:3744kB low:4680kB high:5616kB active:10212kB inactive:4844kB present:894080kB pages_scanned:0 all_unreclaimable? no [49824.175111] lowmem_reserve[]: 0 0 9137 [49824.175278] HighMem free:261012kB min:512kB low:1736kB high:2960kB active:589368kB inactive:323612kB present:1169608kB pages_scanned:0 all_unreclaimable? no [49824.175332] lowmem_reserve[]: 0 0 0 [49824.175499] DMA: 2*4kB 3*8kB 2*16kB 3*32kB 3*64kB 2*128kB 2*256kB 1*512kB 0*1024kB 1*2048kB 3*4096kB = 15968kB [49824.175942] Normal: 1*4kB 1*8kB 0*16kB 0*32kB 0*64kB 0*128kB 1*256kB 1*512kB 1*1024kB 1*2048kB 147*4096kB = 605964kB [49824.176377] HighMem: 53*4kB 39*8kB 16*16kB 10*32kB 3*64kB 1*128kB 0*256kB 1*512kB 1*1024kB 0*2048kB 63*4096kB = 261004kB [49824.176824] Swap cache: add 0, delete 0, find 0/0, race 0+0 [49824.176867] Free swap = 454600kB [49824.176907] Total swap = 454600kB [49824.176947] Free swap: 454600kB [49824.183611] 524080 pages of RAM [49824.183657] 294704 pages of HIGHMEM [49824.183700] 5946 reserved pages [49824.183740] 85824 pages shared [49824.183779] 0 pages swap cached [49824.183818] 189 pages dirty [49824.183858] 3 pages writeback [49824.183897] 11447 pages mapped [49824.183942] 60772 pages slab [49824.183985] 1007 pages pagetables Nothing in the logging (I debug *.* to /var/log/all) and logging just stopped at a certain point with the last message being: Apr 26 23:47:11 muur upsd[2828]: Client on 127.0.0.1 logged out On Mon, Apr 23, 2007 at 05:58:06PM +0200, Folkert van Heusden wrote: > System: kernel 2.6.20 > P4 > 2GB ram > 512 MB swap > > Situation: > Sometimes systems stops. SSH session stop. New logins proceed but just > before the command-prompt they exit again (after a delay of, say, 30s) > and return to the login prompt. New ssh logins do not even reach the > login-prompt. Network seems to still forward traffic and also the system > seems still be doing diskflushes. > > I pressed sysreq m which gives: > > [3535562.643029] SysRq : Show Memory > [3535562.643113] Mem-info: > [3535562.643153] DMA per-cpu: > [3535562.643193] CPU 0: Hot: hi: 0, btch: 1 usd: 0 Cold: hi: > 0, btch: 1 usd: 0 > [3535562.643242] CPU 1: Hot: hi: 0, btch: 1 usd: 0 Cold: hi: > 0, btch: 1 usd: 0 > [3535562.643290] Normal per-cpu: > [3535562.643330] CPU 0: Hot: hi: 186, btch: 31 usd: 152 Cold: hi: > 62, btch: 15 usd: 13 > [3535562.643379] CPU 1: Hot: hi: 186, btch: 31 usd: 128 Cold: hi: > 62, btch: 15 usd: 0 > [3535562.643426] HighMem per-cpu: > [3535562.643466] CPU 0: Hot: hi: 186, btch: 31 usd: 156 Cold: hi: > 62, btch: 15 usd: 14 > [3535562.643516] CPU 1: Hot: hi: 186, btch: 31 usd: 81 Cold: hi: > 62, btch: 15 usd: 9 > [3535562.643566] Active:321905 inactive:152128 dirty:18 writeback:0 > unstable:0 free:21864 slab:16213 mapped:15187 pagetables:1797 > [3535562.643619] DMA free:9876kB min:68kB low:84kB high:100kB active:1276kB > inactive:3616kB present:16256kB pages_scanned:0 all_unreclaimable? no > [3535562.643670] lowmem_reserve[]: 0 873 2015 > [3535562.643838] Normal free:71724kB min:3744kB low:4680kB high:5616kB > active:349232kB inactive:375312kB present:894080kB pages_scanned:0 > all_unreclaimable? no > [3535562.643891] lowmem_reserve[]: 0 0 9137 > [3535562.644060] HighMem free:5856kB min:512kB low:1736kB high:2960kB > active:937112kB inactive:229584kB present:1169608kB pages_scanned:0 > all_unreclaimable? no > [3535562.644113] lowmem_reserve[]: 0 0 0 > [3535562.644280] DMA: 339*4kB 47*8kB 161*16kB 50*32kB 8*64kB 1*128kB 1*256kB > 0*512kB 1*1024kB 1*2048kB 0*4096kB = 9876kB > [3535562.644713] Normal: 2151*4kB 2684*8kB 1327*16kB 486*32kB 20*64kB 6*128kB > 1*256kB 1*512kB 0*1024kB 1*2048kB 0*4096kB = 71724kB > [3535562.645147] HighMem: 432*4kB 406*8kB 11*16kB 10*32kB 0*64kB 1*128kB > 1*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 5856kB > [3535562.645588] Swap cache: add 2474, delete 2458, find 601/816, race 0+0 > [3535562.645633] Free swap = 454424kB > [3535562.645675] Total swap = 454600kB > [3535562.645714] Free swap: 454424kB > [3535562.653672] 524080 pages of RAM > [3535562.653725] 294704 pages of HIGHMEM > [3535562.653765] 5929 reserved pages > [3535562.653807] 175200 pages shared > [3535562.653846] 16 pages swap cached > [3535562.653886] 18 pages dirty > [3535562.653926] 0 pages writeback > [3535562.653966] 15187 pages mapped > [3535562.654005] 16213 pages slab > [3535562.654046] 1797 pages pagetables > [3535580.176308] SysRq : Emergency Sync > > So it looks like it is NOT out of memory. > > Last messages before I pressed sysreg m: > > [3535514.466556] Sig 3 send to 5432 owned by 0.0 (login) > [3535514.466748] kobject_uevent_env > [3535514.466795] fill_kobj_path: path = '/class/vc/vcs3' > [3535514.466919] kobject vcs3: cleaning up > [3535514.466970] kobject_uevent_env > [3535514.467014] fill_kobj_path: path = '/class/vc/vcsa3' > [3535514.467110] kobject vcsa3: cleaning up > [3535514.467774] kobject vcs3: registering. parent: vc, set: devices > [3535514.467863] kobject_uevent_env > [3535514.467927] fill_kobj_path: path = '/class/vc/vcs3' > [3535514.468272] kobject vcsa3: registering. parent: vc, set: devices > [3535514.468356] kobject_uevent_env > [3535514.468414] fill_kobj_path: path = '/class/vc/vcsa3' > [3535514.469533] kobject_uevent_env > [3535514.469586] fill_kobj_path: path = '/class/vc/vcs3' > [3535514.469729] kobject vcs3: cleaning up > [3535514.469781] kobject_uevent_env > [3535514.469822] fill_kobj_path: path = '/class/vc/vcsa3' > [3535514.469919] kobject vcsa3: cleaning up > [3535514.469998] kobject vcs3: registering. parent: vc, set: devices > [3535514.470059] kobject_uevent_env > [3535514.470102] fill_kobj_path: path = '/class/vc/vcs3' > [3535514.470207] kobject vcsa3: registering. parent: vc, set: devices > [3535514.470260] kobject_uevent_env > [3535514.470305] fill_kobj_path: path = '/class/vc/vcsa3' > > (as you can see, the login process gets shot by signal 3 (SIGQUIT)) > > Anything else I can try? > I also found a list of stacktraces of all processes running in the > system. > > > Folkert van Heusden > > -- > To MultiTail einai ena polymorfiko ergaleio gia ta logfiles kai tin > eksodo twn entolwn. Prosferei: filtrarisma, xrwmatismo, sygxwneysi, > diaforetikes provoles. http://www.vanheusden.com/multitail/ > ---------------------------------------------------------------------- > Phone: +31-6-41278122, PGP-key: 1F28D8AE, www.vanheusden.com Folkert van Heusden -- MultiTail är en flexibel redskap för att fälja logfilar, utför av commandoer, filtrera, ge färg, sammanfoga, o.s.v. följa. http://www.vanheusden.com/multitail/ ---------------------------------------------------------------------- Phone: +31-6-41278122, PGP-key: 1F28D8AE, www.vanheusden.com - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/