Hi! > > experiencing several crashes. I think it's because the 2.4 kernel (but > > if not, just tell me please ;), so I decided to downgrade the kernel to > > 2.2 (whis is available in woody). > It's not possible for us to even comment on the likely cause of your problems. Ok, to be more specific. The customer ran a script nightly from cron, and the script is very resource intensive. (commandline PHP script, with high Postgres usage).
After the script started, various problems happened, often the whole sytem died. I found entries like this in the logfiles: : Code: ff 0b 0f 94 c0 84 c0 0f 84 88 00 00 00 8b 43 0c 50 e8 40 66 : <1>Unable to handle kernel paging request at virtual address 0000c080 : printing eip: : c011b676 : *pde = 00000000 : Oops: 0002 : CPU: 0 : EIP: 0010:[exit_sighand+26/80] Not tainted : EFLAGS: 00010002 : eax: 00000000 ebx: c932c000 ecx: c932c264 edx: 0000c080 : esi: c932c000 edi: 0000000b ebp: 00007960 esp: c932dc78 : ds: 0018 es: 0018 ss: 0018 : Process postmaster (pid: 13023, stackpage=c932d000) : Stack: 00000000 c0116ef5 c932c000 00000002 00000002 c01113c8 c010704b 0000000b : 00000000 c0111717 c020bd1e c932dd54 00000002 c932c000 00000002 c01113c8 : 00008080 00000000 c016c6ce c932c000 c00bbac0 00000001 00030001 c02a3f14 : Call Trace: [do_exit+349/460] [do_page_fault+0/1176] [die+79/80] [do_page_faul : [set_cursor+110/128] [vt_console_print+718/736] [__call_console_drivers+58/ : [do_exit+194/460] [do_page_fault+0/1176] [die+79/80] [do_page_fault+847/117 : [page_cache_release+44/48] [free_page_and_swap_cache+49/52] [__free_pte+58/ : [die+79/80] [do_page_fault+847/1176] [do_page_fault+0/1176] [file_read_acto : [generic_file_read+129/308] [error_code+52/60] [fget+17/40] [sys_read+22/20 But the same happened with the other daemons too... The system is a PIII@833 with 256Mb of ECC SDRAM. I was using a self compiled 2.4.17, but the same happened with 2.4.9. Well, I compiled a 2.2.20 kernel, and installed it with no problems. It seems to be stable now, but it needs some more testing. (I started the script, runs smoothly). I don't know if you get something usable from the above log, if not, please tell me what should I send. Best Regards, David Biro > > To be able to make any guess at the cause of your problems we need to know > the exact kernel version, the details of your hardware, and some information > on what your server does would be handy (but is not as important). Without > such information it's not even worth guessing. > > FWIW All my most important machines run 2.4.x machines and are quite > reliable. Currently the only reliability issue I have in any of my machines > is directly related to a binary-only kernel module. Apart from that all my > machines are capable of running for months at a time (apart from power > failures, kernel upgrades, hardware changes, etc). > > -- > Signatures >4 lines are rude. If you send email to me or to a mailing list > that I am subscribed to which has >4 lines of legalistic junk at the end > then you are specifically authorizing me to do whatever I wish with the > message (the sig won't be read). -- [ http://www.wh.hu - webhosting - webdesign - http://www.wh.hu ] [ mailto:[EMAIL PROTECTED] - mailto:[EMAIL PROTECTED] ] -- To UNSUBSCRIBE, email to [EMAIL PROTECTED] with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]