Re: [CentOS] EDAC Kernel Panic 2.6.9-78 and above

2009-10-22 Thread William L. Maltby
On Thu, 2009-10-22 at 04:20 -0400, ken wrote: > > cat /boot/grub/menu.lst > ... > title CentOS (2.6.18-164.2.1.el5.plus) > root (hd0,2) > kernel /vmlinuz-2.6.18-164.2.1.el5.plus ro > root=/dev/mapper/luks-3d723b4f-0184-438d-9cb9-9ebff16e683a rhgb quiet > initrd /initrd-2.

Re: [CentOS] EDAC Kernel Panic 2.6.9-78 and above

2009-10-22 Thread ken
On 10/21/2009 10:21 PM Philip Gwyn wrote: > On 20-Oct-2009 Michael Schumacher wrote: >>> I've got a production system running CentOS 4 that was rock solid >>> until I upgraded from 2.6.9-55 to 2.6.9-78.0.13 (now running >>> 2.6.9-89.0.11). The system now crashes intermittently after a few >>> week

Re: [CentOS] EDAC Kernel Panic 2.6.9-78 and above

2009-10-21 Thread Philip Gwyn
On 20-Oct-2009 Michael Schumacher wrote: >> I've got a production system running CentOS 4 that was rock solid >> until I upgraded from 2.6.9-55 to 2.6.9-78.0.13 (now running >> 2.6.9-89.0.11). The system now crashes intermittently after a few >> weeks. I finally caught the panic message : > >> ED

Re: [CentOS] EDAC Kernel Panic 2.6.9-78 and above

2009-10-20 Thread Chris Miller
nate wrote: > Check your bios/system event log for any indication that it > is logging memory errors? Most modern server class motherboards > (past 5 years) do this, though not always reliably. Nothing in the logs, it's a Supermicro X7DVL-E (fyi). > I've also had trouble with memtest86 myself,

Re: [CentOS] EDAC Kernel Panic 2.6.9-78 and above

2009-10-19 Thread Michael Schumacher
Chris, > I've got a production system running CentOS 4 that was rock solid > until I upgraded from 2.6.9-55 to 2.6.9-78.0.13 (now running > 2.6.9-89.0.11). The system now crashes intermittently after a few > weeks. I finally caught the panic message : > EDAC MC0: INTERNAL ERROR: channel-b out of

Re: [CentOS] EDAC Kernel Panic 2.6.9-78 and above

2009-10-19 Thread nate
Chris Miller wrote: > Thoughts? Check your bios/system event log for any indication that it is logging memory errors? Most modern server class motherboards (past 5 years) do this, though not always reliably. I've also had trouble with memtest86 myself, I prefer to run ctcs: http://sourceforge.n

[CentOS] EDAC Kernel Panic 2.6.9-78 and above

2009-10-19 Thread Chris Miller
I've got a production system running CentOS 4 that was rock solid until I upgraded from 2.6.9-55 to 2.6.9-78.0.13 (now running 2.6.9-89.0.11). The system now crashes intermittently after a few weeks. I finally caught the panic message : EDAC MC0: INTERNAL ERROR: channel-b out of range (4 >= 4) Ke