Re: [CentOS] Hard I/O lockup with EL6

2011-09-27 Thread Emmanuel Noobadmin
On 9/27/11, Benjamin Smith wrote: > I wish you the best of luck! Fortunately (or unfortunately, depending on how one looks at it), mine appears to be just bad sectors developing on one of the "newest" drives I added to the machine as part of an mdadm RAID 1 array. After I rebooted the server and a

Re: [CentOS] Hard I/O lockup with EL6

2011-09-26 Thread Benjamin Smith
On Monday, September 26, 2011 10:16:14 PM Emmanuel Noobadmin wrote: > On 9/27/11, Benjamin Smith wrote: > > When booting a non-working system, it boots straight up to the boot > > prompt (runlevel 3) without issue, and everything works fine. When the > > machine sits idle for a period of time (r

Re: [CentOS] Hard I/O lockup with EL6

2011-09-26 Thread Emmanuel Noobadmin
On 9/27/11, Benjamin Smith wrote: > When booting a non-working system, it boots straight up to the boot prompt > (runlevel 3) without issue, and everything works fine. When the machine sits > idle for a period of time (ranging from 15 minutes or so and up) the HDD > becomes unreadable/unwritable a

Re: [CentOS] Hard I/O lockup with EL6

2011-09-26 Thread Ross Walker
On Sep 26, 2011, at 3:11 PM, Benjamin Smith wrote: > I'm trying to figure out why 2 machines have a "hard I/O lock" on the HDD > when running EL6. > > I have 4 identical machines, all were stable with EL5. 2 work great with EL6, > 2 do not. I've checked motherboard BIOS versions and setti

Re: [CentOS] Hard I/O lockup with EL6

2011-09-26 Thread Devin Reade
--On Monday, September 26, 2011 03:13:09 PM -0700 Benjamin Smith wrote: > Thanks for the feedback. Unfortunately, these aren't ancient 686 systems, > they are 1-ish year old 8-core Intel Xeons with 32 GB of ECC RAM apiece. > I can't justify replacing them, especially since two of the four are >

Re: [CentOS] Hard I/O lockup with EL6

2011-09-26 Thread Scott Silva
on 9/26/2011 3:13 PM Benjamin Smith spake the following: > On Monday, September 26, 2011 02:42:18 PM Devin Reade wrote: >> --On Monday, September 26, 2011 12:11:47 PM -0700 Benjamin Smith >> Unfortunately in trying to use C6 on the old machine I wound up with >> far too many changed variables to fi

Re: [CentOS] Hard I/O lockup with EL6

2011-09-26 Thread Benjamin Smith
On Monday, September 26, 2011 02:42:18 PM Devin Reade wrote: > --On Monday, September 26, 2011 12:11:47 PM -0700 Benjamin Smith > Unfortunately in trying to use C6 on the old machine I wound up with > far too many changed variables to figure out where the problem was. > Despite that, my gut tells m

Re: [CentOS] Hard I/O lockup with EL6

2011-09-26 Thread Devin Reade
--On Monday, September 26, 2011 12:11:47 PM -0700 Benjamin Smith wrote: > I'm trying to figure out why 2 machines have a "hard I/O lock" on the HDD > when running EL6. I _won't_ chime in with a "check your ". Instead, here's a potentially useless data point: I have an older but still usable 3

Re: [CentOS] Hard I/O lockup with EL6

2011-09-26 Thread Benjamin Smith
On Monday, September 26, 2011 02:00:52 PM Brian McKerr wrote: > Have you checked the cables you are using? There are none - it's a front-loaded hot-swap rackmount. The systems are stable under EL5.

Re: [CentOS] Hard I/O lockup with EL6

2011-09-26 Thread Brian McKerr
Have you checked the cables you are using? On Tue, Sep 27, 2011 at 6:09 AM, Benjamin Smith wrote: > On Monday, September 26, 2011 12:36:19 PM m.r...@5-cent.us wrote: > > a) have you checked > > /var/log/messages for memory or drive errors? > > Looked through the logs, there's *nothing* I can fin

Re: [CentOS] Hard I/O lockup with EL6

2011-09-26 Thread Benjamin Smith
On Monday, September 26, 2011 12:36:19 PM m.r...@5-cent.us wrote: > a) have you checked > /var/log/messages for memory or drive errors? Looked through the logs, there's *nothing* I can find that's out of sorts. When the I/O problem happens, nothing can be written. > Maybe memtest86? I replaced

Re: [CentOS] Hard I/O lockup with EL6

2011-09-26 Thread m . roth
Benjamin Smith wrote: > I'm trying to figure out why 2 machines have a "hard I/O lock" on the HDD > when running EL6. > > I have 4 identical machines, all were stable with EL5. 2 work great with > EL6, 2 do not. I've checked motherboard BIOS versions and settings, SAS > controller BIOS versions