On Wednesday 20 Mar 2013 04:42:28 Carlos Hendson wrote:
> Hello,
> 
> For last few weeks or so, I've been getting intermittent hard lock-ups
> during the emerge of various packages.  It appears the more compile
> intensive the package, the more likely the lock-up.  These lock-ups have
> occurred under kernels 3.4.9 and 3.7.10 with gcc 4.5.4 and 4.6.3.
> 
> Once the machine is in a frozen state, the only thing that responds is
> the soft power reset button.  Some times the machine lock-ups again
> after the button is pressed (this is because the compile resumes once
> the system comes out of it's frozen state).
> 
> If the system subsequently lock-ups because I wasn't able to cancel the
> compile fast enough only a only option left is a hard power reset (10sec
> + hold power button).  If I cancel the compile, the system is perfectly
> responsive and functions normally.
> 
> There are kernel stack traces in /var/log/messages which I'm unable to
> decipher and diagnose as to what caused the lock-up.
> 
> If I had to guess, I'd blame an incorrect setting in the .config, but
> since I'm stuck in the diagnostic of what part of the kernel might be
> experiencing the problem, I need a bit of help to pin point the issue.
> 
> I believe it to be a kernel configuration issue because when I booted
> the machine using a system rescue Live CD, I was able to chroot into the
> system and emerge packages like gcc without the lock-up problem
> occurring.
> 
> That's by no means conclusive, however, I've also run a complete pass of
> memcheck for over an hour without any issues reported.
> 
> I'd like to completely rule out hardware failure, what diagnostic tools
> tools are recommend to try identify potential hardware issue of this
> type?
> 
> The various kernel stack traces are attached in case someone wants to
> take a look.  I can provide more information should it be needed.
> 
> Any help or advice would be appreciated.
> 
> Regards,
> Carlos

Stating the obvious, it seems that the kernel is struggling and indeed you may 
have come across some nasty kernel bug.  However, it could well be that it is 
not related to the kernel you're running, or your kernel config.  It could be 
a problem with the power supply being faulty and causing these lock ups.

Unless someone else comes up with a better idea to troubleshoot it further, I 
would consider replacing the power supply with another of a known good 
condition.
-- 
Regards,
Mick

Attachment: signature.asc
Description: This is a digitally signed message part.

Reply via email to