One more thing:
This server has ECC memory, and it also has a BMC controller that externally 
logs all hardware errors independently of the memory condition.
>From what I understand, an ECC memory module has an extra memory chip for data 
>hashes. When data is written into ECC memory, the hash is created and stored 
>in the extra chip, and when it's retrieved, the hash is checked. When a 
>mismatch occurs, even if the kernel does not log such an error, the BMC 
>controller logs it as 
# ipmitool sel list
1c10 | 02/19/2009 | 16:29:39 | Memory #0x08 | Uncorrectable ECC |
..
I retrieved the system event log (SEL) list from the BMC controller, and there 
are no errors whatsoever.
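
For anyone who wants to script the same check, here is a minimal sketch that 
scans SEL text for ECC entries. It assumes the pipe-delimited layout shown in 
the sample line above; the function name find_ecc_events is hypothetical, and 
real ipmitool output may carry extra columns, which this sketch ignores.

```python
def find_ecc_events(sel_text):
    """Return SEL records whose description field mentions ECC.

    Assumes each record looks like the sample in this thread:
    id | date | time | sensor | description |
    """
    events = []
    for line in sel_text.splitlines():
        # Split on the pipe delimiter and trim whitespace around each field.
        fields = [field.strip() for field in line.split("|")]
        if len(fields) < 5:
            continue  # not a SEL record (blank line, "..", and so on)
        record_id, date, time, sensor, description = fields[:5]
        if "ECC" in description:
            events.append((record_id, date, time, sensor, description))
    return events


# Sample taken from the quoted SEL line above.
sample = "1c10 | 02/19/2009 | 16:29:39 | Memory #0x08 | Uncorrectable ECC |\n..\n"
print(find_ecc_events(sample))
```

On the server itself, the text could come from running "ipmitool sel list" 
through the subprocess module; an empty result would match what I see here.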

This looks like a Python error or something very basic. It started happening 
after I slightly changed the code for this particular server.

-----Original Message-----
From: Alec Matusis [mailto:matu...@yahoo.com] 
Sent: Tuesday, March 24, 2009 6:26 PM
To: 'Twisted general discussion'
Subject: RE: [Twisted-Python] strange server crash

This server crashed again today, again during maximum load for the day.
This time, no errors in the twisted log, and not even a segfault message in 
/var/log/messages : the pid simply ceased to exist.
Once again, this machine runs 8 twisted servers, but this one is slightly 
different from the others, and the error happened after the code for this 
server had been slightly modified.
I do not think this is bad RAM anymore, because there's one particular 
server that keeps crashing on this machine.

> -----Original Message-----
> From: twisted-python-boun...@twistedmatrix.com [mailto:twisted-python-
> boun...@twistedmatrix.com] On Behalf Of gl...@divmod.com
> Sent: Monday, March 23, 2009 1:37 AM
> To: 'Twisted general discussion'
> Subject: RE: [Twisted-Python] strange server crash
> 
> 
> On 07:25 am, matu...@yahoo.com wrote:
> >Very strange. I am not using any custom C extensions...
> >In the last two days, it has been under larger load, and it has not
> >crashed.
> >I will update to Python 2.6 soon.
> 
> Have you tested for bad RAM on that server?  The error mode is
> sufficiently weird and rare to make me suspect cosmic rays.
> 
> _______________________________________________
> Twisted-Python mailing list
> Twisted-Python@twistedmatrix.com
> http://twistedmatrix.com/cgi-bin/mailman/listinfo/twisted-python


