Kernel bug in 2.6.23...was: RE: How to debug a hung multi-core system....

2009-05-28 Thread Morrison, Tom
: tmorri...@empirix.com www.empirix.com >> -Original Message- >> From: Morrison, Tom >> Sent: Thursday, May 21, 2009 11:24 AM >> To: Morrison, Tom; Kumar Gala >> Cc: linuxppc-dev@ozlabs.org; Young, Andrew; Brown, Jeff; Geary Sean- >> R60898 >&g

RE: How to debug a hung multi-core system....

2009-05-21 Thread Morrison, Tom
Just had a little conference with several co-workers...to go over results We think that LT0 (the one that maps the kernel) has been corrupted: Entry EPN RPNTID TMASK WIMGE TSIZ U0:3 X0:1 --- LT0 C

RE: How to debug a hung multi-core system....

2009-05-21 Thread Morrison, Tom
org] >> Sent: Thursday, May 21, 2009 10:45 AM >> To: Morrison, Tom >> Cc: linuxppc-dev@ozlabs.org; Young, Andrew; Brown, Jeff; Geary Sean- >> R60898 >> Subject: Re: How to debug a hung multi-core system >> >> > [Morrison, Tom] >> > >B

RE: How to debug a hung multi-core system....

2009-05-21 Thread Morrison, Tom
>> -Original Message- >> From: Kumar Gala [mailto:ga...@kernel.crashing.org] >> Sent: Thursday, May 21, 2009 9:13 AM >> To: Morrison, Tom >> Cc: linuxppc-dev@ozlabs.org; Young, Andrew; Brown, Jeff >> Subject: Re: How to debug a hung multi-cor

Re: How to debug a hung multi-core system....

2009-05-21 Thread Kumar Gala
[Morrison, Tom] >BKM>tat Entry EPN RPNTID TMASK WIMGE TSIZ U0:3 X0:1 PID TS PROT SHEN UR UW UX SR SW SX TIDZ VAL LT0 C000 00 0FF 04 9 0 0 00PPEEDEEDDV LT1 D000

Re: How to debug a hung multi-core system....

2009-05-21 Thread Kumar Gala
On May 20, 2009, at 6:17 PM, Morrison, Tom wrote: All, First off, we turned SPE off completely in our build - so we could debug a much deeper problem that seems to be occurring in our application (before we try to find a potential test case for corruption of GPR registers). We have had this p