Re: 2.6.18-mm2 boot failure on x86-64

2006-10-17 Thread Mel Gorman
On Tue, 17 Oct 2006, Adrian Bunk wrote: On Mon, Oct 16, 2006 at 04:58:14PM -0700, Andrew Morton wrote: On Mon, 16 Oct 2006 14:16:13 -0400 Vivek Goyal <[EMAIL PROTECTED]> wrote: Can you please have a look at the attached patch Looks like a fine patch to me, although it could benefit from a

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-17 Thread Adrian Bunk
On Mon, Oct 16, 2006 at 04:58:14PM -0700, Andrew Morton wrote: > On Mon, 16 Oct 2006 14:16:13 -0400 > Vivek Goyal <[EMAIL PROTECTED]> wrote: > > > > > Can you please have a look at the attached patch > > Looks like a fine patch to me, although it could benefit from a comment > explaining why all

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-16 Thread Andrew Morton
On Mon, 16 Oct 2006 14:16:13 -0400 Vivek Goyal <[EMAIL PROTECTED]> wrote: > > Can you please have a look at the attached patch Looks like a fine patch to me, although it could benefit from a comment explaining why all those PAGE_ALIGN()s are in there. > and include it in -mm. Does it fix a pat

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-16 Thread Vivek Goyal
On Mon, Oct 09, 2006 at 10:53:58AM +0100, Mel Gorman wrote: > On Fri, 6 Oct 2006, Vivek Goyal wrote: > > >On Fri, Oct 06, 2006 at 01:03:50PM -0500, Steve Fox wrote: > >>On Fri, 2006-10-06 at 18:11 +0100, Mel Gorman wrote: > >>>On (06/10/06 11:36), Vivek Goyal didst pronounce: > Where is bss pl

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-09 Thread Mel Gorman
On Fri, 6 Oct 2006, Vivek Goyal wrote: On Fri, Oct 06, 2006 at 01:03:50PM -0500, Steve Fox wrote: On Fri, 2006-10-06 at 18:11 +0100, Mel Gorman wrote: On (06/10/06 11:36), Vivek Goyal didst pronounce: Where is bss placed in physical memory? I guess bss_start and bss_stop from System.map will

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-06 Thread Vivek Goyal
On Fri, Oct 06, 2006 at 01:03:50PM -0500, Steve Fox wrote: > On Fri, 2006-10-06 at 18:11 +0100, Mel Gorman wrote: > > On (06/10/06 11:36), Vivek Goyal didst pronounce: > > > Where is bss placed in physical memory? I guess bss_start and bss_stop > > > from System.map will tell us. That will confirm

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-06 Thread Steve Fox
On Fri, 2006-10-06 at 18:11 +0100, Mel Gorman wrote: > On (06/10/06 11:36), Vivek Goyal didst pronounce: > > Where is bss placed in physical memory? I guess bss_start and bss_stop > > from System.map will tell us. That will confirm that above memset step is > > stomping over bss. Then we have to ju

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-06 Thread Vivek Goyal
On Fri, Oct 06, 2006 at 06:11:05PM +0100, Mel Gorman wrote: > On (06/10/06 11:36), Vivek Goyal didst pronounce: > > On Fri, Oct 06, 2006 at 03:33:12PM +0100, Mel Gorman wrote: > > > > Linux version 2.6.18-git22 ([EMAIL PROTECTED]) (gcc version 4.1.0 (SUSE > > > > Linux)) #2 SMP Thu Oct 5 19:05:36

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-06 Thread Vivek Goyal
On Fri, Oct 06, 2006 at 06:11:05PM +0100, Mel Gorman wrote: > On (06/10/06 11:36), Vivek Goyal didst pronounce: > > On Fri, Oct 06, 2006 at 03:33:12PM +0100, Mel Gorman wrote: > > > > Linux version 2.6.18-git22 ([EMAIL PROTECTED]) (gcc version 4.1.0 (SUSE > > > > Linux)) #2 SMP Thu Oct 5 19:05:36

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-06 Thread Mel Gorman
On (06/10/06 11:36), Vivek Goyal didst pronounce: > On Fri, Oct 06, 2006 at 03:33:12PM +0100, Mel Gorman wrote: > > > Linux version 2.6.18-git22 ([EMAIL PROTECTED]) (gcc version 4.1.0 (SUSE > > > Linux)) #2 SMP Thu Oct 5 19:05:36 PDT 2006 > > > Command line: root=/dev/sda1 vga=791 > > > ip=9.47.

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-06 Thread Vivek Goyal
On Fri, Oct 06, 2006 at 03:33:12PM +0100, Mel Gorman wrote: > > Linux version 2.6.18-git22 ([EMAIL PROTECTED]) (gcc version 4.1.0 (SUSE > > Linux)) #2 SMP Thu Oct 5 19:05:36 PDT 2006 > > Command line: root=/dev/sda1 vga=791 > > ip=9.47.67.239:9.47.67.50:9.47.67.1:255.255.255.0 resume=/dev/sdb1 s

Re: 2.6.18-mm2 boot failure on x86-64 II

2006-10-05 Thread Andrew Morton
On Thu, 05 Oct 2006 17:02:54 -0700 Badari Pulavarty <[EMAIL PROTECTED]> wrote: > > Code: 0f 0b 48 8b 3d 15 ab 1e 00 be d0 00 00 00 e8 c0 f5 ff ff 48 > > RIP [] init_list+0x1d/0xfd > > RSP > > <0>Kernel panic - not syncing: Attempted to kill the idle task! > > > > > > I am going to revert the p

Re: 2.6.18-mm2 boot failure on x86-64 II

2006-10-05 Thread Badari Pulavarty
keith mannthey wrote: On Fri, 2006-10-06 at 01:35 +0200, Andi Kleen wrote: As of yet I haven't been able to recreate the hang. I am running similar HW to Steve. I ran into this with -mm3 Memory: 24150368k/26738688k available (1933k kernel code, 490260k reserved, 978k data, 308k in

Re: 2.6.18-mm2 boot failure on x86-64 II

2006-10-05 Thread keith mannthey
On Fri, 2006-10-06 at 01:35 +0200, Andi Kleen wrote: > > As of yet I haven't been able to recreate the hang. I am running > > similar HW to Steve. I ran into this with -mm3 Memory: 24150368k/26738688k available (1933k kernel code, 490260k reserved, 978k data, 308k init) [ cut here ]

Re: 2.6.18-mm2 boot failure on x86-64 II

2006-10-05 Thread Andi Kleen
> As of yet I haven't been able to recreate the hang. I am running > similar HW to Steve. That was on a 4 core Opteron with Tyan board (S2881) and AMD-8111 chipset. -Andi - To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to [EMAIL PROTECTED] More ma

Re: 2.6.18-mm2 boot failure on x86-64 II

2006-10-05 Thread keith mannthey
On Fri, 2006-10-06 at 01:14 +0200, Andi Kleen wrote: > On Thursday 05 October 2006 22:51, Andi Kleen wrote: > > > > > hmm, rather than bugging you with patches now, I'll see what I can find > > > with the x86_64 machines I have access to and see can I reproduce it. > > > > I started the bisect,

Re: 2.6.18-mm2 boot failure on x86-64 II

2006-10-05 Thread Andi Kleen
On Thursday 05 October 2006 22:51, Andi Kleen wrote: > > > hmm, rather than bugging you with patches now, I'll see what I can find > > with the x86_64 machines I have access to and see can I reproduce it. > > I started the bisect, should finish soon. It ended at diff-tree d5cdb67236dba94496de

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Andi Kleen
> hmm, rather than bugging you with patches now, I'll see what I can find > with the x86_64 machines I have access to and see can I reproduce it. I started the bisect, should finish soon. -Andi - To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to [EMAI

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Andi Kleen
On Thursday 05 October 2006 22:42, Steve Fox wrote: > On Thu, 2006-10-05 at 21:05 +0200, Andi Kleen wrote: > > > Can you please try it again with this patch to narrow it down further? > > Unfortunately this is as far as it got before it hung. Boot with earlyprintk=serial,ttyS0,57600 (or change t

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Steve Fox
On Thu, 2006-10-05 at 21:05 +0200, Andi Kleen wrote: > Can you please try it again with this patch to narrow it down further? Unfortunately this is as far as it got before it hung. root (hd0,0) Filesystem type is reiserfs, partition type 0x83 kernel /boot/vmlinuz-autobench root=/dev/sda1 vga=79

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Mel Gorman
On Thu, 5 Oct 2006, Andi Kleen wrote: On Thursday 05 October 2006 20:52, Vivek Goyal wrote: On Thu, Oct 05, 2006 at 08:27:02PM +0200, Andi Kleen wrote: On Thursday 05 October 2006 19:57, Steve Fox wrote: On Thu, 2006-10-05 at 17:40 +0200, Andi Kleen wrote: Please don't snip the Code: line.

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Steve Fox
On Thu, 2006-10-05 at 21:08 +0200, Andi Kleen wrote: > Mel might want to take a look (and perhaps > also cut down a little on the ugly printks ...) I tested a patch from Mel which backs out the arch independent zone sizing and got the same results (to my inexperienced eye). I've sent him the boo

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Andi Kleen
On Thursday 05 October 2006 20:52, Vivek Goyal wrote: > On Thu, Oct 05, 2006 at 08:27:02PM +0200, Andi Kleen wrote: > > On Thursday 05 October 2006 19:57, Steve Fox wrote: > > > On Thu, 2006-10-05 at 17:40 +0200, Andi Kleen wrote: > > > > > > > Please don't snip the Code: line. It is fairly import

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Andi Kleen
On Thursday 05 October 2006 20:51, Steve Fox wrote: > On Thu, 2006-10-05 at 20:27 +0200, Andi Kleen wrote: > > > I guess we need to track when it gets corrupted. Can you send the full > > boot log with this patch applied? > > Here she blows! Can you please try it again with this patch to narrow

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Steve Fox
On Thu, 2006-10-05 at 20:27 +0200, Andi Kleen wrote: > I guess we need to track when it gets corrupted. Can you send the full > boot log with this patch applied? Here she blows! root (hd0,0) Filesystem type is reiserfs, partition type 0x83 kernel /boot/vmlinuz-autobench root=/dev/sda1 vga=791 i

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Vivek Goyal
On Thu, Oct 05, 2006 at 08:27:02PM +0200, Andi Kleen wrote: > On Thursday 05 October 2006 19:57, Steve Fox wrote: > > On Thu, 2006-10-05 at 17:40 +0200, Andi Kleen wrote: > > > > > Please don't snip the Code: line. It is fairly important. > > > > Sorry about that. The remote console I was using a

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Andi Kleen
On Thursday 05 October 2006 19:57, Steve Fox wrote: > On Thu, 2006-10-05 at 17:40 +0200, Andi Kleen wrote: > > > Please don't snip the Code: line. It is fairly important. > > Sorry about that. The remote console I was using appears to overwrite > some text after I force the reboot. Here's a clean

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Steve Fox
On Thu, 2006-10-05 at 17:40 +0200, Andi Kleen wrote: > Please don't snip the Code: line. It is fairly important. Sorry about that. The remote console I was using appears to overwrite some text after I force the reboot. Here's a clean one. global Unable to handle kernel NULL poin

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Andi Kleen
On Thursday 05 October 2006 17:32, Steve Fox wrote: > On Thu, 2006-10-05 at 08:12 -0700, Badari Pulavarty wrote: > > > Can you post the latest panic stack again (with CONFIG_DEBUG_KERNEL) ? > > CONFIG_DEBUG_KERNEL should be on > > > Last time I couldn't match your instruction dump to any code s

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Steve Fox
On Thu, 2006-10-05 at 08:12 -0700, Badari Pulavarty wrote: > Can you post the latest panic stack again (with CONFIG_DEBUG_KERNEL) ? CONFIG_DEBUG_KERNEL should be on > Last time I couldn't match your instruction dump to any code segment > in the routine. And also, can you post your .config file.

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Badari Pulavarty
On Thu, 2006-10-05 at 09:53 -0500, Steve Fox wrote: > On Wed, 2006-10-04 at 18:08 -0700, Martin Bligh wrote: > > Andi Kleen wrote: > > >>I think most likely it would crash on 2.6.18. Keith mannthey had reported > > >>a different crash on 2.6.18-rc4-mm2 when this patch was introduced first > > >>tim

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Steve Fox
On Wed, 2006-10-04 at 18:08 -0700, Martin Bligh wrote: > Andi Kleen wrote: > >>I think most likely it would crash on 2.6.18. Keith mannthey had reported > >>a different crash on 2.6.18-rc4-mm2 when this patch was introduced first > >>time. Following is the link to the thread. > > > > > > Then may

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-04 Thread Keith Mannthey
On 10/4/06, Martin Bligh <[EMAIL PROTECTED]> wrote: Andi Kleen wrote: >>I think most likely it would crash on 2.6.18. Keith mannthey had reported >>a different crash on 2.6.18-rc4-mm2 when this patch was introduced first >>time. Following is the link to the thread. > > > Then maybe trying 2.6.17

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-04 Thread Keith Mannthey
On 10/4/06, Andi Kleen <[EMAIL PROTECTED]> wrote: On Wednesday 04 October 2006 17:45, Andrew Morton wrote: > On Wed, 04 Oct 2006 08:42:28 -0500 > Steve Fox <[EMAIL PROTECTED]> wrote: > > > On Thu, 2006-09-28 at 14:01 -0700, Andrew Morton wrote: > > > On Thu, 28 Sep 2006 17:50:31 + (UTC) > > >

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-04 Thread Martin Bligh
Andi Kleen wrote: I think most likely it would crash on 2.6.18. Keith mannthey had reported a different crash on 2.6.18-rc4-mm2 when this patch was introduced first time. Following is the link to the thread. Then maybe trying 2.6.17 + the patch and then bisect between that and -rc4? I think

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-04 Thread Andi Kleen
> I think most likely it would crash on 2.6.18. Keith mannthey had reported > a different crash on 2.6.18-rc4-mm2 when this patch was introduced first > time. Following is the link to the thread. Then maybe trying 2.6.17 + the patch and then bisect between that and -rc4? -Andi - To unsubscribe f

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-04 Thread Vivek Goyal
On Wed, Oct 04, 2006 at 05:06:59PM -0700, Andrew Morton wrote: > On Wed, 04 Oct 2006 11:41:59 -0500 > Steve Fox <[EMAIL PROTECTED]> wrote: > > > On Wed, 2006-10-04 at 08:45 -0700, Andrew Morton wrote: > > > On Wed, 04 Oct 2006 08:42:28 -0500 > > > Steve Fox <[EMAIL PROTECTED]> wrote: > > > > Sorry

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-04 Thread Andrew Morton
On Wed, 04 Oct 2006 11:41:59 -0500 Steve Fox <[EMAIL PROTECTED]> wrote: > On Wed, 2006-10-04 at 08:45 -0700, Andrew Morton wrote: > > On Wed, 04 Oct 2006 08:42:28 -0500 > > Steve Fox <[EMAIL PROTECTED]> wrote: > > > Sorry for the delay. I was finally able to perform a bisect on this. It > > > turn

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-04 Thread Steve Fox
On Wed, 2006-10-04 at 08:45 -0700, Andrew Morton wrote: > On Wed, 04 Oct 2006 08:42:28 -0500 > Steve Fox <[EMAIL PROTECTED]> wrote: > > Sorry for the delay. I was finally able to perform a bisect on this. It > > turns out the patch that causes this is > > x86_64-mm-re-positioning-the-bss-segment.pa

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-04 Thread Vivek Goyal
On Wed, Oct 04, 2006 at 08:45:40AM -0700, Andrew Morton wrote: > On Wed, 04 Oct 2006 08:42:28 -0500 > Steve Fox <[EMAIL PROTECTED]> wrote: > > > On Thu, 2006-09-28 at 14:01 -0700, Andrew Morton wrote: > > > On Thu, 28 Sep 2006 17:50:31 + (UTC) > > > "Steve Fox" <[EMAIL PROTECTED]> wrote: > > >

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-04 Thread Andi Kleen
On Wednesday 04 October 2006 17:45, Andrew Morton wrote: > On Wed, 04 Oct 2006 08:42:28 -0500 > Steve Fox <[EMAIL PROTECTED]> wrote: > > > On Thu, 2006-09-28 at 14:01 -0700, Andrew Morton wrote: > > > On Thu, 28 Sep 2006 17:50:31 + (UTC) > > > "Steve Fox" <[EMAIL PROTECTED]> wrote: > > > > >

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-04 Thread Andrew Morton
On Wed, 04 Oct 2006 08:42:28 -0500 Steve Fox <[EMAIL PROTECTED]> wrote: > On Thu, 2006-09-28 at 14:01 -0700, Andrew Morton wrote: > > On Thu, 28 Sep 2006 17:50:31 + (UTC) > > "Steve Fox" <[EMAIL PROTECTED]> wrote: > > > > > On Thu, 28 Sep 2006 01:46:23 -0700, Andrew Morton wrote: > > > > > >

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-04 Thread Steve Fox
On Thu, 2006-09-28 at 14:01 -0700, Andrew Morton wrote: > On Thu, 28 Sep 2006 17:50:31 + (UTC) > "Steve Fox" <[EMAIL PROTECTED]> wrote: > > > On Thu, 28 Sep 2006 01:46:23 -0700, Andrew Morton wrote: > > > > > ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.18/2.6.18-mm2/ > >