Re: 2.6.18-mm2 boot failure on x86-64

2006-10-17 Thread Mel Gorman
On Tue, 17 Oct 2006, Adrian Bunk wrote: On Mon, Oct 16, 2006 at 04:58:14PM -0700, Andrew Morton wrote: On Mon, 16 Oct 2006 14:16:13 -0400 Vivek Goyal <[EMAIL PROTECTED]> wrote: Can you please have a look at the attached patch Looks like a fine patch to me, although it could benefit from a

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-17 Thread Adrian Bunk
On Mon, Oct 16, 2006 at 04:58:14PM -0700, Andrew Morton wrote: > On Mon, 16 Oct 2006 14:16:13 -0400 > Vivek Goyal <[EMAIL PROTECTED]> wrote: > > > > > Can you please have a look at the attached patch > > Looks like a fine patch to me, although it could benefit from a comment > explaining why all

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-16 Thread Andrew Morton
On Mon, 16 Oct 2006 14:16:13 -0400 Vivek Goyal <[EMAIL PROTECTED]> wrote: > > Can you please have a look at the attached patch Looks like a fine patch to me, although it could benefit from a comment explaining why all those PAGE_ALIGN()s are in there. > and include it in -mm. Does it fix a pat

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-16 Thread Vivek Goyal
On Mon, Oct 09, 2006 at 10:53:58AM +0100, Mel Gorman wrote: > On Fri, 6 Oct 2006, Vivek Goyal wrote: > > >On Fri, Oct 06, 2006 at 01:03:50PM -0500, Steve Fox wrote: > >>On Fri, 2006-10-06 at 18:11 +0100, Mel Gorman wrote: > >>>On (06/10/06 11:36), Vivek Goyal didst pronounce: > Where is bss pl

Re: sky2 (was Re: 2.6.18-mm2)

2006-10-09 Thread Beber
On Sun, 8 Oct 2006 09:20:01 -0700, Stephen Hemminger <[EMAIL PROTECTED]> wrote: > On Sun, 08 Oct 2006 15:41:45 +0200 > Matthias Hentges <[EMAIL PROTECTED]> wrote: > > > Hi Stephen, > > > > I believe I have identified the problem. The freeze only happens when > > your debug patch to work aroun

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-09 Thread Mel Gorman
On Fri, 6 Oct 2006, Vivek Goyal wrote: On Fri, Oct 06, 2006 at 01:03:50PM -0500, Steve Fox wrote: On Fri, 2006-10-06 at 18:11 +0100, Mel Gorman wrote: On (06/10/06 11:36), Vivek Goyal didst pronounce: Where is bss placed in physical memory? I guess bss_start and bss_stop from System.map will

Re: sky2 (was Re: 2.6.18-mm2)

2006-10-08 Thread Matthias Hentges
On Sunday, 08.10.2006 at 09:20 -0700, Stephen Hemminger wrote: > On Sun, 08 Oct 2006 15:41:45 +0200 > Matthias Hentges <[EMAIL PROTECTED]> wrote: [...] > > I have verified this behavior (works fine w/o debug patch, freezes with > > patch applied) with: > > - 2.6.19-rc1-git4 > > - 2.6.18-git

Re: sky2 (was Re: 2.6.18-mm2)

2006-10-08 Thread Stephen Hemminger
On Sun, 08 Oct 2006 15:41:45 +0200 Matthias Hentges <[EMAIL PROTECTED]> wrote: > Hi Stephen, > > I believe I have identified the problem. The freeze only happens when > your debug patch to work around sky2 PCIe error messages is applied. > Without your patch (attached) I get _tons_ of error messa

Re: sky2 (was Re: 2.6.18-mm2)

2006-10-08 Thread Matthias Hentges
Hi Stephen, I believe I have identified the problem. The freeze only happens when your debug patch to work around sky2 PCIe error messages is applied. Without your patch (attached) I get _tons_ of error messages and the NIC dies every few seconds / minutes (reproducible!), but the system recovers

Re: sky2 (was Re: 2.6.18-mm2)

2006-10-07 Thread Matthias Hentges
sky2 related since the freeze can be > > triggered by continuous network traffic (like playing a movie over NFS > > etc.). > > When it fixes what does the log say. I'm probably going to back out > the PCI express extended error using the pci_XXX functions. > > The

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-06 Thread Vivek Goyal
On Fri, Oct 06, 2006 at 01:03:50PM -0500, Steve Fox wrote: > On Fri, 2006-10-06 at 18:11 +0100, Mel Gorman wrote: > > On (06/10/06 11:36), Vivek Goyal didst pronounce: > > > Where is bss placed in physical memory? I guess bss_start and bss_stop > > > from System.map will tell us. That will confirm

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-06 Thread Steve Fox
On Fri, 2006-10-06 at 18:11 +0100, Mel Gorman wrote: > On (06/10/06 11:36), Vivek Goyal didst pronounce: > > Where is bss placed in physical memory? I guess bss_start and bss_stop > > from System.map will tell us. That will confirm that above memset step is > > stomping over bss. Then we have to ju

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-06 Thread Vivek Goyal
On Fri, Oct 06, 2006 at 06:11:05PM +0100, Mel Gorman wrote: > On (06/10/06 11:36), Vivek Goyal didst pronounce: > > On Fri, Oct 06, 2006 at 03:33:12PM +0100, Mel Gorman wrote: > > > > Linux version 2.6.18-git22 ([EMAIL PROTECTED]) (gcc version 4.1.0 (SUSE > > > > Linux)) #2 SMP Thu Oct 5 19:05:36

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-06 Thread Vivek Goyal
On Fri, Oct 06, 2006 at 06:11:05PM +0100, Mel Gorman wrote: > On (06/10/06 11:36), Vivek Goyal didst pronounce: > > On Fri, Oct 06, 2006 at 03:33:12PM +0100, Mel Gorman wrote: > > > > Linux version 2.6.18-git22 ([EMAIL PROTECTED]) (gcc version 4.1.0 (SUSE > > > > Linux)) #2 SMP Thu Oct 5 19:05:36

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-06 Thread Mel Gorman
On (06/10/06 11:36), Vivek Goyal didst pronounce: > On Fri, Oct 06, 2006 at 03:33:12PM +0100, Mel Gorman wrote: > > > Linux version 2.6.18-git22 ([EMAIL PROTECTED]) (gcc version 4.1.0 (SUSE > > > Linux)) #2 SMP Thu Oct 5 19:05:36 PDT 2006 > > > Command line: root=/dev/sda1 vga=791 > > > ip=9.47.

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-06 Thread Vivek Goyal
On Fri, Oct 06, 2006 at 03:33:12PM +0100, Mel Gorman wrote: > > Linux version 2.6.18-git22 ([EMAIL PROTECTED]) (gcc version 4.1.0 (SUSE > > Linux)) #2 SMP Thu Oct 5 19:05:36 PDT 2006 > > Command line: root=/dev/sda1 vga=791 > > ip=9.47.67.239:9.47.67.50:9.47.67.1:255.255.255.0 resume=/dev/sdb1 s

Re: 2.6.18-mm2 boot failure on x86-64 II

2006-10-05 Thread Andrew Morton
On Thu, 05 Oct 2006 17:02:54 -0700 Badari Pulavarty <[EMAIL PROTECTED]> wrote: > > Code: 0f 0b 48 8b 3d 15 ab 1e 00 be d0 00 00 00 e8 c0 f5 ff ff 48 > > RIP [] init_list+0x1d/0xfd > > RSP > > <0>Kernel panic - not syncing: Attempted to kill the idle task! > > > > > > I am going to revert the p

Re: 2.6.18-mm2 boot failure on x86-64 II

2006-10-05 Thread Badari Pulavarty
keith mannthey wrote: On Fri, 2006-10-06 at 01:35 +0200, Andi Kleen wrote: As of yet I haven't been able to recreate the hang. I am running similar HW to Steve. I ran into this with -mm3 Memory: 24150368k/26738688k available (1933k kernel code, 490260k reserved, 978k data, 308k in

Re: 2.6.18-mm2 boot failure on x86-64 II

2006-10-05 Thread keith mannthey
On Fri, 2006-10-06 at 01:35 +0200, Andi Kleen wrote: > > As of yet I haven't been able to recreate the hang. I am running > > similar HW to Steve. I ran into this with -mm3 Memory: 24150368k/26738688k available (1933k kernel code, 490260k reserved, 978k data, 308k init) [ cut here ]

Re: 2.6.18-mm2 boot failure on x86-64 II

2006-10-05 Thread Andi Kleen
> As of yet I haven't been able to recreate the hang. I am running > similar HW to Steve. That was on a 4 core Opteron with Tyan board (S2881) and AMD-8111 chipset. -Andi - To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to [EMAIL PROTECTED] More ma

Re: 2.6.18-mm2 boot failure on x86-64 II

2006-10-05 Thread keith mannthey
On Fri, 2006-10-06 at 01:14 +0200, Andi Kleen wrote: > On Thursday 05 October 2006 22:51, Andi Kleen wrote: > > > > > hmm, rather than bugging you with patches now, I'll see what I can find > > > with the x86_64 machines I have access to and see can I reproduce it. > > > > I started the bisect,

Re: 2.6.18-mm2 boot failure on x86-64 II

2006-10-05 Thread Andi Kleen
On Thursday 05 October 2006 22:51, Andi Kleen wrote: > > > hmm, rather than bugging you with patches now, I'll see what I can find > > with the x86_64 machines I have access to and see can I reproduce it. > > I started the bisect, should finish soon. It ended at diff-tree d5cdb67236dba94496de

Re: 2.6.18-mm2 - oops in cache_alloc_refill()

2006-10-05 Thread Jean Tourrilhes
On Thu, Oct 05, 2006 at 06:37:53PM -0400, Pavel Roskin wrote: > Hello! > > On Tue, 2006-10-03 at 09:34 -0700, Jean Tourrilhes wrote: > > I don't really want to overstep my authority there, my goal > > was to minimise the changes. Pavel will have to clean up my mess, so I > > don't want change

Re: 2.6.18-mm2 - oops in cache_alloc_refill()

2006-10-05 Thread Pavel Roskin
Hello! On Tue, 2006-10-03 at 09:34 -0700, Jean Tourrilhes wrote: > I don't really want to overstep my authority there, my goal > was to minimise the changes. Pavel will have to clean up my mess, so I > don't want change things too much. Sorry for a long delay. I'm actually not very interes

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Andi Kleen
> hmm, rather than bugging you with patches now, I'll see what I can find > with the x86_64 machines I have access to and see can I reproduce it. I started the bisect, should finish soon. -Andi

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Andi Kleen
On Thursday 05 October 2006 22:42, Steve Fox wrote: > On Thu, 2006-10-05 at 21:05 +0200, Andi Kleen wrote: > > > Can you please try it again with this patch to narrow it down further? > > Unfortunately this is as far as it got before it hung. Boot with earlyprintk=serial,ttyS0,57600 (or change t

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Steve Fox
On Thu, 2006-10-05 at 21:05 +0200, Andi Kleen wrote: > Can you please try it again with this patch to narrow it down further? Unfortunately this is as far as it got before it hung. root (hd0,0) Filesystem type is reiserfs, partition type 0x83 kernel /boot/vmlinuz-autobench root=/dev/sda1 vga=79

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Mel Gorman
On Thu, 5 Oct 2006, Andi Kleen wrote: On Thursday 05 October 2006 20:52, Vivek Goyal wrote: On Thu, Oct 05, 2006 at 08:27:02PM +0200, Andi Kleen wrote: On Thursday 05 October 2006 19:57, Steve Fox wrote: On Thu, 2006-10-05 at 17:40 +0200, Andi Kleen wrote: Please don't snip the Code: line.

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Steve Fox
On Thu, 2006-10-05 at 21:08 +0200, Andi Kleen wrote: > Mel might want to take a look (and perhaps > also cut down a little on the ugly printks ...) I tested a patch from Mel which backs out the arch independent zone sizing and got the same results (to my inexperienced eye). I've sent him the boo

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Andi Kleen
On Thursday 05 October 2006 20:52, Vivek Goyal wrote: > On Thu, Oct 05, 2006 at 08:27:02PM +0200, Andi Kleen wrote: > > On Thursday 05 October 2006 19:57, Steve Fox wrote: > > > On Thu, 2006-10-05 at 17:40 +0200, Andi Kleen wrote: > > > > > > > Please don't snip the Code: line. It is fairly import

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Andi Kleen
On Thursday 05 October 2006 20:51, Steve Fox wrote: > On Thu, 2006-10-05 at 20:27 +0200, Andi Kleen wrote: > > > I guess we need to track when it gets corrupted. Can you send the full > > boot log with this patch applied? > > Here she blows! Can you please try it again with this patch to narrow

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Steve Fox
On Thu, 2006-10-05 at 20:27 +0200, Andi Kleen wrote: > I guess we need to track when it gets corrupted. Can you send the full > boot log with this patch applied? Here she blows! root (hd0,0) Filesystem type is reiserfs, partition type 0x83 kernel /boot/vmlinuz-autobench root=/dev/sda1 vga=791 i

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Vivek Goyal
On Thu, Oct 05, 2006 at 08:27:02PM +0200, Andi Kleen wrote: > On Thursday 05 October 2006 19:57, Steve Fox wrote: > > On Thu, 2006-10-05 at 17:40 +0200, Andi Kleen wrote: > > > > > Please don't snip the Code: line. It is fairly important. > > > > Sorry about that. The remote console I was using a

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Andi Kleen
On Thursday 05 October 2006 19:57, Steve Fox wrote: > On Thu, 2006-10-05 at 17:40 +0200, Andi Kleen wrote: > > > Please don't snip the Code: line. It is fairly important. > > Sorry about that. The remote console I was using appears to overwrite > some text after I force the reboot. Here's a clean

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Steve Fox
On Thu, 2006-10-05 at 17:40 +0200, Andi Kleen wrote: > Please don't snip the Code: line. It is fairly important. Sorry about that. The remote console I was using appears to overwrite some text after I force the reboot. Here's a clean one. global Unable to handle kernel NULL poin

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Andi Kleen
On Thursday 05 October 2006 17:32, Steve Fox wrote: > On Thu, 2006-10-05 at 08:12 -0700, Badari Pulavarty wrote: > > > Can you post the latest panic stack again (with CONFIG_DEBUG_KERNEL) ? > > CONFIG_DEBUG_KERNEL should be on > > > Last time I couldn't match your instruction dump to any code s

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Steve Fox
On Thu, 2006-10-05 at 08:12 -0700, Badari Pulavarty wrote: > Can you post the latest panic stack again (with CONFIG_DEBUG_KERNEL) ? CONFIG_DEBUG_KERNEL should be on > Last time I couldn't match your instruction dump to any code segment > in the routine. And also, can you post your .config file.

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Badari Pulavarty
On Thu, 2006-10-05 at 09:53 -0500, Steve Fox wrote: > On Wed, 2006-10-04 at 18:08 -0700, Martin Bligh wrote: > > Andi Kleen wrote: > > >>I think most likely it would crash on 2.6.18. Keith mannthey had reported > > >>a different crash on 2.6.18-rc4-mm2 when this patch was introduced first > > >>tim

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-05 Thread Steve Fox
On Wed, 2006-10-04 at 18:08 -0700, Martin Bligh wrote: > Andi Kleen wrote: > >>I think most likely it would crash on 2.6.18. Keith mannthey had reported > >>a different crash on 2.6.18-rc4-mm2 when this patch was introduced first > >>time. Following is the link to the thread. > > > > > > Then may

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-04 Thread Keith Mannthey
On 10/4/06, Martin Bligh <[EMAIL PROTECTED]> wrote: Andi Kleen wrote: >>I think most likely it would crash on 2.6.18. Keith mannthey had reported >>a different crash on 2.6.18-rc4-mm2 when this patch was introduced first >>time. Following is the link to the thread. > > > Then maybe trying 2.6.17

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-04 Thread Keith Mannthey
Thu, 28 Sep 2006 17:50:31 + (UTC) > > > "Steve Fox" <[EMAIL PROTECTED]> wrote: > > > > > > > On Thu, 28 Sep 2006 01:46:23 -0700, Andrew Morton wrote: > > > > > > > > > ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/pat

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-04 Thread Martin Bligh
Andi Kleen wrote: I think most likely it would crash on 2.6.18. Keith mannthey had reported a different crash on 2.6.18-rc4-mm2 when this patch was introduced first time. Following is the link to the thread. Then maybe trying 2.6.17 + the patch and then bisect between that and -rc4? I think

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-04 Thread Andi Kleen
> I think most likely it would crash on 2.6.18. Keith mannthey had reported > a different crash on 2.6.18-rc4-mm2 when this patch was introduced first > time. Following is the link to the thread. Then maybe trying 2.6.17 + the patch and then bisect between that and -rc4? -Andi

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-04 Thread Vivek Goyal
On Wed, Oct 04, 2006 at 05:06:59PM -0700, Andrew Morton wrote: > On Wed, 04 Oct 2006 11:41:59 -0500 > Steve Fox <[EMAIL PROTECTED]> wrote: > > > On Wed, 2006-10-04 at 08:45 -0700, Andrew Morton wrote: > > > On Wed, 04 Oct 2006 08:42:28 -0500 > > > Steve Fox <[EMAIL PROTECTED]> wrote: > > > > Sorry

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-04 Thread Andrew Morton
On Wed, 04 Oct 2006 11:41:59 -0500 Steve Fox <[EMAIL PROTECTED]> wrote: > On Wed, 2006-10-04 at 08:45 -0700, Andrew Morton wrote: > > On Wed, 04 Oct 2006 08:42:28 -0500 > > Steve Fox <[EMAIL PROTECTED]> wrote: > > > Sorry for the delay. I was finally able to perform a bisect on this. It > > > turn

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-04 Thread Steve Fox
On Wed, 2006-10-04 at 08:45 -0700, Andrew Morton wrote: > On Wed, 04 Oct 2006 08:42:28 -0500 > Steve Fox <[EMAIL PROTECTED]> wrote: > > Sorry for the delay. I was finally able to perform a bisect on this. It > > turns out the patch that causes this is > > x86_64-mm-re-positioning-the-bss-segment.pa

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-04 Thread Vivek Goyal
> "Steve Fox" <[EMAIL PROTECTED]> wrote: > > > > > > > On Thu, 28 Sep 2006 01:46:23 -0700, Andrew Morton wrote: > > > > > > > > > ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.18/2.6.18-mm2/ > > > > > >

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-04 Thread Andi Kleen
> "Steve Fox" <[EMAIL PROTECTED]> wrote: > > > > > > > On Thu, 28 Sep 2006 01:46:23 -0700, Andrew Morton wrote: > > > > > > > > > ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.18/2.6.18-mm2/ > > > > > >

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-04 Thread Andrew Morton
Thu, 28 Sep 2006 01:46:23 -0700, Andrew Morton wrote: > > > > > > > ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.18/2.6.18-mm2/ > > > > > > Panic on boot. This machine booted 2.6.18-mm1 fine. em64t machine. > > > > > > TCP b

Re: 2.6.18-mm2 boot failure on x86-64

2006-10-04 Thread Steve Fox
On Thu, 2006-09-28 at 14:01 -0700, Andrew Morton wrote: > On Thu, 28 Sep 2006 17:50:31 + (UTC) > "Steve Fox" <[EMAIL PROTECTED]> wrote: > > > On Thu, 28 Sep 2006 01:46:23 -0700, Andrew Morton wrote: > > > > > ftp://ftp.kernel.org/pub/linux/ke

Re: sky2 (was Re: 2.6.18-mm2)

2006-10-03 Thread Stephen Hemminger
of the box in > question. I believe they are sky2 related since the freeze can be > triggered by continuous network traffic (like playing a movie over NFS > etc.). When it fixes what does the log say. I'm probably going to back out the PCI express extended error using the pci_XXX functi

Re: sky2 (was Re: 2.6.18-mm2)

2006-10-03 Thread Matthias Hentges
since seen random but reproducible freezes of the box in question. I believe they are sky2 related since the freeze can be triggered by continuous network traffic (like playing a movie over NFS etc.). The freezes only happen with 2.6.18-mm2 and 2.6.18-mm3. 2.6.18-mm1 works perfectly fine. I'

Re: 2.6.18-mm2 - oops in cache_alloc_refill()

2006-10-03 Thread Jean Tourrilhes
On Tue, Oct 03, 2006 at 06:45:35PM +0200, Samuel Tardieu wrote: > On 3/10, Jean Tourrilhes wrote: > > | > I suggest that you revert the memset() to IW_ESSID_MAX_SIZE+1 so that > | > the last byte is cleared as well. Or am I missing something? > | > | No, that would bring back the slab/memory ove

Re: 2.6.18-mm2 - oops in cache_alloc_refill()

2006-10-03 Thread Samuel Tardieu
On 3/10, Jean Tourrilhes wrote: | > I suggest that you revert the memset() to IW_ESSID_MAX_SIZE+1 so that | > the last byte is cleared as well. Or am I missing something? | | No, that would bring back the slab/memory overflow we are | trying to get rid of. Then I am puzzled by the function decl

Re: 2.6.18-mm2 - oops in cache_alloc_refill()

2006-10-03 Thread Jean Tourrilhes
On Tue, Oct 03, 2006 at 05:58:31PM +0200, Samuel Tardieu wrote: > > "Jean" == Jean Tourrilhes <[EMAIL PROTECTED]> writes: > > Jean> @@ -2500,9 +2501,9 @@ static int orinoco_hw_get_essid(struct o > Jean> len = le16_to_cpu(essidbuf.len); > Jean> BUG_ON(len > IW_ESSID_MAX_SIZE); >

Re: 2.6.18-mm2 - oops in cache_alloc_refill()

2006-10-03 Thread Samuel Tardieu
> "Jean" == Jean Tourrilhes <[EMAIL PROTECTED]> writes: Jean> @@ -2500,9 +2501,9 @@ static int orinoco_hw_get_essid(struct o Jean> len = le16_to_cpu(essidbuf.len); Jean> BUG_ON(len > IW_ESSID_MAX_SIZE); Jean> Jean> - memset(buf, 0, IW_ESSID_MAX_SIZE+1); Jean> + memset(buf, 0, IW_ESSID_MA

Re: 2.6.18-mm2 - oops in cache_alloc_refill()

2006-10-02 Thread Valdis . Kletnieks
On Mon, 02 Oct 2006 10:52:45 PDT, Jean Tourrilhes said: > On Fri, Sep 29, 2006 at 06:20:08PM -0700, Andrew Morton wrote: > > On Fri, 29 Sep 2006 20:01:54 -0400 > > > > > > % grep ioctl /tmp/foo2 | sort -u | more > > > ioctl(13, SIOCGIWESSID, 0xbfbcdb9c) = 0 > > > ioctl(13, SIOCGIWRANGE, 0xbfbc

Re: 2.6.18-mm2 - oops in cache_alloc_refill()

2006-10-02 Thread Jean Tourrilhes
On Fri, Sep 29, 2006 at 06:20:08PM -0700, Andrew Morton wrote: > On Fri, 29 Sep 2006 20:01:54 -0400 > > > > % grep ioctl /tmp/foo2 | sort -u | more > > ioctl(13, SIOCGIWESSID, 0xbfbcdb9c) = 0 > > ioctl(13, SIOCGIWRANGE, 0xbfbcdbdc) = 0 > > ioctl(13, SIOCGIWRATE, 0xbfbcdbbc) = 0 > > Y

Re: 2.6.18-mm2

2006-09-30 Thread Andrew Morton
On Sat, 30 Sep 2006 15:37:06 +0200 Tobias Diedrich <[EMAIL PROTECTED]> wrote: > Andrew Morton wrote: > > > - More updates to the MSI code. If your machine has Message Signalled > > Interrupts, please enable it and give it a try. > > I'm happy to report, t

Re: 2.6.18-mm2 - oops in cache_alloc_refill()

2006-09-30 Thread Andrew Morton
mmits - I reverted them and the > resulting 2.6.18-mm2 kernel has been up and stable for 4 hours, even with > the problem gkrellm updating once a second the whole time. > > I'm not *seeing* how those changes can cause trouble - unless it's this: > > diff --git a/driv

Re: 2.6.18-mm2 - oops in cache_alloc_refill()

2006-09-30 Thread Valdis . Kletnieks
On Fri, 29 Sep 2006 23:31:07 EDT, [EMAIL PROTECTED] said: > Fair enough, I'm going to try reverting the 2 commits and see if things > behave better. OK, it's definitely something in those 2 commits - I reverted them and the resulting 2.6.18-mm2 kernel has been up and stable f

Re: 2.6.18-mm2 - oops in cache_alloc_refill()

2006-09-29 Thread Valdis . Kletnieks
On Fri, 29 Sep 2006 18:33:48 PDT, Jean Tourrilhes said: > On Fri, Sep 29, 2006 at 06:20:08PM -0700, Andrew Morton wrote: > > On Fri, 29 Sep 2006 20:01:54 -0400 > > > > > > Here's the traceback I got: > > > > > > slab error in verify_redzone_free(): cache `size-32': memory outside > > > object wa

Re: 2.6.18-mm2 - oops in cache_alloc_refill()

2006-09-29 Thread Valdis . Kletnieks
On Fri, 29 Sep 2006 18:40:43 PDT, Jean Tourrilhes said: > On Fri, Sep 29, 2006 at 06:20:08PM -0700, Andrew Morton wrote: > > On Fri, 29 Sep 2006 20:01:54 -0400 > > > > > > A quick strace of gkrellm finds these likely ioctl's causing the problem: > > > > > > % grep ioctl /tmp/foo2 | sort -u | more

Re: 2.6.18-mm2 - oops in cache_alloc_refill()

2006-09-29 Thread Jean Tourrilhes
On Fri, Sep 29, 2006 at 06:20:08PM -0700, Andrew Morton wrote: > On Fri, 29 Sep 2006 20:01:54 -0400 > > > > A quick strace of gkrellm finds these likely ioctl's causing the problem: > > > > % grep ioctl /tmp/foo2 | sort -u | more > > ioctl(13, SIOCGIWESSID, 0xbfbcdb9c) = 0 > > ioctl(13, SIOCG

Re: 2.6.18-mm2 - oops in cache_alloc_refill()

2006-09-29 Thread Jean Tourrilhes
On Fri, Sep 29, 2006 at 06:20:08PM -0700, Andrew Morton wrote: > On Fri, 29 Sep 2006 20:01:54 -0400 > > > > Here's the traceback I got: > > > > slab error in verify_redzone_free(): cache `size-32': memory outside object > > was overwritten > > [] dump_trace+0x64/0x1cd > > [] show_trace_log_lvl+0

Re: 2.6.18-mm2 - oops in cache_alloc_refill()

2006-09-29 Thread Andrew Morton
On Fri, 29 Sep 2006 20:01:54 -0400 [EMAIL PROTECTED] wrote: > On Fri, 29 Sep 2006 12:45:58 PDT, Andrew Morton said: > > (Adding a bunch of people to the cc: list now that I have a clue what is > going on) > > > I'd expect it's the same bug - slab data structures have gone bad. > > *bing*! W

Re: sky2 (was Re: 2.6.18-mm2)

2006-09-29 Thread Matthias Hentges
Hello Stephen, On Thursday, 28.09.2006 at 16:19 -0700, Stephen Hemminger wrote: [...] > Here is the debug patch I sent to the first reporter of the problem. > I know what the offset is supposed to be, so if the PCI subsystem is > wrong, this will show. > > --- sky2.orig/drivers/net/sky2.c

Re: 2.6.18-mm2 - oops in cache_alloc_refill()

2006-09-29 Thread Valdis . Kletnieks
On Fri, 29 Sep 2006 12:45:58 PDT, Andrew Morton said: (Adding a bunch of people to the cc: list now that I have a clue what is going on) > I'd expect it's the same bug - slab data structures have gone bad. *bing*! We have a winner. A quick check showed the kernel wasn't built with slab debu

Re: sky2 (was Re: 2.6.18-mm2)

2006-09-29 Thread Andrea Gelmini
On Thu, Sep 28, 2006 at 04:30:23PM -0700, Stephen Hemminger wrote: > Note: I know what is causing all the sky2 problems, there is something wrong > that > is causing flow control negotiation not to propagate back to all the multiple > levels > of the chip. Unclear how to fix it, the documentation

Re: sky2 (was Re: 2.6.18-mm2)

2006-09-28 Thread Stephen Hemminger
> > See also http://bugzilla.kernel.org/show_bug.cgi?id=7222 > > That's two reports in 18 hours, from amongst the presumably-small population > of sky2-owning -mm testers. I'll back it out if we don't get a simple resolution. It was just trying to use the pci facilities as intended. Note: I kn

Re: sky2 (was Re: 2.6.18-mm2)

2006-09-28 Thread Andrew Morton
> > > To: Andrew Morton <[EMAIL PROTECTED]> > > Cc: linux-kernel@vger.kernel.org > > Subject: Re: 2.6.18-mm2 > > > > > > Hello all, > > > > I've just tested -mm2 on my C2D system and I'm getting a lot of these > > messages: >

Re: sky2 (was Re: 2.6.18-mm2)

2006-09-28 Thread Stephen Hemminger
> > > To: Andrew Morton <[EMAIL PROTECTED]> > > Cc: linux-kernel@vger.kernel.org > > Subject: Re: 2.6.18-mm2 > > > > > > Hello all, > > > > I've just tested -mm2 on my C2D system and I'm getting a lot of these > > messages: >

sky2 (was Re: 2.6.18-mm2)

2006-09-28 Thread Jeff Garzik
Andrew Morton wrote: Another customer.. Begin forwarded message: Date: Fri, 29 Sep 2006 00:44:01 +0200 From: Matthias Hentges <[EMAIL PROTECTED]> To: Andrew Morton <[EMAIL PROTECTED]> Cc: linux-kernel@vger.kernel.org Subject: Re: 2.6.18-mm2 Hello all, I've just tested -mm2

Re: 2.6.18-mm2

2006-09-28 Thread Stephen Hemminger
Morton wrote: > > > > > ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.18/2.6.18-mm2/ > > > > Panic on boot. This machine booted 2.6.18-mm1 fine. em64t machine. > > > > TCP bic registered > > TCP westwood registered > >

Re: 2.6.18-mm2

2006-09-28 Thread Andrew Morton
(please always do reply-to-all) On Thu, 28 Sep 2006 17:50:31 + (UTC) "Steve Fox" <[EMAIL PROTECTED]> wrote: > On Thu, 28 Sep 2006 01:46:23 -0700, Andrew Morton wrote: > > > ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.18/2.6.18-mm2/ >

Re: [Bugme-new] [Bug 7222] New: sky2 throws a lot of "pci express error" in 2.6.18-mm2 on amd64

2006-09-28 Thread Andrew Morton
On Thu, 28 Sep 2006 05:10:43 -0700 [EMAIL PROTECTED] wrote: > http://bugzilla.kernel.org/show_bug.cgi?id=7222 > >Summary: sky2 throws a lot of "pci express error" in 2.6.18-mm2 > on amd64 > Kernel Version: 2.6.18-mm2