Re: ip_defrag is broken (was: Re: test12 lockups -- need feedback)

2000-12-17 Thread Rusty Russell
In message <[EMAIL PROTECTED]> you write: >Date: Thu, 14 Dec 2000 15:35:48 -0500 (EST) >From: "Mohammad A. Haque" <[EMAIL PROTECTED]> > >I'll be trying in a few hours. > > Meanwhile for people wanting the crashes to be fixed, please > apply this patch. > > This was _always_ broken,

Re: test12 lockups -- need feedback

2000-12-15 Thread mike
I have a DLink DFE-530TX+ with a RTL8139 and I lock up cold every once in a while too. 2.4.0-test12-pre3 is the latest kernel I've tried. The machine is a dual PII450 on a Tyan Tiger 100 BX board w/ 128MB. Locks up cold meaning "It's dead Jim". Non sysrq facilities available and no Oops trail

Re: test12 lockups -- need feedback

2000-12-15 Thread Ingo Oeser
On Fri, Dec 15, 2000 at 06:06:58PM +, Alan Cox wrote: > > > This was on Cyrix III. > > Please include the oops information, as well as the /proc/cpuinfo output. > Also be sure you built Pentium/TSC kernels as Cyrix III is a 686 core without > the cmov instruction it seems I did. And built wit

Re: test12 lockups -- need feedback

2000-12-15 Thread Ingo Oeser
On Fri, Dec 15, 2000 at 09:52:22AM -0800, H. Peter Anvin wrote: > > This was on Cyrix III. > > Please include the oops information, as well as the /proc/cpuinfo output. processor : 0 vendor_id : CentaurHauls cpu family : 6 model : 6 model name : WinChip ?? steppin

Re: test12 lockups -- need feedback

2000-12-15 Thread Alan Cox
> > I also got a hard lockup (but with Oops) while calling the > > "vendor CPU init" function during system boot. > > > > This was on Cyrix III. > > PS: CC'ed hpa, because he is cpu-detection maintainer and davej, > >because he added Cyrix III support and might know details ;-) > > Please in

Re: test12 lockups -- need feedback

2000-12-15 Thread H. Peter Anvin
> > I have no Realtek-Card and have the same lockup. > > I also got a hard lockup (but with Oops) while calling the > "vendor CPU init" function during system boot. > > This was on Cyrix III. > > PS: CC'ed hpa, because he is cpu-detection maintainer and davej, >because he added Cyrix III s

Re: test12 lockups -- need feedback

2000-12-15 Thread Ingo Oeser
On Thu, Dec 14, 2000 at 06:52:34PM +, Eckhard Jokisch wrote: > Is it possible that there is something wrong with the 8139too driver? > ( I also use a card with 8139 chip ) > Or do you use the "old" rtl8139 ? With that I don't have any problems. > I have an extra machine here where I can do al

Re: test12 lockups -- need feedback

2000-12-15 Thread Ingo Oeser
On Thu, Dec 14, 2000 at 06:42:58AM -0500, Mohammad A. Haque wrote: > Hmmm, does syslog sending to another machine catch oops? I guess we'll > find out. No, I asked for the logs and he didn't receive any of them :-( Regards Ingo Oeser -- 10.+11.03.2001 - 3. Chemnitzer LinuxTag

Re: ip_defrag is broken (was: Re: test12 lockups -- need feedback)

2000-12-15 Thread Tom Leete
"Mohammad A. Haque" wrote: > > I do the following > > sudo modprobe iptable_nat > > Module Size Used by > iptable_nat17440 0 (unused) > ip_conntrack 19808 1 [iptable_nat] > ip_tables 12320 3 [iptable_nat] > > Oops start flying by w

Re: ip_defrag is broken (was: Re: test12 lockups -- need feedback)

2000-12-15 Thread Jasper Spaans
On Thu, Dec 14, 2000 at 05:50:35PM -0500, Mohammad A. Haque wrote: [zap] > Oops start flying by when I access via NFS. > > If you need the actual Oops messages we're gonna have to get someone > who can setup a serial console. I captured one on my console, anyone interested please drop me a not

Re: ip_defrag is broken (was: Re: test12 lockups -- need feedback)

2000-12-14 Thread Tom Leete
"David S. Miller" wrote: > >Date: Thu, 14 Dec 2000 15:35:48 -0500 (EST) >From: "Mohammad A. Haque" <[EMAIL PROTECTED]> > >I'll be trying in a few hours. > > Meanwhile for people wanting the crashes to be fixed, please > apply this patch. > > This was _always_ broken, and really wha

Re: Netfilter is broken (was Re: ip_defrag is broken (was: Re: test12 lockups -- need feedback))

2000-12-14 Thread Harald Welte
On Fri, Dec 15, 2000 at 01:48:32AM +0100, Andi Kleen wrote: > > Also is it sure that the backtrace involves ip_rcv ? A more likely > guess is that it happens during the IP_LOCAL_OUT hook, when skb->dev > isn't set yet, but conntrack already has to already reassemble fragments. Oh, thanks Andi.

Re: Netfilter is broken (was Re: ip_defrag is broken (was: Re: test12 lockups -- need feedback))

2000-12-14 Thread Andi Kleen
On Thu, Dec 14, 2000 at 04:11:10PM -0800, David S. Miller wrote: >Date: Fri, 15 Dec 2000 01:20:00 +0100 >From: Harald Welte <[EMAIL PROTECTED]> > >Or is there something wrong with: > >- packet arrives in net/ipv4/ip_input.c:ip_rcv() >- netfilter hook NF_IP_PRE_ROUTING is call

Re: Netfilter is broken (was Re: ip_defrag is broken (was: Re: test12 lockups -- need feedback))

2000-12-14 Thread David S. Miller
Date: Fri, 15 Dec 2000 01:20:00 +0100 From: Harald Welte <[EMAIL PROTECTED]> Or is there something wrong with: - packet arrives in net/ipv4/ip_input.c:ip_rcv() - netfilter hook NF_IP_PRE_ROUTING is called - net/ipv4/netfilter/ip_conntrack_core.c:ip_conntrack_in() is called -

Re: Netfilter is broken (was Re: ip_defrag is broken (was: Re: test12 lockups -- need feedback))

2000-12-14 Thread Harald Welte
On Thu, Dec 14, 2000 at 11:55:43AM -0800, David S. Miller wrote: >Date: Thu, 14 Dec 2000 12:07:38 -0800 (PST) >From: Ion Badulescu <[EMAIL PROTECTED]> > >I'm afraid I won't be able to answer this question, since I'm >leaving for a 3-week vacation in about 50 minutes and I need my

Re: ip_defrag is broken (was: Re: test12 lockups -- need feedback)

2000-12-14 Thread Mohammad A. Haque
Problem only happens when ip_conntrack is loaded. On Thu, 14 Dec 2000, Mohammad A. Haque wrote: > I do the following > > sudo modprobe iptable_nat > > Module Size Used by > iptable_nat17440 0 (unused) > ip_conntrack 19808 1 [iptable_nat] > ip_table

Re: ip_defrag is broken (was: Re: test12 lockups -- need feedback)

2000-12-14 Thread Mohammad A. Haque
I do the following sudo modprobe iptable_nat Module Size Used by iptable_nat17440 0 (unused) ip_conntrack 19808 1 [iptable_nat] ip_tables 12320 3 [iptable_nat] Oops start flying by when I access via NFS. If you need the actual Oop

Re: ip_defrag is broken (was: Re: test12 lockups -- need feedback)

2000-12-14 Thread Bob_Tracy
Ion Badulescu wrote: > On Thu, 14 Dec 2000 07:15:04 -0500, Mohammad A. Haque <[EMAIL PROTECTED]> wrote: > > Were you connected to a network or receiving/sending anything? > > ip_defrag is broken -- there is an obvious NULL pointer dereference > in it, introduced in test12. It doesn't hit normally

Re: ip_defrag is broken (was: Re: test12 lockups -- need feedback)

2000-12-14 Thread Mohammad A. Haque
Just quick feedback. Test 1: Netfilter compiled into kernel. Netfilter configuration options as modules. Modules loaded. Using NFS, I got Oops (in fact I've never seen an Oops output infinitely before. Maybe it would have stopped if I waited.) Test 2: Netf

Re: ip_defrag is broken (was: Re: test12 lockups -- need feedback)

2000-12-14 Thread Bob_Tracy
Ion Badulescu wrote: > On Thu, 14 Dec 2000 07:15:04 -0500, Mohammad A. Haque <[EMAIL PROTECTED]> wrote: > > Were you connected to a network or receiving/sending anything? > > ip_defrag is broken -- there is an obvious NULL pointer dereference > in it, introduced in test12. It doesn't hit normally

Re: ip_defrag is broken (was: Re: test12 lockups -- need feedback)

2000-12-14 Thread David S. Miller
Date: Thu, 14 Dec 2000 15:35:48 -0500 (EST) From: "Mohammad A. Haque" <[EMAIL PROTECTED]> I'll be trying in a few hours. Meanwhile for people wanting the crashes to be fixed, please apply this patch. This was _always_ broken, and really what netfilter is doing should have never worked.

Re: ip_defrag is broken (was: Re: test12 lockups -- need feedback)

2000-12-14 Thread Mohammad A. Haque
I'll be trying in a few hours. On Thu, 14 Dec 2000, Ion Badulescu wrote: > On Thu, 14 Dec 2000, David S. Miller wrote: > > > If you turn off netfilter, ip_conntrack, etc. does the OOPS still > > occur? > > I'm afraid I won't be able to answer this question, since I'm leaving for > a 3-week vacat

Netfilter is broken (was Re: ip_defrag is broken (was: Re: test12 lockups -- need feedback))

2000-12-14 Thread David S. Miller
Date: Thu, 14 Dec 2000 12:07:38 -0800 (PST) From: Ion Badulescu <[EMAIL PROTECTED]> I'm afraid I won't be able to answer this question, since I'm leaving for a 3-week vacation in about 50 minutes and I need my firewall functional until then. :-) Maybe other people who have seen

Re: ip_defrag is broken (was: Re: test12 lockups -- need feedback)

2000-12-14 Thread Ion Badulescu
On Thu, 14 Dec 2000, David S. Miller wrote: > If you turn off netfilter, ip_conntrack, etc. does the OOPS still > occur? I'm afraid I won't be able to answer this question, since I'm leaving for a 3-week vacation in about 50 minutes and I need my firewall functional until then. :-) Maybe other p

Re: ip_defrag is broken (was: Re: test12 lockups -- need feedback)

2000-12-14 Thread David S. Miller
Date: Thu, 14 Dec 2000 11:52:29 -0800 (PST) From: Ion Badulescu <[EMAIL PROTECTED]> The oops looks something like this. It was caught on serial console, and decoded on test11, so it doesn't have translation for module symbols. It if helps, this box is running ip_conntrack and th

Re: ip_defrag is broken (was: Re: test12 lockups -- need feedback)

2000-12-14 Thread Ion Badulescu
On Thu, 14 Dec 2000, David S. Miller wrote: >Date: Thu, 14 Dec 2000 10:38:01 -0800 >From: Ion Badulescu <[EMAIL PROTECTED]> > >I won't venture a fix, as I don't know the networking code well >enough. So far, no networking maintainer has had anything to say >about this bug

Re: ip_defrag is broken (was: Re: test12 lockups -- need feedback)

2000-12-14 Thread David S. Miller
Date:Thu, 14 Dec 2000 10:38:01 -0800 From: Ion Badulescu <[EMAIL PROTECTED]> I won't venture a fix, as I don't know the networking code well enough. So far, no networking maintainer has had anything to say about this bug on the list... Because this is the first most of us

ip_defrag is broken (was: Re: test12 lockups -- need feedback)

2000-12-14 Thread Ion Badulescu
On Thu, 14 Dec 2000 07:15:04 -0500, Mohammad A. Haque <[EMAIL PROTECTED]> wrote: > Were you connected to a network or receiving/sending anything? ip_defrag is broken -- there is an obvious NULL pointer dereference in it, introduced in test12. It doesn't hit normally, because of path MTU discovery

Re: test12 lockups -- need feedback

2000-12-14 Thread Eckhard Jokisch
Subject: Re: test12 lockups -- need feedback Date: Thu, 14 Dec 2000 15:31:38 + From: Eckhard Jokisch <[EMAIL PROTECTED]> To: dep <[EMAIL PROTECTED]> On Don, 14 Dez 2000, dep wrote: > On Thursday 14 December 2000 07:15, Mohammad A. Haque wrote: > | Were you connect

Re: test12 lockups -- need feedback

2000-12-14 Thread Bob_Tracy
Mohammad A. Haque wrote: > Were you connected to a network or receiving/sending anything? > > dep wrote: > > > > okay. got it here this morning, too. solid lock -- no dumping out of > > x, no changing terminals, no mouse, no keyboard. > > > > k6-2-550 @ 500; 256mb memory, fic 503a mb with via c

Re: test12 lockups -- need feedback

2000-12-14 Thread dep
On Thursday 14 December 2000 07:15, Mohammad A. Haque wrote: | Were you connected to a network or receiving/sending anything? a conditional yes -- little lan here, d-link dfe-530tx+ (rtl8139) to dlink hub, di-701 gateway, cable modem. so far as i know, i was neither sending nor receiving at the

Re: test12 lockups -- need feedback

2000-12-14 Thread Mohammad A. Haque
Were you connected to a network or receiving/sending anything? dep wrote: > > okay. got it here this morning, too. solid lock -- no dumping out of > x, no changing terminals, no mouse, no keyboard. > > k6-2-550 @ 500; 256mb memory, fic 503a mb with via chipset. kernel > built with gcc-2.95-2 ag

Re: test12 lockups -- need feedback

2000-12-14 Thread dep
okay. got it here this morning, too. solid lock -- no dumping out of x, no changing terminals, no mouse, no keyboard. k6-2-550 @ 500; 256mb memory, fic 503a mb with via chipset. kernel built with gcc-2.95-2 against glibc-2.2. nothing remarkable underway -- was composing a message in kmail, whi

Re: test12 lockups -- need feedback

2000-12-14 Thread Mohammad A. Haque
Hmmm, does syslog sending to another machine catch oops? I guess we'll find out. Ingo Oeser wrote: > I have no oops yet of this lockup, because of X, but I'll ask a > friend of mine, whether the remote logging made it to him and > send you the results. -- ==

Re: test12 lockups -- need feedback

2000-12-14 Thread Ingo Oeser
On Wed, Dec 13, 2000 at 10:48:56PM -0500, Mohammad A. Haque wrote: > Trace; c0105000 > Trace; c0100191 I locked a Cyrix III machine up on boot and hat these both elements in my trace, too. It Oopsed and locked up after the Message: "CPU: Before vendor init". I locked up too with another machi

Re: test12 lockups -- need feedback

2000-12-14 Thread Martin Bahlinger
In article <[EMAIL PROTECTED]> you wrote: > At first I thought it was just me when I reported the lockups I was > having with test12 earlier this week. Now the reports are flooding. Of > course, now my machine isn't locking up anymore after recompiling from a > clean source tree (test5 w/ patches

Re: test12 lockups -- need feedback

2000-12-13 Thread Mohammad A. Haque
Here we go folks. I hope I got everything right. The only place I have a doubt is the 0010: part of EIP. I couldn't read what I wrote there. Looks like it's ip fragment related? ksymoops 0.7c on i686 2.4.0-test11. Options used -V (default) -K (specified) -L (specified) -o /li

Re: test12 lockups -- need feedback

2000-12-13 Thread Mohammad A. Haque
Ok, got locked up. Dropped me into kdb and I was able to write down the oops after doing a ss on btp 0. I'll try to have something posted in an hour. On Wed, 13 Dec 2000, Mohammad A. Haque wrote: > At first I thought it was just me when I reported the lockups I was > having with test12 earlier

Re: test12 lockups -- need feedback

2000-12-13 Thread dep
On Wednesday 13 December 2000 19:29, Mikael Djurfeldt wrote: | > I downloaded the full test12 and have lockups after using X | > (upstream version 4.0.1Z) 15-45 mins. For me, SysRq+u works, but | > if I then press SysRq+b, nothing happens. There are no signs in | > the syslog. | | I should add

Re: test12 lockups -- need feedback

2000-12-13 Thread Mikael Djurfeldt
Mikael Djurfeldt <[EMAIL PROTECTED]> writes: > "Mohammad A. Haque" <[EMAIL PROTECTED]> writes: > > > Those of you who are having lockups, was test12 compiled from a patched > > tree that you've previously compiled? > > I downloaded the full test12 and have lockups after using X (upstream > vers

Re: test12 lockups -- need feedback

2000-12-13 Thread Mikael Djurfeldt
"Mohammad A. Haque" <[EMAIL PROTECTED]> writes: > Those of you who are having lockups, was test12 compiled from a patched > tree that you've previously compiled? I downloaded the full test12 and have lockups after using X (upstream version 4.0.1Z) 15-45 mins. For me, SysRq+u works, but if I the

test12 lockups -- need feedback

2000-12-13 Thread Mohammad A. Haque
At first I thought it was just me when I reported the lockups I was having with test12 earlier this week. Now the reports are flooding. Of course, now my machine isn't locking up anymore after recompiling from a clean source tree (test5 w/ patches through test12) Now, I'm trying to determine what