Re: IPv6 not responding on some aliases (recent 8-stable)

2011-12-30 Thread Jason Hellenthal


On Fri, Dec 23, 2011 at 09:17:09AM +, Bjoern A. Zeeb wrote:
> 
> On 22. Dec 2011, at 20:39 , Marcin Cieslak wrote:
> 
> >>> Bjoern A. Zeeb  wrote:
> >>> I initially thought it's a transport layer issue, since previously (before
> >>> I changed configuration) 30%-50% SSH connection attempts succeeded
> >>> (but prefix was wrong on the "primary" IPv6 address :1000).
> >>> Now I get no packets on receiving side at all for those "broken" IPv6 
> >>> addresses.
> >> 
> >> Talk to ywhomever is providing in front of you to
> >> 1) either relax nd6 table limits or
> >> 2) to route a /64 to your host to only have 1 entry in the neighbour table.
> >> 
> >> That's most likely the problem given my crystal ball and experience.
> > 
> > Thank you for insightful analysis! 
> > Seems like this provider has some significant IPv6 takeup, which is
> > good news - sorry for hassle, but problems started after upgrade. 
> > 
> > I'll talk to my upstream then, thanks!
> 
> Please let us know of the results, especially if my crystal ball was wrong.
> 

I have seen this behavior before when one of the addresses on an interface is 
in a DMZ while the others are not. But this was with IPv4. I would assume IPv6 
would have acted the same way but left it untested as it was not critical. Take 
this as informational only and double check your switches, firewalls, etc...

-- 
;s =;


pgpnUtwZzzkKk.pgp
Description: PGP signature


Re: Panic in the udp_input() under heavy load

2011-12-30 Thread Maxim Sobolev

On 11/24/2011 11:24 PM, Robert N. M. Watson wrote:

There was recently a commit to fix a race condition in 10-CURRENT which
>>  I think is not slated to be merged for 9.0. You might check the commit
>>  logs there and see if that fixes the problems you have -- if so, we
>>  might want to reconsider the plan not to merge for 9.0.
>>
>>  (It relates to a race condition on closing sockets..)

>
>  Thank you for the tip. I will give it a try and see what happens. So far, 
after installing that trap we have not seen any panics yet. I have not checked 
logs yet if my trap actually has catch anything or not.

Do we know if this fix has been merged to stable/9 and releng/9.0? Given 
multiple reports of instability without it, I think we would be well-served to 
merge it at this point.


Hey Robert, sorry to bother you again, but can you at least point me 
towards specific SVN revisions that I need to get merged? I tried to 
google it and also browsed my svn history for the last 4 months using 
keyword "sockets", but nothing came up. This fix is critical for any 
system that does lot of fault-ctitical networking, and FreeBSD has 
always been solid in this regard. It saved us at least 10-15 crashes 
across 5 machines in the last month.


Thanks!

-Maxim
___
freebsd-net@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-net
To unsubscribe, send any mail to "freebsd-net-unsubscr...@freebsd.org"


Re: ng_mppc_decompress: too many (4094) packets dropped, disabling node

2011-12-30 Thread Gleb Smirnoff
  Sami,

On Tue, Dec 27, 2011 at 11:08:47PM +0200, Sami Halabi wrote:
S> Thanks for your patch, i applied it and its production already.
S> i had to stop mpd, and once started it i saw that all home routers
S> connected immediatly.
S> most of them don't use mppc, so I wonder why this problem happend in the
S> first place.
S> whats surprising i had few hours ago the same problem:
S> Dec 27 14:11:29 mpd2 kernel: ng_mppc_decompress: too many (4092) packets
S> dropped, disabling node 0xff000ba12700!
S> Dec 27 14:11:29 mpd2 kernel:
S> Dec 27 19:19:10 mpd2 kernel: ng_mppc_decompress: too many (4092) packets
S> dropped, disabling node 0xff0006307400!
S> Dec 27 19:19:10 mpd2 kernel:
S> 
S> i hope the patch will solve the problem for good now, i'll let it work for
S> the coming few days andif something will go wrong i'll paste you.

How is the patched kernel running?

Do you use L2TP or PPPoE?

/me uses PPTP mostly, which handles any reordering problems correctly, so
I can't even test my patch. If I produce a reordering event artifically,
the ng_pptpgre filters bogus packet out.

-- 
Totus tuus, Glebius.
___
freebsd-net@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-net
To unsubscribe, send any mail to "freebsd-net-unsubscr...@freebsd.org"


Re: Panic in the udp_input() under heavy load

2011-12-30 Thread Robert N. M. Watson

On 30 Dec 2011, at 19:48, Maxim Sobolev wrote:

> On 11/24/2011 11:24 PM, Robert N. M. Watson wrote:
 There was recently a commit to fix a race condition in 10-CURRENT which
 >>  I think is not slated to be merged for 9.0. You might check the commit
 >>  logs there and see if that fixes the problems you have -- if so, we
 >>  might want to reconsider the plan not to merge for 9.0.
 >>
 >>  (It relates to a race condition on closing sockets..)
>>> >
>>> >  Thank you for the tip. I will give it a try and see what happens. So 
>>> > far, after installing that trap we have not seen any panics yet. I have 
>>> > not checked logs yet if my trap actually has catch anything or not.
>> Do we know if this fix has been merged to stable/9 and releng/9.0? Given 
>> multiple reports of instability without it, I think we would be well-served 
>> to merge it at this point.
> 
> Hey Robert, sorry to bother you again, but can you at least point me towards 
> specific SVN revisions that I need to get merged? I tried to google it and 
> also browsed my svn history for the last 4 months using keyword "sockets", 
> but nothing came up. This fix is critical for any system that does lot of 
> fault-ctitical networking, and FreeBSD has always been solid in this regard. 
> It saved us at least 10-15 crashes across 5 machines in the last month.

Hi Maxim:

Looking back at a recent post from you, it appears that you are on 8.x and not 
9.x, as I had assumed form your original e-mail. The patch I was referring to 
in 9-CURRENT has long since been merged for 9.0 and will appear in that 
release. However, it does not apply to 8.x, as the bug it fixed was introduced 
during the 9.x cycle. We'll need to do a from-scratch diagnosis here rather 
than assume it's the same problem. Could I ask you to follow up to this post 
with version information, stack traces from relevant threads, etc? I am not 
aware of any other reports of UDP-related crashes along the lines of what 
you've described in 8.x, so it may be being triggered by some unusual aspect of 
your workload (or just bad luck). Sorry that there's no instant "merge a patch" 
fix for this one.

Thanks,

Robert___
freebsd-net@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-net
To unsubscribe, send any mail to "freebsd-net-unsubscr...@freebsd.org"


Re: IPv6 not responding on some aliases (recent 8-stable)

2011-12-30 Thread Marcin Cieslak
> I have seen this behavior before when one of the addresses on an interface =
> is in a DMZ while the others are not. But this was with IPv4. I would assum=
> e IPv6 would have acted the same way but left it untested as it was not cri=
> tical. Take this as informational only and double check your switches, fire=
> walls, etc...

Unfortunately, this is a hosting provider. I have rebooted the box
to use their custom rescue netboot image (based on FreeBSD 8.0 running
on QEMU) and ... still one of the addresses didn't work in this setup.
However, two reboots later situation returned to normal, and all
IPv6 addresses respond. NDP table theory sounds plausible to me,
except... connection establishment to the IPv6 address port 22/tcp
takes sometimes noticeably too long (other TCP ports are usually fine).

But this is probably another story...

//Marcin

___
freebsd-net@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-net
To unsubscribe, send any mail to "freebsd-net-unsubscr...@freebsd.org"


Re: Panic in the udp_input() under heavy load

2011-12-30 Thread Maxim Sobolev

On 12/30/2011 3:31 PM, Robert N. M. Watson wrote:

Looking back at a recent post from you, it appears that you are on 8.x and not 9.x, as I 
had assumed form your original e-mail. The patch I was referring to in 9-CURRENT has long 
since been merged for 9.0 and will appear in that release. However, it does not apply to 
8.x, as the bug it fixed was introduced during the 9.x cycle. We'll need to do a 
from-scratch diagnosis here rather than assume it's the same problem. Could I ask you to 
follow up to this post with version information, stack traces from relevant threads, etc? 
I am not aware of any other reports of UDP-related crashes along the lines of what you've 
described in 8.x, so it may be being triggered by some unusual aspect of your workload 
(or just bad luck). Sorry that there's no instant "merge a patch" fix for this 
one.


I see. Would you guys mind if I put that NULL pointer check into the 
code for the time being and turn it into some kind of big nasty warning 
in 8-stable branch only? As far as I can tell, the most harm it can do 
is some memory being leaked. And this condition happens just few times a 
day, so it's not a biggie.


By the way it would be interesting project to put libexecinfo interface 
into the FreeBSD kernel. It would provide interesting new aspect of 
debugging such rare non-critical conditions by dumping stack traces and 
possibly doing some damage control instead of plainly crashing.


-Maxim
___
freebsd-net@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-net
To unsubscribe, send any mail to "freebsd-net-unsubscr...@freebsd.org"


Re: Panic in the udp_input() under heavy load

2011-12-30 Thread Maxim Sobolev

On 12/30/2011 4:46 PM, Maxim Sobolev wrote:

I see. Would you guys mind if I put that NULL pointer check into the
code for the time being and turn it into some kind of big nasty warning
in 8-stable branch only?


I could also open a ticket, put all debug information collected to date 
in there. And encourage people to report to it once they see this 
warning on their system. Then it would provide more information about 
the exposure. It is definitely looks like locking issue somewhere, not 
just bad luck or flaky hardware, as we see it happening consistently on 
top 4 most UDP-loaded systems here, and it correlates well with the 
load. With my small NULL catch the machines have been running happily 
for a month now, so there is no visible side-effects.


-Maxim
___
freebsd-net@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-net
To unsubscribe, send any mail to "freebsd-net-unsubscr...@freebsd.org"


RE: kern/161908: [netgraph] [patch] ng_vlan update for QinQ support

2011-12-30 Thread rozhuk . im

Who can commit?


> -Original Message-
> From: owner-freebsd-...@freebsd.org [mailto:owner-freebsd-
> n...@freebsd.org] On Behalf Of ead...@freebsd.org
> Sent: Monday, October 24, 2011 7:35 AM
> To: ead...@freebsd.org; freebsd-b...@freebsd.org; freebsd-
> n...@freebsd.org
> Subject: Re: kern/161908: [netgraph] [patch] ng_vlan update for QinQ
> support
> 
> Old Synopsis: ng_vlan update for QinQ support New Synopsis: [netgraph]
> [patch] ng_vlan update for QinQ support
> 
> Responsible-Changed-From-To: freebsd-bugs->freebsd-net
> Responsible-Changed-By: eadler
> Responsible-Changed-When: Sun Oct 23 22:34:17 UTC 2011
> Responsible-Changed-Why:
> assign and add tags
> 
> http://www.freebsd.org/cgi/query-pr.cgi?pr=161908
> ___
> freebsd-net@freebsd.org mailing list
> http://lists.freebsd.org/mailman/listinfo/freebsd-net
> To unsubscribe, send any mail to "freebsd-net-unsubscr...@freebsd.org"


ng_vlan.patch
Description: Binary data
___
freebsd-net@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-net
To unsubscribe, send any mail to "freebsd-net-unsubscr...@freebsd.org"

Re: IPv6 not responding on some aliases (recent 8-stable)

2011-12-30 Thread Jason Hellenthal


On Sat, Dec 31, 2011 at 12:35:00AM +, Marcin Cieslak wrote:
> > I have seen this behavior before when one of the addresses on an interface =
> > is in a DMZ while the others are not. But this was with IPv4. I would assum=
> > e IPv6 would have acted the same way but left it untested as it was not cri=
> > tical. Take this as informational only and double check your switches, fire=
> > walls, etc...
> 
> Unfortunately, this is a hosting provider. I have rebooted the box
> to use their custom rescue netboot image (based on FreeBSD 8.0 running
> on QEMU) and ... still one of the addresses didn't work in this setup.
> However, two reboots later situation returned to normal, and all
> IPv6 addresses respond. NDP table theory sounds plausible to me,
> except... connection establishment to the IPv6 address port 22/tcp
> takes sometimes noticeably too long (other TCP ports are usually fine).
> 
> But this is probably another story...
> 

Speaking just in the terms of too long of a connection wait on port 22. 
Firewall off port 113 in and out.

block drop  in  log quick proto tcp to any port = auth
block   out quick proto tcp to any port = auth


That should help in terms of the speed at which you connect. Good luck with the 
OP though.


Happy Holiday...

-- 
;s =;


pgpco2n0N9ykB.pgp
Description: PGP signature


Re: kern/161908: [netgraph] [patch] ng_vlan update for QinQ support

2011-12-30 Thread Adrian Chadd
On 30 December 2011 21:48,   wrote:
>
> Who can commit?
>

Bug me after January 3rd. :)


Adrian
___
freebsd-net@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-net
To unsubscribe, send any mail to "freebsd-net-unsubscr...@freebsd.org"