> 21 sep 2015 kl. 10:28 skrev Julien Charbon <j...@freebsd.org>:
> 
> 
> Hi Palle,
> 
> On 18/09/15 22:42, Palle Girgensohn wrote:
>>> 18 sep 2015 kl. 18:06 skrev Konstantin Belousov
>>> <kostik...@gmail.com>:
>>> 
>>>> On Fri, Sep 18, 2015 at 03:56:25PM +0200, Julien Charbon wrote: 
>>>> Hi Palle,
>>>> 
>>>>> On 18/09/15 11:12, Palle Girgensohn wrote: We see daily panics
>>>>> on our production systems (web server, apache running MPM
>>>>> event, openjdk8. Kernel with VIMAGE. Jails using netgraph 
>>>>> interfaces [not epair]).
>>>>> 
>>>>> The problem started after the summer. Normal port upgrades
>>>>> seems to be the only difference. The problem occurs with
>>>>> 10.2-p2 kernel as well as 10.1-p4 and 10.1-p15.
>>>>> 
>>>>> https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=203175
>>>>> 
>>>>> Any ideas?
>>>> 
>>>> Thanks for you detailed report.  I am not aware of any
>>>> tcp_twclose() related issues (without VIMAGE) since FreeBSD 10.0
>>>> (does not mean there are none).  Few interesting facts (at least
>>>> for me):
>>>> 
>>>> - Your crash happens when unlocking a inp exclusive lock with
>>>> INP_WUNLOCK()
>>>> 
>>>> - Something is already wrong before calling turnstile_broadcast()
>>>> as it is called with ts = NULL:
>>> In the kernel without witness this is a 99%-sure indication of
>>> attempt to unlock not owned lock.
>>> 
>>>> 
>>>> turnstile_broadcast (ts=0x0, queue=1) at 
>>>> /usr/src/sys/kern/subr_turnstile.c:838 __rw_wunlock_hard () at
>>>> /usr/src/sys/kern/kern_rwlock.c:988 tcp_twclose () at
>>>> /usr/src/sys/netinet/tcp_timewait.c:540 tcp_tw_2msl_scan () at
>>>> /usr/src/sys/netinet/tcp_timewait.c:748 tcp_slowtimo () at
>>>> /usr/src/sys/netinet/tcp_timer.c:198
>>>> 
>>>> I won't go to far here as I am not expert enough in VIMAGE, but
>>>> one question anyway:
>>>> 
>>>> - Can you correlate this kernel panic to a particular event?
>>>> Like for example a VIMAGE/VNET jail destruction.
>>>> 
>>>> I will test that on my side on a 10.2 machine.
>> 
>> I just got a response from adrian@ where he seems to remember that it
>> has all been fixed in head.
>> 
>> I would really prefer not to run a head kernel in production unless I
>> have to, so the question is if it is possible to pin down the
>> specific fixes for this problem? Any suggestions?
>> 
>> Thanks for all the help so far!
> 
> On my side, all issues we have found in TCP stack are currently both
> fixed in 10.2 and HEAD.  The remaining differences are only performance
> improvements that are solely in HEAD.  adrian@ might have more details
> on fixes he has in mind.

Hi, 10.2 gives us the same sort of crash as 10.1.

Vi are now testing releng/10.1 with these two patches merged:

https://svnweb.freebsd.org/changeset/base/287261

https://svnweb.freebsd.org/changeset/base/287780


We have yet to see a crash, so it is looking vaguelly promising, but we have to 
wait and see.

Palle

PS. I've failed to mention that except VIMAGE +jails, the jail host is an NFS 
client as well. They NFS shares are mounted from the jail host, not the jails 
(since that is not possible anyway). DS.




_______________________________________________
freebsd-net@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-net
To unsubscribe, send any mail to "freebsd-net-unsubscr...@freebsd.org"

Reply via email to