Re: Core Dump / panic sleeping thread

2013-03-21 Thread Konstantin Belousov
On Thu, Mar 21, 2013 at 07:59:25PM +0100, Michael Landin Hostbaek wrote: > > On Mar 21, 2013, at 8:58 AM, Konstantin Belousov wrote: > > > On Wed, Mar 20, 2013 at 09:14:37PM -0400, Rick Macklem wrote: > >> Well, read/write sharing of files over NFS is pretty rare, so I s

Re: Nullfs leaks i-nodes

2013-05-08 Thread Konstantin Belousov
On Tue, May 07, 2013 at 08:30:06AM +0200, Göran Löwkrantz wrote: > I created a PR, kern/178238, on this but would like to know if anyone has > any ideas or patches? > > Have updated the system where I see this to FreeBSD 9.1-STABLE #0 r250229 > and still have the problem. The patch below shou

Re: Apparent regression in r250359

2013-05-08 Thread Konstantin Belousov
On Wed, May 08, 2013 at 01:14:52PM -0400, Jim Ohlstein wrote: > Hello, > > I upgraded my (custom) kernel earlier and found that multiple daemons > (cups, hal, syslog, ntpd, csh) crashed and dumped cores at or shortly > after boot. > > The error I saw several times on the console was: > > set_f

Re: Nullfs leaks i-nodes

2013-05-09 Thread Konstantin Belousov
On Thu, May 09, 2013 at 09:02:56AM +0200, Peter Holm wrote: > On Wed, May 08, 2013 at 12:13:17PM +0300, Konstantin Belousov wrote: > > On Tue, May 07, 2013 at 08:30:06AM +0200, Göran Löwkrantz wrote: > > > I created a PR, kern/178238, on this but would like to know if anyone ha

Re: Nullfs leaks i-nodes

2013-05-09 Thread Konstantin Belousov
On Thu, May 09, 2013 at 02:11:44PM +0200, Goran Lowkrantz wrote: > I assume this is CURRENT? Tried on STABLE but got this: > cc1: warnings being treated as errors > /usr/src/sys/kern/vfs_subr.c: In function 'vfs_notify_upper': > /usr/src/sys/kern/vfs_subr.c:2801: warning: implicit declaration of >

Re: Apparent regression in r250359

2013-05-09 Thread Konstantin Belousov
On Thu, May 09, 2013 at 10:13:15AM -0400, Jim Ohlstein wrote: > # sysctl hw.model > hw.model: AMD FX(tm)-8350 Eight-Core Processor Ahh, so it seems that this is a CPU with the LWP. Please try the patch at the end of message. As another workaround, which does not disable AVX support, you could try

Re: Apparent regression in r250359

2013-05-09 Thread Konstantin Belousov
On Thu, May 09, 2013 at 11:42:28AM -0400, Jim Ohlstein wrote: > On 05/09/13 10:30, Konstantin Belousov wrote: > > On Thu, May 09, 2013 at 10:13:15AM -0400, Jim Ohlstein wrote: > >> # sysctl hw.model > >> hw.model: AMD FX(tm)-8350 Eight-Core Processor > > Ahh, so

Re: Apparent regression in r250359

2013-05-09 Thread Konstantin Belousov
On Thu, May 09, 2013 at 01:17:28PM -0400, Jim Ohlstein wrote: > On 05/09/13 12:04, Konstantin Belousov wrote: > > Hm, I see another bug in the next line as well. Could you try this > > updated patch ? > > This does work. Committed to head, should be merged back to s

Re: Does quota still require a custom kernel in FreeBSD 9?

2013-06-13 Thread Konstantin Belousov
On Thu, Jun 13, 2013 at 09:44:15PM +0200, Peter Olsson wrote: > Just wondering if a custom kernel with options QUOTA > is still needed in FreeBSD 9. I have no test server > available right now to test this. > > According to the handbook it is needed, but I found > in a post from 2011 that it would

Re: system sporadically hangs on shutdown after switching to WITH_NEW_XORG

2013-06-16 Thread Konstantin Belousov
On Sun, Jun 16, 2013 at 05:11:15PM +0200, Michiel Boland wrote: > Hi. Recently I switched to WITH_NEW_XORG, primarily because the stock X > server > with Intel driver has some issues that make it unusable for me. > > The new X server and Intel driver works extremely well, so kudos to whoever >

Re: system sporadically hangs on shutdown after switching to WITH_NEW_XORG

2013-06-16 Thread Konstantin Belousov
On Sun, Jun 16, 2013 at 07:12:33PM +0200, Michiel Boland wrote: > On 06/16/2013 17:37, Konstantin Belousov wrote: > [...] > > I do not see anything related to i915 in the core.txt you provided. > > > > Next time the machine hangs, start with the output of ps command from

Re: system sporadically hangs on shutdown after switching to WITH_NEW_XORG

2013-06-16 Thread Konstantin Belousov
On Sun, Jun 16, 2013 at 08:06:21PM +0200, Michiel Boland wrote: > On 06/16/2013 19:46, Konstantin Belousov wrote: > > On Sun, Jun 16, 2013 at 07:12:33PM +0200, Michiel Boland wrote: > >> On 06/16/2013 17:37, Konstantin Belousov wrote: > >> [...] > >>> I do

Re: system sporadically hangs on shutdown after switching to WITH_NEW_XORG

2013-06-17 Thread Konstantin Belousov
On Mon, Jun 17, 2013 at 09:16:56PM +0200, Michiel Boland wrote: > On 06/16/2013 17:11, Michiel Boland wrote: > > Hi. Recently I switched to WITH_NEW_XORG, primarily because the stock X > > server > > with Intel driver has some issues that make it unusable for me. > > > > The new X server and Intel

Re: FreeBSD-9.1: machine reboots during snapshot creation, LORs found

2013-07-03 Thread Konstantin Belousov
On Thu, Jul 04, 2013 at 07:14:09AM +0200, Andre Albsmeier wrote: > On Mon, 17-Jun-2013 at 21:30:31 +0200, John Baldwin wrote: > > On Sunday, June 16, 2013 2:39:42 am Andre Albsmeier wrote: > > > On Fri, 31-May-2013 at 16:51:03 +0200, John Baldwin wrote: > > > > On Friday, May 31, 2013 8:26:11 am An

Re: FreeBSD-9.1: machine reboots during snapshot creation, LORs found

2013-07-03 Thread Konstantin Belousov
On Thu, Jul 04, 2013 at 07:27:00AM +0200, Andre Albsmeier wrote: > On Thu, 04-Jul-2013 at 07:24:40 +0200, Konstantin Belousov wrote: > > On Thu, Jul 04, 2013 at 07:14:09AM +0200, Andre Albsmeier wrote: > > > On Mon, 17-Jun-2013 at 21:30:31 +0200, John Baldwin wrote: > >

Re: FreeBSD-9.1: machine reboots during snapshot creation, LORs found

2013-07-04 Thread Konstantin Belousov
On Thu, Jul 04, 2013 at 04:29:19PM +0200, Andre Albsmeier wrote: > OK, patch is applied. I will reboot the machine later > and see what happens tomorrow in the morning. However, > it might take a few days since the last 2 weeks all was > fine. > > BTW, should this patch be used in general or is it

Re: Phantom nulls in usbdevs.h during 9-STABLE kernel build

2013-07-04 Thread Konstantin Belousov
On Thu, Jul 04, 2013 at 07:38:24PM -0400, J David wrote: > On Thu, Jul 4, 2013 at 12:15 PM, Dimitry Andric wrote: > > > One other thing: which type of file system are you using for /usr/obj, or > > wherever you pointed $MAKEOBJDIRPREFIX? > > > > Also ZFS-over-NFS. The goal is to build on one ma

Re: FreeBSD-9.1: machine reboots during snapshot creation, LORs found

2013-07-07 Thread Konstantin Belousov
On Sun, Jul 07, 2013 at 09:25:53AM +0200, Andre Albsmeier wrote: > OK, here we go (looks better now): > > GNU gdb 6.1.1 [FreeBSD] > Copyright 2004 Free Software Foundation, Inc. > GDB is free software, covered by the GNU General Public License, and you are > welcome to change it and/or distribute

Re: Shutdown hangs on unmount of a gjournaled file system in 8-Stable

2013-07-07 Thread Konstantin Belousov
On Mon, Jul 08, 2013 at 12:26:43AM +0200, Andreas Longwitz wrote: > The deadlock can be explained now: pid 1 (init) sleeps on "mount drain" > because mp->mnt_lockref was 1. This setting was done by pid 18 (gjournal > switcher) by calling vfs_busy(). pid 18 now sleeps on "suspwt" because > mp->mnt_w

Re: Weird regex behavior on 9.1-RELEASE on amd64 in 32-bit mode

2013-07-09 Thread Konstantin Belousov
On Tue, Jul 09, 2013 at 08:08:22PM +0200, Michael Sperber wrote: > > I noticed that scsh (which only runs in 32-bit mode) fails on amd64. I > narrowed it down to a regex malfunction (I think). This program: > > snip > #include > #include > > int > main(void) > { > regex_t r; > in

Re: FreeBSD-9.1: machine reboots during snapshot creation, LORs found

2013-07-11 Thread Konstantin Belousov
On Fri, Jul 12, 2013 at 07:24:40AM +0200, Andre Albsmeier wrote: > On Thu, 04-Jul-2013 at 19:25:28 +0200, Konstantin Belousov wrote: > > On Thu, Jul 04, 2013 at 04:29:19PM +0200, Andre Albsmeier wrote: > > > OK, patch is applied. I will reboot the machine later > > > an

Re: FreeBSD-9.1: machine reboots during snapshot creation, LORs found

2013-07-11 Thread Konstantin Belousov
On Fri, Jul 12, 2013 at 08:05:27AM +0200, Andre Albsmeier wrote: > On Fri, 12-Jul-2013 at 08:01:12 +0200, Konstantin Belousov wrote: > > On Fri, Jul 12, 2013 at 07:24:40AM +0200, Andre Albsmeier wrote: > > > On Thu, 04-Jul-2013 at 19:25:28 +0200, Konstantin Belousov wrote: >

Re: (9.2) panic under disk load (gam_server / knlist_remove_kq)

2013-07-15 Thread Konstantin Belousov
On Mon, Jul 15, 2013 at 06:50:09PM +0200, Patrick Lamaiziere wrote: > On Mon, 15 Jul 2013 16:26:47 +0200, > Mateusz Guzik wrote: > > Hello, > > > > > I'm seeing a panic while trying to build a poudriere repository. > > > > > > > > As far as I can see it always happens when gam_server is started

Re: (9.2) panic under disk load (gam_server / knlist_remove_kq)

2013-07-16 Thread Konstantin Belousov
On Tue, Jul 16, 2013 at 10:14:36PM +0200, Patrick Lamaiziere wrote: > On Tue, 16 Jul 2013 09:05:55 +0300, > Konstantin Belousov wrote: > > Hello, > > Thanks Konstantin. I'm trying your patch and that looks better. > poudriere has been running for 3 hours now (before the box p

Re: syncer causing latency spikes

2013-07-17 Thread Konstantin Belousov
On Wed, Jul 17, 2013 at 02:07:55PM -0400, Mark Johnston wrote: > Hello, > > I'm trying to investigate and solve some postgres latency spikes that > I'm seeing as a result of some behaviour in the syncer. This is with > FreeBSD 8.2 (with some local modifications and backports, r231160 in > particul

Re: FreeBSD-9.1: machine reboots during snapshot creation, LORs found

2013-07-19 Thread Konstantin Belousov
On Fri, Jul 19, 2013 at 07:32:47AM +0200, Andre Albsmeier wrote: > On Thu, 04-Jul-2013 at 19:25:28 +0200, Konstantin Belousov wrote: > > Patch is to improve debugging. > > > > I probably commit it after the issue is closed. Arguments against > > the commit is th

Re: stopping amd causes a freeze

2013-07-22 Thread Konstantin Belousov
On Mon, Jul 22, 2013 at 11:50:24AM +0200, Dominic Fandrey wrote: > Occasionally stopping amd freezes my system. It's a rare occurrence, > and I haven't found a reliable way to reproduce it. > > It's also a real freeze, so there's no way to get into the debugger > or grab a core dump. I only can pe

Re: stopping amd causes a freeze

2013-07-25 Thread Konstantin Belousov
On Thu, Jul 25, 2013 at 09:56:59AM +0200, Dominic Fandrey wrote: > On 22/07/2013 12:07, Konstantin Belousov wrote: > > On Mon, Jul 22, 2013 at 11:50:24AM +0200, Dominic Fandrey wrote: > >> ... > >> > >> I run amd through sysutils/automounter, which is a scrip

Re: NFS deadlock on 9.2-Beta1

2013-07-27 Thread Konstantin Belousov
On Sat, Jul 27, 2013 at 04:20:49PM -0400, Rick Macklem wrote: > Michael Tratz wrote: > > > > On Jul 24, 2013, at 5:25 PM, Rick Macklem > > wrote: > > > > > Michael Tratz wrote: > > >> Two machines (NFS Server: running ZFS / Client: disk-less), both > > >> are > > >> running FreeBSD r253506. The

Re: stopping amd causes a freeze

2013-07-27 Thread Konstantin Belousov
On Sat, Jul 27, 2013 at 10:33:18AM +0200, Dominic Fandrey wrote: > On 26/07/2013 19:10, Dominic Fandrey wrote: > > On 25/07/2013 12:00, Konstantin Belousov wrote: > >> On Thu, Jul 25, 2013 at 09:56:59AM +0200, Dominic Fandrey wrote: > >>> On 22/07/2013 12:07, Konsta

Re: NFS deadlock on 9.2-Beta1

2013-07-27 Thread Konstantin Belousov
On Sat, Jul 27, 2013 at 03:13:05PM -0700, Michael Tratz wrote: > Let's assume the pid which started the deadlock is 14001 (it will be a > different pid when we get the results, because the machine has been restarted) > > I type: > > show proc 14001 > > I get the thread numbers from that output

Re: [releng_9 tinderbox] failure on amd64/amd64

2013-08-07 Thread Konstantin Belousov
On Wed, Aug 07, 2013 at 01:09:08PM +, FreeBSD Tinderbox wrote: > /src/sys/amd64/amd64/machdep.c: In function 'db_show_sysregs': > /src/sys/amd64/amd64/machdep.c:1226: error: 'MSR_IA32_FEATURE_CONTROL' > undeclared (first use in this function) Should be fixed with the r254066, sorry for the br

Re: NFS deadlock on 9.2-Beta1

2013-08-21 Thread Konstantin Belousov
On Tue, Aug 20, 2013 at 06:18:16PM -0400, Rick Macklem wrote: > J David wrote: > > On Thu, Aug 15, 2013 at 5:39 PM, Rick Macklem > > wrote: > > > Have you been able to pass the debugging info on to Kostik? > > > > > > It would be really nice to get this fixed for FreeBSD9.2. > > > > You're probab

Re: NFS deadlock on 9.2-Beta1

2013-08-21 Thread Konstantin Belousov
On Wed, Aug 21, 2013 at 08:03:35PM +0200, Yamagi Burmeister wrote: > Could the problem be related to this deadlock / LOR? - > http://lists.freebsd.org/pipermail/freebsd-fs/2013-August/018052.html This is not related. > > My test setup is still in place. Will test with r250907 reverted > tomorrow

Re: NFS deadlock on 9.2-Beta1

2013-08-22 Thread Konstantin Belousov
On Wed, Aug 21, 2013 at 09:08:10PM -0400, Rick Macklem wrote: > Kostik wrote: > > On Tue, Aug 20, 2013 at 06:18:16PM -0400, Rick Macklem wrote: > > > J David wrote: > > > > On Thu, Aug 15, 2013 at 5:39 PM, Rick Macklem > > > > > > > > wrote: > > > > > Have you been able to pass the debugging info

Re: NFS deadlock on 9.2-Beta1

2013-08-24 Thread Konstantin Belousov
On Sat, Aug 24, 2013 at 01:08:05PM -0400, J David wrote: > The requested information about the deadlock was finally obtained and > provided off-list to the requested parties due to size. Thank you, the problem is clear now. The problematic process backtrace is Tracing command httpd pid 86383 tid

Re: NFS deadlock on 9.2-Beta1

2013-08-24 Thread Konstantin Belousov
On Sat, Aug 24, 2013 at 02:03:50PM -0400, J David wrote: > On Sat, Aug 24, 2013 at 1:41 PM, Konstantin Belousov > wrote: > > I think the easiest route is to a partial merge of the r253927 from HEAD. > > Is it helpful if we restart testing releng/9.2 using your suggested >

Re: NFS deadlock on 9.2-Beta1

2013-08-24 Thread Konstantin Belousov
On Sat, Aug 24, 2013 at 04:11:09PM -0400, J David wrote: > On Sat, Aug 24, 2013 at 3:38 PM, Konstantin Belousov > wrote: > > No, at least not without reverting the r254754 first. The IGN_SBUSY patch > > is not critical there. > > There is lots of other stuff in r2509

Re: Stack overflow with kernel r254683

2013-08-26 Thread Konstantin Belousov
On Mon, Aug 26, 2013 at 07:11:48PM -0400, Rick Macklem wrote: > Matthias Schuendehuette wrote: > > Hello, > > > > yesterday I got a kernel crash on my server (a ProLiant DL380 G5): > > > > "panic: stack overflow detected; backtrace may be corrupted" > > > > Kernel is "9.2-PRERELEASE FreeBSD 9.2-

Re: another? NFS deadlock on 9.2-PRERELEASE

2013-08-27 Thread Konstantin Belousov
On Tue, Aug 27, 2013 at 05:00:14PM +0300, Daniel Braniss wrote: > > Daniel Braniss wrote: > > > > Daniel Braniss wrote: > > > > > I upgraded our web server, and only after 3 hours it hung :-( > > > > > (as a side note, I have 2 other web servers, also running 9.2 > > > > > doing > > > > > great :-)

Re: 9-STABLE panic on intensive fork

2013-08-29 Thread Konstantin Belousov
On Wed, Aug 28, 2013 at 06:20:29PM +0400, Dmitry Sivachenko wrote: > Hello! > > I am using very recent FreeBSD-9-STABLE snapshot: > 9.2-PRERELEASE FreeBSD 9.2-PRERELEASE #0 r254986: Wed Aug 28 17:18:57 MSK 2013 > > I run uwsgi program (ports/www/uwsgi) on that machine. > > When uwsgi starts, it

Re: 9.2-STABLE: supervisor read data, page not present

2013-09-08 Thread Konstantin Belousov
On Sun, Sep 08, 2013 at 11:40:01PM +0200, Oliver Pinter wrote: > #6 0x806a2ab3 in cpuctl_ioctl (dev=, > cmd=, data=, flags=0, > td=) at /usr/src/sys/dev/cpuctl/cpuctl.c:478 > cpu = > ret = Do you indeed possess VIA CPU ? Was it due to some violence act ? I am not

Re: 9.2-STABLE: supervisor read data, page not present

2013-09-09 Thread Konstantin Belousov
On Mon, Sep 09, 2013 at 08:45:21AM +0200, Oliver Pinter wrote: > On 9/9/13, Konstantin Belousov wrote: > > On Sun, Sep 08, 2013 at 11:40:01PM +0200, Oliver Pinter wrote: > >> #6 0x806a2ab3 in cpuctl_ioctl (dev=, > >> cmd=, data=, flags=0, > >>

Re: 9.2-STABLE: supervisor read data, page not present

2013-09-09 Thread Konstantin Belousov
On Mon, Sep 09, 2013 at 05:39:22PM +0200, Oliver Pinter wrote: > Thanks, this patch fixed the issue. Which issue among the two problems you reported ? I assume that cpuctl panic is gone, but what about proc_reap issue ?

Re: Possible kqueue related issue on STABLE/RC.

2013-09-11 Thread Konstantin Belousov
On Wed, Sep 11, 2013 at 05:07:10PM +0200, Jimmy Olgeni wrote: > - However, this time I managed to get a minidump from the old -STABLE. I > saved it here: > > http://olgeni.olgeni.com/~olgeni/core.txt.0 Could you list the lines around the vfs_subr.c:4591 in your kernel ? Also, do you h

Re: Possible kqueue related issue on STABLE/RC.

2013-09-12 Thread Konstantin Belousov
On Wed, Sep 11, 2013 at 11:18:34PM +0200, Jimmy Olgeni wrote: > > Hi, > > On Wed, 11 Sep 2013, Konstantin Belousov wrote: > > > Also, do you have all options listed at > > http://www.freebsd.org/doc/en_US.ISO8859-1/books/developers-handbook/kerneldebug-deadlocks.h

Re: Possible kqueue related issue on STABLE/RC.

2013-09-12 Thread Konstantin Belousov
On Wed, Sep 11, 2013 at 10:32:31PM +0200, Jimmy Olgeni wrote: > > Hi, > > On Wed, 11 Sep 2013, Konstantin Belousov wrote: > > > Could you list the lines around the vfs_subr.c:4591 in your kernel ? > > 4591 is the VI_LOCK(vp) in filt_vfsvnode: > > static

Re: Possible kqueue related issue on STABLE/RC.

2013-09-12 Thread Konstantin Belousov
On Thu, Sep 12, 2013 at 08:28:48PM +0200, Jimmy Olgeni wrote: > > On Thu, 12 Sep 2013, Konstantin Belousov wrote: > > > Might be, your issue is that some filesystems do not care about proper > > locking mode for the fifos. UFS carefully disables shared locking for > >

Re: Possible kqueue related issue on STABLE/RC.

2013-09-12 Thread Konstantin Belousov
On Fri, Sep 13, 2013 at 12:40:28AM +0300, Andriy Gapon wrote: > on 12/09/2013 21:49 Konstantin Belousov said the following: > > Ok, so it is ZFS indeed. I think I will commit the band-aid to head > > shortly. > > I am not sure if my message <5231a016.7060...@freebsd.org>

Re: Possible kqueue related issue on STABLE/RC.

2013-09-23 Thread Konstantin Belousov
On Mon, Sep 23, 2013 at 03:37:08PM +0200, Patrick Lamaiziere wrote: > On Fri, 20 Sep 2013 15:17:05 +0200, > Patrick Lamaiziere wrote: > > > On Thu, 12 Sep 2013 10:36:43 +0300, > > Konstantin Belousov wrote: > > > > Hello, > > > > > Migh

Re: Possible kqueue related issue on STABLE/RC.

2013-09-24 Thread Konstantin Belousov
On Tue, Sep 24, 2013 at 09:44:27AM +0200, Patrick Lamaiziere wrote: > On Mon, 23 Sep 2013 23:31:41 +0300, > Konstantin Belousov wrote: > > Hello, > > ... > > > > > Ok This has been mfced to 9.2-STABLE. But I still see this panic > > > with 9-2/STA

Re: Possible kqueue related issue on STABLE/RC.

2013-09-24 Thread Konstantin Belousov
On Tue, Sep 24, 2013 at 11:47:38AM +0200, Patrick Lamaiziere wrote: > On Tue, 24 Sep 2013 11:29:09 +0300, > Konstantin Belousov wrote: > > Hello, > > ... > > > > > > Ok This has been mfced to 9.2-STABLE. But I still see this panic > > > > >

Re: Possible kqueue related issue on STABLE/RC.

2013-09-24 Thread Konstantin Belousov
On Tue, Sep 24, 2013 at 10:45:17AM -0700, John-Mark Gurney wrote: > I'd like to understand why you think protecting these functions w/ > the _DETACHED check is correct... In kern_event.c, all calls to > f_detach are followed by knote_drop which will ensure that the knote > is removed and free, so

Re: Possible kqueue related issue on STABLE/RC.

2013-09-25 Thread Konstantin Belousov
On Wed, Sep 25, 2013 at 09:58:05AM +0200, Patrick Lamaiziere wrote: > On Wed, 25 Sep 2013 00:21:27 +0300, > Konstantin Belousov wrote: > > Hello, > > > On Tue, Sep 24, 2013 at 10:45:17AM -0700, John-Mark Gurney wrote: > > > I'd like to understand why you

Re: Possible kqueue related issue on STABLE/RC.

2013-09-25 Thread Konstantin Belousov
On Wed, Sep 25, 2013 at 09:19:54AM -0700, John-Mark Gurney wrote: > Konstantin Belousov wrote this message on Wed, Sep 25, 2013 at 00:21 +0300: > > On Tue, Sep 24, 2013 at 10:45:17AM -0700, John-Mark Gurney wrote: > > > I'd like to understand why you think protecting these

Re: 9.2-RC4 amd64 panic: vm_page_unwire

2013-09-27 Thread Konstantin Belousov
On Fri, Sep 27, 2013 at 10:07:28AM +1000, John Marshall wrote: > I'm running 9.2-RC4 on a handful of desktop and server machines (both > i386 and amd64). I have seen three panics (all vm_page_unwire) on one > of those systems only (amd64 server) during the past week. > > The first two panics were

Re: Device timeout from mfi(9) while booting 9.2-RELEASE

2013-10-03 Thread Konstantin Belousov
On Thu, Oct 03, 2013 at 11:50:15AM -0400, Ryan Stone wrote: > We're having trouble booting 9.2-RELEASE on a system with an mfi > (Drake Skinny) controller. When it tries to mount root we get: > > mfi0: COMMAND 0xff8000cb7220 TIMEOUT AFTER 59 SECONDS > mfi0: COMMAND 0xff8000cb72a8 TIMEOUT

Re: Device timeout from mfi(9) while booting 9.2-RELEASE

2013-10-03 Thread Konstantin Belousov
On Fri, Oct 04, 2013 at 11:23:45AM +1000, Jan Mikkelsen wrote: > Hi, > > We have Drake Skinny (9240-4i and 9240-8i) cards running, most recently with > 9.2-RC4. We'll be testing 9.2-RELEASE in the next week or two. > > We have seen problems with older versions of the firmware, and interactions

Re: Device timeout from mfi(9) while booting 9.2-RELEASE

2013-10-03 Thread Konstantin Belousov
On Fri, Oct 04, 2013 at 03:53:10PM +1000, Jan Mikkelsen wrote: > >> We have Drake Skinny (9240-4i and 9240-8i) cards running, most recently > >> with 9.2-RC4. We'll be testing 9.2-RELEASE in the next week or two. > >> > >> We have seen problems with older versions of the firmware, and > >> inter

Re: 9.2-RC4 amd64 panic: vm_page_unwire

2013-10-05 Thread Konstantin Belousov
On Sat, Oct 05, 2013 at 05:16:08PM +1000, John Marshall wrote: Since you have a will to play with this, please try to bisect the stable to see which commit introduced the regression (you said that 9.1 does not panic).

Re: Suspected libkvm infinite loop

2015-03-12 Thread Konstantin Belousov
On Wed, Mar 11, 2015 at 09:34:07PM -0700, Mark Johnston wrote: > On Thu, Mar 12, 2015 at 02:05:32PM +1000, Nick Frampton wrote: > > On 12/03/15 00:38, John Baldwin wrote: > > >>> It sounds like this issue might be the one fixed in r272566: if the > > >>> > >KERN_PROC_ALL sysctl is read with an insu

Re: Significant memory leak in 9.3p10?

2015-03-16 Thread Konstantin Belousov
On Mon, Mar 16, 2015 at 06:59:33PM -0400, J David wrote: > Recently we have seen a large-scale memory leak on amd64 machines > running FreeBSD 9.3-RELEASE-p10. > > This was first observed on 9.3p2 but has since shown up all the way through > p10. > > Here's what the header of top shows: > > las

Re: RELENG_10 performance regression (was Re: 35-40% performance drop releng9 vs releng10 openvpn

2015-03-20 Thread Konstantin Belousov
On Fri, Mar 20, 2015 at 04:53:42PM -0700, Xin Li wrote: > -----BEGIN PGP SIGNED MESSAGE----- > Hash: SHA512 > > On 03/20/15 14:02, Mike Tancsa wrote: > > OK, I think I found where the RELENG_10 performance loss happened. > > It seems > > https://lists.freebsd.org/pipermail/svn-src-stable-10/2015-

Re: RELENG_10 performance regression (was Re: 35-40% performance drop releng9 vs releng10 openvpn

2015-03-21 Thread Konstantin Belousov
On Sat, Mar 21, 2015 at 02:13:06PM -0400, Mike Tancsa wrote: > On 3/21/2015 11:52 AM, John Baldwin wrote: > > >> http://tancsa.com/time/ > > > > Do you know why you are using the HPET instead of TSC for timestamping? > > Hi, > > I am not consciously making any time keep decisions. > > kern.even

Re: Significant memory leak in 9.3p10?

2015-03-27 Thread Konstantin Belousov
On Thu, Mar 26, 2015 at 03:46:05PM -0400, J David wrote: > On Mon, Mar 16, 2015 at 7:52 PM, J David wrote: > > On Mon, Mar 16, 2015 at 7:24 PM, Konstantin Belousov > > wrote: > >> There are a lot of possibilities to create persistent anonymous shared > >> memo

Re: protecting some processes from out-of-swap killer

2015-04-25 Thread Konstantin Belousov
On Sat, Apr 25, 2015 at 01:31:14PM +0300, Dmitry Morozovsky wrote: > Hi there colleagues, > > I have stable/10 on a rather big machine (2*8*2 e5 Xeon, 64G RAM, SAS+SSD ZFS > raid10+ZIL+L2ARC) acting as a PostgreSQL server. 64G RAM is not big by modern standards, normally populated desktop can ta

Re: Many core dumps in pthread_getspecific.

2015-06-03 Thread Konstantin Belousov
On Wed, Jun 03, 2015 at 01:34:04PM +0200, Andre Meiser wrote: > Hi, > > I'm new to FreeBSD and I'm running into a lot of core dumps (mainly from > Xorg, but also from vim, firefox, ...). > That's why I asked on freebsd-questions: > > https://lists.freebsd.org/pipermail/freebsd-questions/2015-Jun

Re: Many core dumps in pthread_getspecific.

2015-06-06 Thread Konstantin Belousov
On Sat, Jun 06, 2015 at 09:50:22AM +0200, Andre Meiser wrote: > On Wed, Jun 03, 2015 at 16:58 +0200, Konstantin Belousov wrote: > > You should recompile both libc and libthr with debugging symbols, like > > cd /usr/src > > (cd lib/libc && make all install DEBUG_FLAGS=-

Re: Re: Many core dumps in pthread_getspecific.

2015-06-14 Thread Konstantin Belousov
On Fri, Jun 12, 2015 at 12:03:16PM +0200, Andre Meiser wrote: > Hi, > > On Wed, Jun 03, 2015 at 16:58 +0200, Konstantin Belousov wrote: > > You should recompile both libc and libthr with debugging symbols, like > > cd /usr/src > > (cd lib/libc && make all

Re: Many core dumps in pthread_getspecific.

2015-06-16 Thread Konstantin Belousov
On Tue, Jun 16, 2015 at 09:09:25AM +0200, Andre Meiser wrote: > On Sun, Jun 14, 2015 at 21:05 +0200, Konstantin Belousov wrote: > > Show me the 'ldd vim' output and output from the > > readelf -d vim | grep NEEDED. > > % ldd /usr/local/bin/vim > /usr/local/

Re: panic: wm_page_unwire

2015-06-20 Thread Konstantin Belousov
On Sat, Jun 20, 2015 at 10:23:39AM +0300, Dmitry Sivachenko wrote: > > > On 19 June 2015, at 22:57, Dmitry Sivachenko wrote: > > > > Hello, > > > > got this panic today on my 10.1-STABLE #0 r279956 box: > > > > > > > Well, I tracked this down a bit. Rather easy way to panic -stable box

Re: Many core dumps in pthread_getspecific.

2015-07-03 Thread Konstantin Belousov
On Fri, Jul 03, 2015 at 05:21:50PM +0200, Andre Meiser wrote: > Hi, > > back again. Sorry, I accidentally deleted the core file and I had to wait two > weeks until vim crashed again. Xorg hasn't crashed so far with the debug libs. > > On Tue, Jun 16, 2015 at 09:36 +020

Re: amd64 kernel dynamic linking allows extern references to statics

2015-07-15 Thread Konstantin Belousov
On Wed, Jul 15, 2015 at 06:17:20PM +1000, Jan Mikkelsen wrote: > Hi, > > (All on 10.2-BETA1.) > > I noticed that the latest patch in the bug > https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=187594 > works on amd64 but > fails to loa

Re: amd64 kernel dynamic linking allows extern references to statics

2015-07-16 Thread Konstantin Belousov
On Thu, Jul 16, 2015 at 09:18:15AM +1000, Jan Mikkelsen wrote: > > > On 15 Jul 2015, at 11:27 pm, Konstantin Belousov > > wrote: > > > > On Wed, Jul 15, 2015 at 06:17:20PM +1000, Jan Mikkelsen wrote: > >> Hi, > >> > >> (All on 10.2-BETA1.)

Re: Many core dumps in pthread_getspecific.

2015-07-19 Thread Konstantin Belousov
On Wed, Jul 15, 2015 at 06:32:13PM +0200, Andre Meiser wrote: > Hi, > > no crash from vim or Xorg but from xterm and again at getcontext(uc) after > alloca: > > % readelf -d xterm | grep NEEDED > 0x0001 (NEEDED) Shared library: [libXinerama.so.1] > 0x0001 (N

Re: Many core dumps in pthread_getspecific.

2015-07-22 Thread Konstantin Belousov
On Wed, Jul 22, 2015 at 11:46:35AM +0200, Andre Meiser wrote: > On Sun, Jul 19, 2015 at 22:57 +0200, Konstantin Belousov wrote: > > It seems that besides sigreturn(), ucontext symbols must be pre-resolved > > as well. Try this update (it includes the previous change). > > th

Re: 10.2: ntp update breaks DCF77 clock

2015-08-17 Thread Konstantin Belousov
On Sun, Aug 16, 2015 at 10:49:57PM -0700, Cy Schubert wrote: > qemu-sbruno) doesn't support all our supported platforms, especially the > multitude of ARM platforms, so holes in our auto-generated config.h support > will exist. I believe that the userspace arm ABI is not that variable. There co

Re: ia64 regression from r285809 to 286952: kernel does not boot

2015-08-23 Thread Konstantin Belousov
On Sun, Aug 23, 2015 at 03:51:58PM +0100, Anton Shterenlikht wrote: > I'm not sure if ia64 list is still operational, so > post here. > > ia64 r285809 10-STABLE works fine. > I updated to r286952, buildworld, buildkernel, installkernel, > reboot, boot -s, and I get stuck with: > > FreeBSD/ia64 EF

Re: ia64 stable/10 r286316: hang at Entering /boot/kernel/kernel

2015-08-27 Thread Konstantin Belousov
On Thu, Aug 27, 2015 at 01:12:16PM +0100, Anton Shterenlikht wrote: > ia64 stable/10 r286315 boots, but > r286316 hangs at "Entering /boot/kernel/kernel". > > Please advise To state an obvious thing. The commit which you pointed to, changes the code which is not executed at that early kernel bo

Re: Latest stable (r287104) bash leaves zombies on exit

2015-08-27 Thread Konstantin Belousov
On Thu, Aug 27, 2015 at 02:04:05PM +0200, Mark Martinec wrote: > Pete French wrote: > > > I updated to stable yesterday, plus updated all my ports to > > the latest precompiled packages, but I am now seeing odd problems > > with bash on exit. Sometimes it quits, but leaves a zombie > > process... e

Re: Latest stable (r287104) bash leaves zombies on exit

2015-08-27 Thread Konstantin Belousov
On Thu, Aug 27, 2015 at 06:06:26PM +0100, Pete French wrote: > > Try to obtain the backtrace from the hung xterm. Ideally, you would > > rebuild xterm and the system libraries (rtld+libc+libthr) with debug > > symbols and get the backtraces after that. > > I can try this tomorrow - what do I need

Re: Latest stable (r287104) bash leaves zombies on exit

2015-08-27 Thread Konstantin Belousov
On Thu, Aug 27, 2015 at 08:53:09PM +0200, Michiel Boland wrote: > The xterm program has a SIGCHLD signal handler that calls wait(). > If the handler is invoked while xterm is exiting, a deadlock occurs in rtld. > > Cheers > Michiel > > #0 _umtx_op_err () at /usr/src/lib/libthr/arch/amd64/amd64/_
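
For readers unfamiliar with the pattern described in the message above, here is a minimal sketch of a SIGCHLD handler, installed with signal(), that reaps exited children. It is not xterm's code, which the message does not show, and it does not reproduce the rtld deadlock; the names on_sigchld, children_reaped and spawn_and_reap are hypothetical and exist only for this illustration.

```c
#include <errno.h>
#include <signal.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <time.h>
#include <unistd.h>

/* Number of children reaped by the handler (hypothetical name). */
static volatile sig_atomic_t children_reaped = 0;

/* SIGCHLD handler: reap every exited child without blocking. */
static void on_sigchld(int signo)
{
    int saved_errno = errno;    /* waitpid() may clobber errno */

    (void)signo;
    while (waitpid(-1, NULL, WNOHANG) > 0)
        children_reaped++;
    errno = saved_errno;
}

/*
 * Hypothetical helper: install the handler with signal(), fork one child
 * that exits at once, and wait (bounded, about 5 seconds) until the
 * handler has reaped it. Returns the number of children reaped, or -1
 * on error.
 */
int spawn_and_reap(void)
{
    struct timespec pause_time = { 0, 10 * 1000 * 1000 };   /* 10 ms */
    pid_t pid;
    int tries;

    children_reaped = 0;
    if (signal(SIGCHLD, on_sigchld) == SIG_ERR)
        return -1;

    pid = fork();
    if (pid < 0)
        return -1;
    if (pid == 0)
        _exit(0);               /* child: exit immediately */

    /* nanosleep() may return early on SIGCHLD; the loop rechecks. */
    for (tries = 0; tries < 500 && children_reaped == 0; tries++)
        nanosleep(&pause_time, NULL);

    return children_reaped;
}
```

The handler only calls waitpid(), which POSIX lists as async-signal-safe. The problem discussed in this thread concerns what happens around such a handler (symbol resolution in rtld while the process is exiting), not the reaping loop itself.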

Re: Latest stable (r287104) bash leaves zombies on exit

2015-08-28 Thread Konstantin Belousov
On Fri, Aug 28, 2015 at 08:08:27AM +0200, Michiel Boland wrote: > On 08/27/2015 22:16, Konstantin Belousov wrote: > [...] > > I just verified that the signal handler is correctly wrapped for me, on > > the latest stable/10. Both with the pre-linked libthr.so and with the

Re: ia64 stable/10 r286316: hang at Entering /boot/kernel/kernel

2015-08-28 Thread Konstantin Belousov
On Fri, Aug 28, 2015 at 11:30:18AM +0100, Anton Shterenlikht wrote: > >From kostik...@gmail.com Thu Aug 27 18:22:37 2015 > > > >On Thu, Aug 27, 2015 at 01:12:16PM +0100, Anton Shterenlikht wrote: > >> ia64 stable/10 r286315 boots, but > >> r286316 hangs at "Entering /boot/kernel/kernel". > >> > >

Re: Latest stable (r287104) bash leaves zombies on exit

2015-08-28 Thread Konstantin Belousov
On Fri, Aug 28, 2015 at 05:52:42PM +0200, Michiel Boland wrote: > set -e > for a in `seq 1000` > do > echo -n "$a " > xterm -e ssh nonexisting > done > echo "" > > (The idea here is that 'ssh nonexisting' should do some work and then exit, > "xterm -e false", etc. don't appear to trigger the bug.

Re: Latest stable (r287104) bash leaves zombies on exit

2015-08-29 Thread Konstantin Belousov
On Sat, Aug 29, 2015 at 01:43:36PM +0200, Michiel Boland wrote: > Do I understand correctly that the problem is that if you install a signal > handler with signal() (which is what xterm does) and pull in libthr.so > somehow, > then there is no thr_sighandler inserted? Yes. The problem does not

Re: Latest stable (r287104) bash leaves zombies on exit

2015-08-29 Thread Konstantin Belousov
On Sat, Aug 29, 2015 at 03:01:38PM +0200, Jilles Tjoelker wrote: > Looks good to me, except that I think a vforked child (in system() and > posix_spawn*()) should use the system calls and not libthr's wrappers. > This reduces the probability of weird things happening between vfork and > exec, and a

Re: Latest stable (r287104) bash leaves zombies on exit

2015-08-29 Thread Konstantin Belousov
On Sat, Aug 29, 2015 at 04:02:43PM +0200, Michiel Boland wrote: > I verified the patch. The getumask part of lib/libc/gen/setmode.c was > rejected on stable/10 (probably due to other changes in ^/head.) Thank you. The setmode bits are from Jilles' r280713. I will merge this revision wh

Re: pthread_cancel / sleep change in behaviour

2015-09-03 Thread Konstantin Belousov
On Thu, Sep 03, 2015 at 08:00:47AM +0800, Erich Dollansky wrote: > Hi, > > I noticed a change in behaviour when calling pthread_cancel for a > thread which is sleeping using e.g. sleep(). > > As I understand pthread_cancel, it should cancel the thread and call > the clean up handler soon after
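The behaviour Erich expects can be sketched as follows: a thread blocks in sleep(), which POSIX lists as a cancellation point, another thread calls pthread_cancel(), and the cleanup handler runs before the join completes. This is an illustration of the expected semantics only, not the test program from the thread. The names cleanup, sleeper and cancel_sleeping_thread are hypothetical.

```c
#include <pthread.h>
#include <stddef.h>
#include <unistd.h>

/* Set by the cleanup handler; read only after pthread_join() returns. */
static int cleanup_ran;

/* Hypothetical cleanup handler: records that it was called. */
static void cleanup(void *arg)
{
    (void)arg;
    cleanup_ran = 1;
}

/* Thread body: sleeps far longer than the caller is prepared to wait. */
static void *sleeper(void *arg)
{
    (void)arg;
    pthread_cleanup_push(cleanup, NULL);
    sleep(60);                  /* cancellation point */
    pthread_cleanup_pop(0);     /* reached only if not cancelled */
    return NULL;
}

/*
 * Hypothetical helper: returns 1 if the thread was cancelled and its
 * cleanup handler ran, 0 if not, -1 on error.
 */
int cancel_sleeping_thread(void)
{
    pthread_t td;
    void *result = NULL;

    cleanup_ran = 0;
    if (pthread_create(&td, NULL, sleeper, NULL) != 0)
        return -1;

    /* The request stays pending until the thread hits a cancellation point. */
    if (pthread_cancel(td) != 0)
        return -1;
    if (pthread_join(td, &result) != 0)
        return -1;

    return (result == PTHREAD_CANCELED && cleanup_ran) ? 1 : 0;
}
```

With default (deferred) cancellation, the call should return well before the 60 seconds elapse. Build with the platform's thread flag (for example -pthread) where it is required.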

Re: pthread_cancel / sleep change in behaviour

2015-09-03 Thread Konstantin Belousov
On Thu, Sep 03, 2015 at 06:33:53PM +0800, Erich Dollansky wrote: > Hi, > > On Thu, 3 Sep 2015 11:19:47 +0300 > Konstantin Belousov wrote: > > > On Thu, Sep 03, 2015 at 08:00:47AM +0800, Erich Dollansky wrote: > > > > > > Is this change of behaviour

Re: pthread_cancel / sleep change in behaviour

2015-09-04 Thread Konstantin Belousov
On Fri, Sep 04, 2015 at 08:37:40AM +0800, Erich Dollansky wrote: > Hi, > > On Thu, 3 Sep 2015 13:43:51 +0300 > Konstantin Belousov wrote: > > On Thu, Sep 03, 2015 at 06:33:53PM +0800, Erich Dollansky wrote: > > > On Thu, 3 Sep 2015 11:19:47 +0300 > > > Konsta

Re: pthread_cancel / sleep change in behaviour

2015-09-04 Thread Konstantin Belousov
On Fri, Sep 04, 2015 at 04:35:24PM +0800, Erich Dollansky wrote: > Hi, > > On Fri, 4 Sep 2015 11:18:57 +0300 > Konstantin Belousov wrote: > > > On Fri, Sep 04, 2015 at 08:37:40AM +0800, Erich Dollansky wrote: > > > On Thu, 3 Sep 2015 13:43:51 +0300

Re: loader lsdev crashes loader (Was: head -r338804 boots threadripper 1950X fine; head -r338810+ do not; -r338807 seems implicated)

2018-10-23 Thread Konstantin Belousov
On Tue, Oct 23, 2018 at 08:54:24AM -0600, Warner Losh wrote: > On Tue, Oct 23, 2018 at 5:54 AM Toomas Soome wrote: > > > > > > On 23 Oct 2018, at 13:53, Lev Serebryakov wrote: > > > > > > On 22.10.2018 12:27, Toomas Soome wrote: > > > > > >> It would help to get output from loader lsdev -v comma

Re: Possible memory leak in the kernel (contigmalloc)

2018-10-26 Thread Konstantin Belousov
On Wed, Oct 24, 2018 at 04:27:52PM +, Bennett, Ciunas wrote: > Hello, > > I have encountered an issue with a kernel application that I have > written, the issue might be caused by a memory leak in the kernel. > The application allocates and deallocates contiguous memory using > contigmalloc()

Re: Possible memory leak in the kernel (contigmalloc)

2018-10-30 Thread Konstantin Belousov
on, which cause creation of the inactive memory and unrecoverable fragmentation. Note that contigmalloc() tries to do defragmentation to satisfy requests, but this is not always possible. > Ciunas. > > -Original Message- > From: Konstantin Belousov [mailto:kostik...@gmail.com] > Se

Re: Possible memory leak in the kernel (contigmalloc)

2018-10-30 Thread Konstantin Belousov
er possibility is that the memory is the caching pages from vnodes, but for this buffers must be created and then reclaimed, which would suggest even more activity on the system. > Thanks. > > -Original Message- > From: Konstantin Belousov [mailto:kostik...@gmail.com] > Sent

Re: Panic (gpf) in early boot after upgrading FreeBSD 10.4 -> 11.2 on Ganeti

2018-11-11 Thread Konstantin Belousov
On Sun, Nov 11, 2018 at 11:24:48PM -0500, Rob Austein wrote: > Belated upgrade (don't ask) of a pair of FreeBSD 10.4 VMs to 11.2. > Each VM got as far as: > > freebsd-update -r 11.2-RELEASE update > freebsd-update install > reboot > > Each VM got an immediate kernel panic after the reboot (

Re: Trap 12 in vm_page_alloc_after()

2018-11-18 Thread Konstantin Belousov
On Sun, Nov 18, 2018 at 08:24:38PM -0500, Garrett Wollman wrote: > Has anyone seen this before? It's on a busy NFS server, but hasn't > been observed on any of our other NFS servers. > > > Fatal trap 12: page fault while in

Re: /dev/crypto not being used in 12-STABLE

2018-12-06 Thread Konstantin Belousov
On Thu, Dec 06, 2018 at 04:48:35PM -0700, John Nielsen wrote: > Is aesni(4) even required if all you want is userland acceleration? > No, it is not. Same for rdrand_rng(4), if an application uses hw random source directly.

Re: Address Collision using i386 4G/4G Memory Split

2018-12-17 Thread Konstantin Belousov
On Mon, Dec 17, 2018 at 02:51:48PM +0100, Alexander Lochmann wrote: > Hi folks! > > According to git commit e3089a (https://reviews.freebsd.org/D1463) > FreeBSD 12.0 i386 uses separate address spaces for kernel and user > space. So basically two memory areas, one in each space, can have the > same
