On Fri, Dec 29, 2017 at 4:37 AM, Chuck Silvers <c...@chuq.com> wrote:
> On Thu, Dec 28, 2017 at 05:34:27PM +0900, Ryota Ozaki wrote:
>> On Thu, Dec 28, 2017 at 5:05 PM, Tom Ivar Helbekkmo
>> <t...@hamartun.priv.no> wrote:
>> > Ryota Ozaki <ozak...@netbsd.org> writes:
>> >
>> >> I think the below patch fixes the above issue, but probably
>> >> there is a better solution.
>> >
>> > Looks like didn't -- it just changed it a little bit.  Just like the
>> > last time, the hang happened while reading email over IMAP, which
>> > exercises disk and network at the same time, while the machine was busy
>> > doing a parallellized system build in the background.  This time,
>> > though, I got a core dump.  Here's the hang (the active process on this
>> > CPU is the IMAP server):
>>
>> Oh, my patch failed to keep SPL at IPL_VM because mutex_exit
>> tries to restore an SPL where mutex_enter is called. So I had to
>> put splvm before mutex_enter. Could you try the 2nd patch:
>>   http://www.netbsd.org/~ozaki-r/fix-pool_catchup.diff
>
> let's not mix explicit spl* calls with mutexes... even if that works
> in this particular case, it doesn't seem like a good practice in general.
> I was hoping to commit a different fix for this yesterday but I ran out of 
> time.
> I should have time this evening to get back to this.

Yeah, so I hoped a better fix.

Anyway a fix for the issue was committed:
  http://cvsweb.netbsd.org/cgi-bin/cvsweb.cgi/src/sys/kern/subr_pool.c#rev1.220

Thanks,
  ozaki-r

Reply via email to