> On Nov 23 Roland Dreir sent a patch for interrupt handling, but it
 > doesn't apply on -current since the file rt2661.c changed slightly
 > a few weeks earlier (1.51, date: 2009/11/01).

two "e"s in "Dreier" :)

 > This patch just changes Roland's patch to update against rt2661.c
 > r1.51 from the OpenBSD repository instead of Roland's patch which
 > is against his private GIT repo.

Sorry about that... I was testing on a 4.5 box, so even though I had
the patch against -current, I sent the wrong (backported) one.

 > I've been running with this for just over a day, including some
 > time copying kernels and snaps both ways non-stop (after removing
 > the ifconfig down/up from crontab). It has locked up only twice in
 > 24 hrs, a definite improvement.

Thanks for testing and keeping this patch alive.  I would like to see
this comitted since I have multiple reports of this improving
stability for people, and also I think that it is pretty clearly
correct on a theoretical level too.  However I have not seen any
response from damien@ unfortunately.

In my setup (slow VIA mini-itx box used as an AP) I've not seen any
lockups with the patch applied.  Could you give a quick description of
your setup?  Are the lockups you see the same as before -- ie the
interface stops with "OACTIVE" set, and recovers if you do if config
up/down?  (Is that the problem you were having before?)

I sent another patch 
(http://www.mail-archive.com/t...@openbsd.org/msg01261.html)
that helps my setup a little more (avoids the "interface stays up but
no longer sends broadcasts or multicasts" problem I saw -- not sure
why exactly but avoiding sending the adapter garbage descriptors seems
like a good idea in any case).  You could try with that too and see if
it helps at all.

I do still see another problem that I have not figured out yet, namely
the ral interface on the AP stops sending for 20 or 30 seconds and
then recovers by itself.  If I run ping to the AP on a client box and
leave something like "tcpdump -i ral0 -n icmp" running on the AP, then
I see that requests continue to be received during the interruption,
but no replies are sent.  Also I can see that OACTIVE is not set
during the interruption.  But I don't know why this is happening yet.
Is it possible that this is what you're hitting too?

Thanks,
  Roland

Reply via email to