On 06/12/2012 03:57 PM, Sašo Kiselkov wrote:
> Seems the problem is somewhat more egregious than I thought. The xcall
> storm causes my network drivers to stop receiving IP multicast packets
> and subsequently my recording applications record bad data, so
> ultimately, this kind of isn't workable... I need to somehow resolve
> this... I'm running four on-board Broadcom NICs in an LACP
> aggregation. Any ideas on why this might be a side-effect? I'm really
> kind of out of ideas here...
> 
> Cheers,
> --
> Saso

Just as another data point, though I'm not sure how much use it will
be: I found (via arcstat.pl) that the storms always start when ARC
downsizing starts. E.g. I would see the following in
"./arcstat.pl 1":

    Time  read    dmis  dm%  pmis  pm%  mmis  mm%  arcsz     c
16:29:45    21       0    0     0    0     0    0   111G  111G
16:29:46     0       0    0     0    0     0    0   111G  111G
16:29:47     1       0    0     0    0     0    0   111G  111G
16:29:48     0       0    0     0    0     0    0   111G  111G
16:29:49    5K       0    0     0    0     0    0   111G  111G
  (this is where the problem starts)
16:29:50    36       0    0     0    0     0    0   109G  107G
16:29:51    51       0    0     0    0     0    0   107G  107G
16:29:52    10       0    0     0    0     0    0   107G  107G
16:29:53   148       0    0     0    0     0    0   107G  107G
16:29:54    5K       0    0     0    0     0    0   107G  107G
  (and after a while, around 10-15 seconds, it stops)

(I omitted the miss and miss% columns to make the rows fit).

During that time, the network stack is dropping input IP multicast UDP
packets like crazy, so I see my network input drop by about 30-40%.
Truly strange behavior...
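
To catch the moment the downsizing starts without watching the output by
hand, something like the following could flag every sample where the ARC
target size (the "c" column) drops. This is only a rough sketch: it
assumes the ten-column layout shown above, the helper names (parse_size,
find_shrinks) are made up for illustration, and the binary unit
multipliers are an assumption (only the relative change matters here).

```python
import sys

# Unit suffixes as printed in the arcsz/c columns.
# Assumption: binary multiples; only relative changes matter below.
UNITS = {"K": 1024, "M": 1024**2, "G": 1024**3, "T": 1024**4}


def parse_size(text):
    """Convert a value such as '111G' or '5K' to a number of bytes."""
    if text and text[-1] in UNITS:
        return float(text[:-1]) * UNITS[text[-1]]
    return float(text)


def find_shrinks(lines):
    """Yield (time, old_c, new_c) whenever the target size c drops."""
    prev_text = None
    for line in lines:
        fields = line.split()
        # Skip the header, blank lines and anything that is not a
        # ten-column sample starting with an HH:MM:SS timestamp.
        if len(fields) != 10 or fields[0].count(":") != 2:
            continue
        cur_text = fields[9]
        if prev_text is not None and parse_size(cur_text) < parse_size(prev_text):
            yield fields[0], prev_text, cur_text
        prev_text = cur_text


if __name__ == "__main__":
    # Reads arcstat-style samples from standard input.
    for when, old_c, new_c in find_shrinks(sys.stdin):
        print(f"{when}: c dropped from {old_c} to {new_c}")
```

Fed the samples above, this would report the drop from 111G to 107G at
16:29:50, which could then be lined up against the xcall and packet-drop
counters for the same second.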

Cheers,
--
Saso
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
