matt.connolly...@gmail.com said:
>                     extended device statistics              
>     r/s    w/s   kr/s   kw/s wait actv wsvc_t asvc_t  %w  %b device
>     1.2   36.0  153.6 4608.0  1.2  0.3   31.9    9.3  16  18 c12d0
>     0.0  113.4    0.0 7446.7  0.8  0.1    7.0    0.5  15   5 c8t0d0
>     0.2  106.4    4.1 7427.8  4.0  0.1   37.8    1.4  93  14 c8t1d0
>                     extended device statistics              
>     r/s    w/s   kr/s   kw/s wait actv wsvc_t asvc_t  %w  %b device
>     0.4   73.2   25.7 9243.0  2.3  0.7   31.6    9.8  34  37 c12d0
>     0.0  226.6    0.0 24860.5  1.6  0.2    7.0    0.9  25  19 c8t0d0
>     0.2  127.6    3.4 12377.6  3.8  0.3   29.7    2.2  91  27 c8t1d0
>                     extended device statistics              
>     r/s    w/s   kr/s   kw/s wait actv wsvc_t asvc_t  %w  %b device
>     0.0   44.2    0.0 5657.6  1.4  0.4   31.7    9.0  19  20 c12d0
>     0.2   76.0    4.8 9420.8  1.1  0.1   14.2    1.7  12  13 c8t0d0
>     0.0   16.6    0.0 2058.4  9.0  1.0  542.1   60.2 100 100 c8t1d0
>                     extended device statistics              
>     r/s    w/s   kr/s   kw/s wait actv wsvc_t asvc_t  %w  %b device
>     0.0    0.2    0.0   25.6  0.0  0.0    0.3    2.3   0   0 c12d0
>     0.0    0.0    0.0    0.0  0.0  0.0    0.0    0.0   0   0 c8t0d0
>     0.0   11.0    0.0 1365.6  9.0  1.0  818.1   90.9 100 100 c8t1d0 
> . . .

matt.connolly...@gmail.com said:
> I expect that the c8t0d0 WD Green is the lemon here and for some reason is
> getting stuck in periods where it can write no faster than about 2MB/s. Does
> this sound right? 

No, it's the opposite.  The drive sitting at 100%-busy, c8t1d0, while the
other drive is idle, is the sick one.  It's slower than the other, has 9.0
operations waiting (queued) to finish.  The other one is idle because it
has already finished the write activity and is waiting for the slow one
in the mirror to catch up.  If you run "iostat -xn" without the interval
argument, i.e. so it prints out only one set of stats, you'll see the
average performance of the drives since last reboot.  If the "asvc_t"
figure is significantly larger for one drive than the other, that's a
way to identify the one which has been slower over the long term.


> Secondly, what I wonder is why it is that the whole file system seems to hang
> up at this time. Surely if the other drive is doing nothing, a web page can
> be served by reading from the available drive (c8t1d0) while the slow drive
> (c8t0d0) is stuck writing slow. 

The available drive is c8t0d0 in this case.  However, if ZFS is in the
middle of a txg (ZFS transaction) commit, it cannot safely do much with
the pool until that commit finishes.  You can see that ZFS only lets 10
operations accumulate per drive (used to be 35), i.e. 9.0 in the "wait"
column, and 1.0 in the "actv" column, so it's kinda stuck until the
drive gets its work done.

Maybe the drive is failing, or maybe it's one of those with large sectors
that are not properly aligned with the on-disk partitions.

Regards,

Marion


_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Reply via email to