On Tue, Apr 26, 2011 at 4:59 PM, Richard Elling
<richard.ell...@gmail.com> wrote:
>
> On Apr 26, 2011, at 8:22 AM, Cindy Swearingen wrote:
>
>> Hi--
>>
>> I don't know why the spare isn't kicking in automatically, it should.
>
> This can happen if the FMA agents aren't working properly.
>
> FYI, in NexentaStor we have added a zfs-monitor FMA agent to check the
> health of disks in use for ZFS and notice when they are no longer responding
> to reads.

    I just recently (this past week) had a very similar failure. zpool
consisting of two raidz2 vdevs and two hot spare drives. Each raidz2
vdev consists of 10 drives (I know, not the best layout, but the
activity is large sequential writes and reads and we needed the
capacity). We had a drive fail in one of the vdevs and one of the hot
spares automatically went into action (the special spare device within
the vdev came into being and the hot spare drive resilvered). A short
time later a second drive in the same vdev failed. No action by any
hot spare. The system was running Solaris 10U8 with no additional
patches.

    I opened a case with Oracle and they told me that the hot spare
*should* have dealt with the second failure. We replaced the first
(hot spared) drive with zpool replace and it resilvered fine. Then we
replaced the second (non hot spared) drive with zpool replace and the
system hung. I suspected the mpt (multipathing) driver for the SATA
drives in the J4400, there have been some huge improvements in that
driver since 10U8. After rebooting the drive appeared replaced and was
resilvering.

    Oracle support chocked the hot spare issue up to an FMA problem
but could not duplicate it in the lab. We have since upgraded to 10U9
+ the latest CPU (April 2011) and are hoping both the hot spare issue
and the mpt drive issue are fixed.

-- 
{--------1---------2---------3---------4---------5---------6---------7---------}
Paul Kraus
-> Senior Systems Architect, Garnet River ( http://www.garnetriver.com/ )
-> Sound Coordinator, Schenectady Light Opera Company (
http://www.sloctheater.org/ )
-> Technical Advisor, RPI Players
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Reply via email to