Hi Toby,

You're right. The healthcheck would definitely find any issues. I
misinterpreted your comment to that effect as a question and didn't
quite latch on. A zpool MAID-mode with that healthcheck might also be
interesting on something like a Thumper for pure-archival, D2D backup
work. Would dramatically cut down on the power. What do y'all think?

Best Regards,
Jason

On 1/29/07, Toby Thain <[EMAIL PROTECTED]> wrote:

On 29-Jan-07, at 11:02 PM, Jason J. W. Williams wrote:

> Hi Guys,
>
> I seem to remember the Massive Array of Independent Disk guys ran into
> a problem I think they called static friction, where idle drives would
> fail on spin up after being idle for a long time:

You'd think that probably wouldn't happen to a spare drive that was
spun up from time to time. In fact this problem would be (mitigated
and/or) caught by the periodic health check I suggested.

--T

> http://www.eweek.com/article2/0,1895,1941205,00.asp
>
> Would that apply here?
>
> Best Regards,
> Jason
>
> On 1/29/07, Toby Thain <[EMAIL PROTECTED]> wrote:
>>
>> On 29-Jan-07, at 9:04 PM, Al Hopper wrote:
>>
>> > On Mon, 29 Jan 2007, Toby Thain wrote:
>> >
>> >> Hi,
>> >>
>> >> This is not exactly ZFS specific, but this still seems like a
>> >> fruitful place to ask.
>> >>
>> >> It occurred to me today that hot spares could sit in standby (spun
>> >> down) until needed (I know ATA can do this, I'm supposing SCSI
>> does
>> >> too, but I haven't looked at a spec recently). Does anybody do
>> this?
>> >> Or does everybody do this already?
>> >
>> > I don't work with enough disk storage systems to know what is the
>> > industry
>> > norm.  But there are 3 broad categories of disk drive spares:
>> >
>> > a) Cold Spare.  A spare where the power is not connected until
>> it is
>> > required.  [1]
>> >
>> > b) Warm Spare.  A spare that is active but placed into a low power
>> > mode. ...
>> >
>> > c) Hot Spare.  A spare that is spun up and ready to accept
>> > read/write/position (etc) requests.
>>
>> Hi Al,
>>
>> Thanks for reminding me of the distinction. It seems very few
>> installations would actually require (c)?
>>
>> >
>> >> Does the tub curve (chance of early life failure) imply that hot
>> >> spares should be burned in, instead of sitting there doing nothing
>> >> from new? Just like a data disk, seems to me you'd want to know
>> if a
>> >> hot spare fails while waiting to be swapped in. Do they get tested
>> >> periodically?
>> >
>> > The ideal scenario, as you already allude to, would be for the disk
>> > subsystem to initially configure the drive as a hot spare and
>> send it
>> > periodic "test" events for, say, the first 48 hours.
>>
>> For some reason that's a little shorter than I had in mind, but I
>> take your word that that's enough burn-in for semiconductors, motors,
>> servos, etc.
>>
>> > This would get it
>> > past the first segment of the "bathtub" reliability curve ...
>> >
>> > If saving power was the highest priority, then the ideal situation
>> > would
>> > be where the disk subsystem could apply/remove power to the spare
>> > and move
>> > it from warm to cold upon command.
>>
>> I am surmising that it would also considerably increase the spare's
>> useful lifespan versus "hot" and spinning.
>>
>> >
>> > One "trick" with disk subsystems, like ZFS that have yet to have
>> > the FMA
>> > type functionality added and which (today) provide for hot spares
>> > only, is
>> > to initially configure a pool with one (hot) spare, and then add a
>> > 2nd hot
>> > spare, based on installing a brand new device, say, 12 months
>> > later.  And
>> > another spare 12 months later.  What you are trying to achieve,
>> > with this
>> > strategy, is to avoid the scenario whereby mechanical systems, like
>> > disk
>> > drives, tend to "wear out" within the same general, relatively
>> short,
>> > timeframe.
>> >
>> > One (obvious) issue with this strategy, is that it may be
>> > impossible to
>> > purchase the same disk drive 12 and 24 months later.  However, it's
>> > always
>> > possible to purchase a larger disk drive
>>
>> ...which is not guaranteed to be compatible with your storage
>> subsystem...!
>>
>> --Toby
>>
>> > and simply commit to the fact
>> > that the extra space provided by the newer drive will be wasted.
>> >
>> > [1] The most common example is a disk drive mounted on a carrier
>> > but not
>> > seated within the disk drive enclosure.  Simple "push in" when
>> > required.
>> > ...
>> > Al Hopper  Logical Approach Inc, Plano, TX.  [EMAIL PROTECTED]
>> approach.com
>> >            Voice: 972.379.2133 Fax: 972.379.2134  Timezone: US CDT
>> > OpenSolaris.Org Community Advisory Board (CAB) Member - Apr 2005
>> >              OpenSolaris Governing Board (OGB) Member - Feb 2006
>>
>> _______________________________________________
>> zfs-discuss mailing list
>> zfs-discuss@opensolaris.org
>> http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
>>


_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Reply via email to