On 29-Jan-07, at 9:04 PM, Al Hopper wrote:
On Mon, 29 Jan 2007, Toby Thain wrote:
Hi,
This is not exactly ZFS specific, but this still seems like a
fruitful place to ask.
It occurred to me today that hot spares could sit in standby (spun
down) until needed (I know ATA can do this, I'm supposing SCSI does
too, but I haven't looked at a spec recently). Does anybody do this?
Or does everybody do this already?
I don't work with enough disk storage systems to know what is the
industry
norm. But there are 3 broad categories of disk drive spares:
a) Cold Spare. A spare where the power is not connected until it is
required. [1]
b) Warm Spare. A spare that is active but placed into a low power
mode. ...
c) Hot Spare. A spare that is spun up and ready to accept
read/write/position (etc) requests.
Hi Al,
Thanks for reminding me of the distinction. It seems very few
installations would actually require (c)?
Does the tub curve (chance of early life failure) imply that hot
spares should be burned in, instead of sitting there doing nothing
from new? Just like a data disk, seems to me you'd want to know if a
hot spare fails while waiting to be swapped in. Do they get tested
periodically?
The ideal scenario, as you already allude to, would be for the disk
subsystem to initially configure the drive as a hot spare and send it
periodic "test" events for, say, the first 48 hours.
For some reason that's a little shorter than I had in mind, but I
take your word that that's enough burn-in for semiconductors, motors,
servos, etc.
This would get it
past the first segment of the "bathtub" reliability curve ...
If saving power was the highest priority, then the ideal situation
would
be where the disk subsystem could apply/remove power to the spare
and move
it from warm to cold upon command.
I am surmising that it would also considerably increase the spare's
useful lifespan versus "hot" and spinning.
One "trick" with disk subsystems, like ZFS that have yet to have
the FMA
type functionality added and which (today) provide for hot spares
only, is
to initially configure a pool with one (hot) spare, and then add a
2nd hot
spare, based on installing a brand new device, say, 12 months
later. And
another spare 12 months later. What you are trying to achieve,
with this
strategy, is to avoid the scenario whereby mechanical systems, like
disk
drives, tend to "wear out" within the same general, relatively short,
timeframe.
One (obvious) issue with this strategy, is that it may be
impossible to
purchase the same disk drive 12 and 24 months later. However, it's
always
possible to purchase a larger disk drive
...which is not guaranteed to be compatible with your storage
subsystem...!
--Toby
and simply commit to the fact
that the extra space provided by the newer drive will be wasted.
[1] The most common example is a disk drive mounted on a carrier
but not
seated within the disk drive enclosure. Simple "push in" when
required.
...
Al Hopper Logical Approach Inc, Plano, TX. [EMAIL PROTECTED]
Voice: 972.379.2133 Fax: 972.379.2134 Timezone: US CDT
OpenSolaris.Org Community Advisory Board (CAB) Member - Apr 2005
OpenSolaris Governing Board (OGB) Member - Feb 2006
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss