[appologies for being away from my data last week]

David Dyer-Bennet wrote:
The more I look at it the more I think that a second copy on the same
disk doesn't protect against very much real-world risk.  Am I wrong
here?  Are partial(small) disk corruptions more common than I think?
I don't have a good statistical view of disk failures.

This question was asked many times in this thread.  IMHO, it is the
single biggest reason we should implement ditto blocks for data.

We did a study of disk failures in an enterprise RAID array a few
years ago.  One failure mode stands heads and shoulders above the
others: non-recoverable reads.  A short summary:

  2,919 total errors reported
  1,926 (66.0%) operations succeeded (eg. write failed, auto reallocated)
    961 (32.9%) unrecovered errors (of all types)
     32 (1.1%) other (eg. device not ready)
    707 (24.2%) non-recoverable reads

In other words, non-recoverable reads represent 73.6% of the non-
recoverable failures that occur, including complete drive failures.
Boo!  Did that scare you?  Halloween is next month! :-)
Seagate said today that in a few years 3.5" disks will store 2.5 TBytes.
Boo!

While I don't have data on laptop disk failures, I would not be surprised
to see a similar distribution, though with a larger mechanical damage
count.  My laptops run hotter inside than my other systems and, as a rule
of thumb, your disk failure rate increases by 2x for every 15C change in
temperature.  Is your laptop disk hot?

The case for ditto data is clear to me.  Many people are using single-disk
systems, and many more people would really like to use single-disk systems
but they really can't.

Beyond spinning rust systems, there are other forms of non-volatile
storage which would apply here.  For example, those people who suggested
that you should backup your presentation to a CD fail to note that a spec
of dust on the CD could lead you to lose one block of data.  In my CD/DVD
experience, such losses are blissfully ignored by the system and you may
blame the resulting crash on the cheap hardware you bought from your
brother-in-law.  Beyond CDs, I can see this as being a nice enhancement
to limited endurance devices such as flash.

While it is true that I could slice my disk up into multiple vdevs and
mirror them, I'd much rather set a policy at a finer grainularity: my
files are more important than most of the other, mostly read-only and
easily reconstructed, files on my system.

When ditto blocks for metadata was introduced, I took a look at the
code and was pleasantly suprised.  The code does an admirable job of
ensuring spatial diversity in the face of multiple policies, even in
the single disk case.  IMHO, this is the right way to implement this
and allows you to mix policies with ease.

As a RAS guy, I'm biased to not wanting to lose data via easy-to-use
interfaces.  I don't see how this feature has any downside, but lots
of upside.
 -- richard
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Reply via email to