> Bit errors happen.  When they do, data is corrupted.

This is rather an oversimplification.

Single-bit errors *on the media* happen relatively frequently.  In fact, 
multi-bit errors are not too uncommon either.  Hence there is a lot of 
error-correction data written to the disc media.

The error-correction codes used in disc products tend to be proprietary these 
days, or at least not documented. However, the error rates are published. For a 
typical enterprise-class disc, the correctable error rate is around 1 in 10^12 
bits read, while the uncorrectable error rate is 1 in 10^15 bits read.

Note that neither of these events result in data "corruption", at least in the 
sense of "silent data corruption", though they result in data "loss".

It is possible for a pattern of bit errors to result in a miscorrection. These 
numbers are not published for disc drives, so far as I know. They are, however, 
for tape drives. A typical high-end StorageTek tape drive has an uncorrected 
bit error rate of 1 in 10^19 bits read, and a miscorrected bit error rate of 1 
in 10^33 bits read.

It seems plausible that a disc drive might have a miscorrected bit error rate, 
then, of no worse than 1 in 10^25 bits read. At 100 MB/second, this is roughly 
3000 million years.

Errors elsewhere in the system seem much more likely to introduce problems than 
on the media. Some low-end disks may not have any protection, or at best 
parity, on disk caches. The PCI bus has only parity. Both of these can be 
affected by double-bit errors.

If you’re running on a system without ECC memory, the most likely source of 
undetected errors is your main memory.
 
 
This message posted from opensolaris.org
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Reply via email to