|
Darren and Henk; Firstly, thank you very much for both of your replies. I am very grateful indeed for you all taking time off to answer my questions. I understand RAID-5 quite well and from both of your RAID-Z description, I see that the RAID-Z parity is also a separate block on a separate disk. Very well. This is just like RAID-5. My confusion is simple. Would this not then give rise also to the write-hole vulnerability of RAID-5? Jeff Bonwick states "that there's no way to update two or more disks atomically, so RAID stripes can become damaged during a crash or power outage." If I understand correctly, then the parity block for RAID-Z are also written in two different atomic operations. As per RAID-5. (the only difference being each can be of a different stripe size). How then does it fit into Jeff's statement that "Every block is its own RAID-Z stripe?" ( Perhaps I misunderstood but I now think this statement is rather for the fact that RAID-Z has a variable stripe size rather than meaning each block holding it's own RAID-Z parity within itself. ) If the write-hole power outage situation as described by Jeff Bonwick occur, how does RAID-Z "beat" the RAID-5 write-hole vulnerability? Through each block's independant checksum held one level above in the metadata block? Is this correct? Or am I completely off course? Henk Langeveld wonderful character based diagrams describes what is basically a standard RAID-5 layout on 4 disks. How is RAID-Z any different from RAID-5? (except for the ability to stripe different sizes which gives allows RAID-Z to never have to do a read-modify-write. This increases performance very significantly but I am unable to relate this to the write-hole vulnerability issue). Warmest Regards Steven Sim Darren Dunham wrote:
Fujitsu Asia Pte. Ltd. _____________________________________________________ This e-mail is confidential and may also be privileged. If you are not the intended recipient, please notify us immediately. You should not copy or use it for any purpose, nor disclose its contents to any other person. Opinions, conclusions and other information in this message that do not relate to the official business of my firm shall be understood as neither given nor endorsed by it. |
_______________________________________________ zfs-discuss mailing list [email protected] http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
