On Wed, Sep 12, 2007 at 11:20:52PM +0100, Peter Tribble wrote: > On 9/10/07, Pawel Jakub Dawidek <[EMAIL PROTECTED]> wrote: > > Hi. > > > > I've a prototype RAID5 implementation for ZFS. It only works in > > non-degraded state for now. The idea is to compare RAIDZ vs. RAID5 > > performance, as I suspected that RAIDZ, because of full-stripe > > operations, doesn't work well for random reads issued by many processes > > in parallel. > > > > There is of course write-hole problem, which can be mitigated by running > > scrub after a power failure or system crash. > > If I read your suggestion correctly, your implementation is much > more like traditional raid-5, with a read-modify-write cycle? > > My understanding of the raid-z performance issue is that it requires > full-stripe reads in order to validate the checksum. [...]
No, checksum is independent thing, and this is not the reason why RAIDZ needs to do full-stripe reads - in non-degraded mode RAIDZ doesn't read parity. This is how RAIDZ fills the disks (follow the numbers): Disk0 Disk1 Disk2 Disk3 D0 D1 D2 P3 D4 D5 D6 P7 D8 D9 D10 P11 D12 D13 D14 P15 D16 D17 D18 P19 D20 D21 D22 P23 D is data, P is parity. And RAID5 does this: Disk0 Disk1 Disk2 Disk3 D0 D3 D6 P0,3,6 D1 D4 D7 P1,4,7 D2 D5 D8 P2,5,8 D9 D12 D15 P9,12,15 D10 D13 D16 P10,13,16 D11 D14 D17 P11,14,17 As you can see even small block is stored on all disks in RAIDZ, where on RAID5 small block can be stored on one disk only. -- Pawel Jakub Dawidek http://www.wheel.pl [EMAIL PROTECTED] http://www.FreeBSD.org FreeBSD committer Am I Evil? Yes, I Am!
pgp5p7Tq85M8q.pgp
Description: PGP signature
_______________________________________________ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss