Brian McBride wrote: > I have some questions from a customer about zfs checksums. > Could anyone answer some of these? Thanks. > > Brian > > Customer: > I would like to know more about zfs's checksum feature. I'm guessing > it is something that is applied to the data and not the disks (as in > raid-5). >
RAID-5 does not do checksumming. It does a parity calculation, but many RAID-5 implementations do not actually check the parity unless a disk reports an error. ZFS always checks the checksum, unless you disable it. At this point, I usually explain how people find faults in their SAN because ZFS's checksum works end-to-end. > For performance reasons, I turned off checksum on our zfs filesystem > (along with atime updates). Because of a concern for possible data > corruption (silent data corruption), I'm interested in turning checksum > back on. When I do so, will it create checksums for existing files or > will they need to be rewritten? And can you tell me the overhead > involved with having checksum active (CPU time, additional space)? > > To put this in perspective, in general, the time it takes to read the data from disk is much larger than the time required to calculate the checksum. But, you can also use different checksum algorithms, with varying strength and computational requirements. By default, ZFS uses a Fletcher-2 algorithm, but you can enable Fletcher-4 or SHA-256. If you are planning to characterize the computational cost of checksums, please add these to your test plan and report back to us :-) -- richard _______________________________________________ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss