UNIX admin wrote:
This is simply not true. ZFS would protect against the same type of errors seen on an individual drive as it would on a pool made of HW raid LUN(s). It might be overkill to layer ZFS on top of a LUN that is already protected in some way by the device's internal RAID code, but it does not "make your data susceptible to HW errors caused by the storage subsystem's RAID algorithm, and slow down the I/O".
I disagree, and vehemently at that. I maintain that if the HW RAID is used, the
chance of data corruption is much higher, and ZFS would have a lot more
repairing to do than it would if it were used directly on disks. Problems with
HW RAID algorithms have been plaguing us for at least 15 years. The
venerable Sun StorEdge T3 comes to mind!
Please expand on your logic. Remember that ZFS works on top of LUNs. A disk drive by itself is a LUN when added to a ZFS pool. A LUN can also be several disk drives striped together and presented to a host as one logical unit. Or a LUN can be offered by a virtualization gateway that in turn imports raid array LUNs that are themselves made up of individual disk drives. Or ... insert a million different ways to present a host with something called a LUN that it can read and write blocks to. They could be really slow LUNs because they're two hamsters shuffling zeros and ones back and forth on little wheels. (OK, that might be too slow.) Outside of the write-cache enabling that happens when entire disk drives are presented to the pool, ZFS doesn't care what the LUN is made of.
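(To make that whole-disk caveat concrete, here's a minimal sketch with made-up Solaris device names: hand ZFS the whole disk and it labels the disk itself and can turn on the drive's write cache; hand it a slice or an array LUN and it just treats it as another block device.)

    # Whole disk (hypothetical device name): ZFS puts its own label
    # on it and can enable the drive's write cache.
    zpool create wholedisk c1t0d0

    # A slice -- or a LUN exported by a raid array -- is just a
    # block device to ZFS; the pool features are the same either way.
    zpool create fromslice c2t0d0s0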
ZFS reliability features are available and work on top of the LUNs you give it and the configuration you use. The type of LUN is inconsequential at the ZFS level. If I had 12 LUNs that were single disk drives and created a RAIDZ pool, it would have the same reliability at the ZFS level as if I presented it 12 LUNs that were really quad-mirrors from 12 independent hw raid arrays. You can make the argument that the 12-disk-drive config is easier to use, or that the overall reliability of the 12 quad-mirror-LUN system is higher, but from ZFS's point of view it's the same. It's happily writing blocks, checking checksums, reading things from the LUNs, etc. etc. etc.
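Just to sketch that point (device names invented, and trimmed to four LUNs per pool to keep the lines short): the raidz pool gets built the same way whether the LUNs are bare drives or quad-mirror volumes exported by an array, and scrubs and checksum accounting work the same on either.

    # Case 1: LUNs that are plain single disk drives
    zpool create tank1 raidz c1t0d0 c1t1d0 c1t2d0 c1t3d0

    # Case 2: LUNs that are really quad-mirrors exported by hw arrays
    zpool create tank2 raidz c4t0d0 c4t1d0 c4t2d0 c4t3d0

    # Same checksumming, same raidz reconstruction, same reporting.
    zpool scrub tank1
    zpool status -v tank1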
On top of that, disk drives are not simple beasts that just cough up i/o when you want them to. A modern disk drive does all sorts of stuff under the covers to speed up i/o and - surprise - increase the reliability of the drive as much as possible. If you think you're really writing "straight to disk", you're not. Cache, ZBR (zone bit recording), and bad block re-allocation all come into play.
As for problems with specific raid arrays, including the T3, you are preaching to the choir, but I'm definitely not going to get into a pissing contest over which specific components have more or fewer bugs than others.
Further, while it is perfectly logical to me that doing RAID calculations twice
is slower than doing it once, you maintain that is not the case, perhaps
because one calculation is implemented in FW/HW?
As the man says, "It depends". A really fast raid array might respond to i/o requests faster than a single disk drive. It might not, given the nature of the i/o coming in.
Don't think of it in terms of RAID calculations taking a certain amount of time. Think of it in terms of having to meet a specific set of requirements to manage your data. I'll be the first to say that if you're going to be putting ZFS on a desktop, then a simple JBOD is a box to look at. If you're going to look at an enterprise data center, the answer is going to be different. That is something a lot of people on this alias seem to be missing. Stating that ZFS on JBODs is the answer to everything is the punchline of the "When all you have is a hammer..." routine.