On Dec 31, 2009, at 1:43 AM, Andras Spitzer wrote:

Let me sum up my thoughts on this topic.

To Richard [relling]: I agree with you that this topic gets even more confusing if we are not careful to specify exactly what we are talking about. Thin provisioning can be done at multiple layers, and though you said you like it closer to the app than to the dumb disks (if you were referring to the SAN), my opinion is that every scenario has its own pros and cons. I learned a long time ago not to declare a technology good or bad; there are technologies which are used properly (usually declared good tech) and others which are not (usually declared bad).

I hear you. But you are trapped thinking about 20th century designs and ZFS is a
21st century design.  More below...

Let me clarify my case, and why I mentioned thin devices on the SAN specifically. Many people replied about ZFS's thin device support (called sparse volumes, if I'm correct), but I was talking about something else: thin device "awareness" on the SAN.

In this case you configure your LUN on the SAN as a thin device: a virtual LUN backed by a pool of physical disks in the SAN. It is transparent to the OS, and likewise from the volume manager/filesystem point of view.

That is the basic definition of my scenario with thin devices on the SAN. High-end SAN frames like the HDS USP-V (a feature called "Hitachi Dynamic Provisioning") and the EMC Symmetrix V-Max (a feature called "Virtual Provisioning") support this, and I'm sure many others do as well. Once you have discovered the LUN in the OS, you start to use it: put it under the volume manager, create a filesystem, copy files. But the SAN only allocates physical blocks (more precisely, groups of blocks called extents) as you write to them, which means you consume only as much physical disk as you actually use (or a bit more, rounded up to the next extent).

From this standpoint we can define two terms: thin-friendly and thin-hostile environments. Thin-friendly would be any environment where the OS/VM/FS doesn't write to blocks it doesn't really use (for example, during initialization it doesn't fill up the LUN with a pattern or zeros).
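
To make the allocate-on-write idea concrete, here is a minimal sketch in Python. It is not any vendor's implementation; the class names, extent counts, and pool sizes are invented for illustration. It also shows why a thin-hostile initialization pass defeats the scheme: touching every block maps every extent, even though nothing useful is stored.

class ThinPool:
    """Shared physical capacity that backs many virtual (thin) LUNs."""
    def __init__(self, physical_extents):
        self.free = physical_extents

    def allocate(self):
        if self.free == 0:
            raise RuntimeError("pool exhausted (over-provisioned)")
        self.free -= 1

class ThinLUN:
    """Virtual LUN: the host sees the full size, but extents are mapped lazily."""
    def __init__(self, pool, virtual_extents):
        self.pool = pool
        self.virtual_extents = virtual_extents   # size the OS/VM/FS sees
        self.mapped = set()                      # extents actually backed by disk

    def write(self, extent):
        if extent not in self.mapped:            # physical space is consumed only
            self.pool.allocate()                 # on the first write to an extent
            self.mapped.add(extent)

pool = ThinPool(physical_extents=10_000)
lun = ThinLUN(pool, virtual_extents=100_000)     # LUN looks 10x larger than its backing

for ext in range(50):                            # thin-friendly: write only what you use
    lun.write(ext)
print(len(lun.mapped), "extents consumed")       # 50

try:
    for ext in range(lun.virtual_extents):       # thin-hostile: initialize every block
        lun.write(ext)
except RuntimeError as err:
    print(err)                                   # the shared pool runs dry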

That's why Veritas's SmartMove is a nice feature: when you migrate from fat to thin devices (from the OS, both LUNs look exactly the same), it copies only the blocks that are used by VxFS files.

ZFS does this by design. There is no way in ZFS to not do this.
I suppose it could be touted as a "feature" :-)  Maybe we should brand
ZFS as "THINbyDESIGN(TM)"  Or perhaps we can rebrand
SMARTMOVE(TM) as TRYINGTOCATCHUPWITHZFS(TM) :-)

That is still just the basics of having thin devices on the SAN and hoping for a thin-friendly environment. The next level is the management of the thin devices and of the physical pool that the thin devices allocate their extents from.

Even if you do get migrated to thin device LUNs, your thin devices will become fat again: if you fill up your filesystem even once, the thin device on the SAN will stay fat, because no space reclamation happens by default. The reason is pretty simple: the SAN storage has no knowledge of the filesystem structure, so it can't decide whether a block can be released back to the pool or is really still in use. Then Veritas came up with the brilliant idea of building a bridge between the FS and the SAN frame (this became the Thin Reclamation API), so they can communicate which blocks are indeed no longer in use.
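
As a rough illustration of the reclamation idea (this is only a sketch of the information flow, not the Thin Reclamation API itself; the function and extent numbers are made up), the filesystem has to hand the array the set of extents it still uses, because the array cannot work that out on its own:

def reclaim(mapped_extents, fs_used_extents, pool_free):
    """Sketch: release extents the array still maps but the FS no longer uses.

    The array only knows mapped_extents (everything that was ever written);
    the filesystem has to supply fs_used_extents, which is exactly the bridge
    an interface like the Thin Reclamation API (or SCSI UNMAP/TRIM) provides.
    """
    reclaimable = mapped_extents - fs_used_extents
    return mapped_extents - reclaimable, pool_free + len(reclaimable)

# The LUN went "fat": the filesystem once filled extents 0-999 ...
mapped = set(range(1000))
# ... but today only extents 0-199 hold live data.
fs_used = set(range(200))

mapped, free = reclaim(mapped, fs_used, pool_free=0)
print(len(mapped), "extents still mapped,", free, "returned to the pool")   # 200, 800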

I would really like you to read this Quick Note from Veritas about the feature; it explains the concept far better than I can: http://ftp.support.veritas.com/pub/support/products/Foundation_Suite/338546.pdf

By the way, in this scheme VxVM can even detect (via an ASL) whether a LUN is thin-device or thin-reclamation capable.

Correct.  Since VxVM and VxFS are separate software, they have expanded
the interface between them.

Consider adding a mirror or replacing a drive.

Prior to SMARTMOVE, VxVM had no idea what part of the volume was data
and what was unused. So VxVM would silver the mirror by copying all of the
blocks from one side to the other. Clearly this is uncool when your SAN
storage is virtualized.

With SMARTMOVE, VxFS has a method to tell VxVM that portions of the
volume are unused. Now when you silver the mirror, VxVM knows that
some bits are unused and it won't bother to copy them.  This is a bona
fide good thing for virtualized SAN arrays.
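
A back-of-the-envelope sketch of the difference (the block counts and extent granularity are invented; the point is only that a blind silver writes, and on a thin LUN allocates, every block of the volume, while a usage-aware silver touches only live data):

EXTENT_BLOCKS = 1536            # blocks per thin extent (made-up granularity)

def extents_allocated(blocks_written):
    """Every block written to the new mirror side maps a thin extent on the array."""
    return -(-blocks_written // EXTENT_BLOCKS)   # ceiling division

volume_blocks = 2_000_000_000   # a ~1 TB volume of 512-byte blocks
used_blocks = 100_000_000       # filesystem roughly 5% full

blind = volume_blocks           # pre-SMARTMOVE: copy every block of the volume
aware = used_blocks             # with FS usage info: copy only the live blocks

print("blind silver:", blind, "blocks written,", extents_allocated(blind), "thin extents mapped")
print("aware silver:", aware, "blocks written,", extents_allocated(aware), "thin extents mapped")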

ZFS was designed with the knowledge that the narrow interface between file systems and volume managers is a severe limitation that leads to all sorts of complexity and angst, so a different design was needed. ZFS
has fully integrated RAID with the file system, so there is no need, by
design, to create a new interface between these layers. In other words,
the only way to silver a disk in ZFS is to silver the data. You can't silver
unused space. There are other advantages as well.  For example, in
ZFS silvers are done in time order, which has benefits for recovery
when devices are breaking all around you.  Jeff describes this rather
nicely in his blog:
        http://blogs.sun.com/bonwick/entry/smokin_mirrors
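
Here is a highly simplified sketch of that idea, assuming a toy block-pointer tree (the class name, fields, and txg numbers are made up, and the real resilver code is far more involved), but two properties carry over: unused space is never visited because the walk follows the pool's own metadata, and whole subtrees born before the last-known-good transaction group can be skipped.

from dataclasses import dataclass, field

@dataclass
class BlockPointer:
    birth_txg: int                               # transaction group the block was born in
    children: list = field(default_factory=list)

def resilver(bp, last_good_txg, copy):
    """Walk the pool's metadata; copy only blocks the recovering side lacks.

    A parent is rewritten whenever a child changes, so its birth txg is at
    least as new as any child's; if the parent predates the cutoff, the
    whole subtree already exists on the recovering side and can be skipped.
    """
    if bp.birth_txg <= last_good_txg:
        return
    copy(bp)
    for child in bp.children:
        resilver(child, last_good_txg, copy)

# Toy pool: one subtree untouched since txg 80, another modified at txg 130.
root = BlockPointer(130, [BlockPointer(80), BlockPointer(130, [BlockPointer(125)])])
copied = []
resilver(root, last_good_txg=100, copy=copied.append)   # device was out since txg 100
print(len(copied), "blocks copied")                     # 3, not the whole disk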

In short, ZFS doesn't need SMARTMOVE because it doesn't have the
antiquated view of storage management that last century's designs
had. Also, ZFS users who don't use snapshots could benefit from TRIM.

Honestly, I have mixed feelings about ZFS. I feel that this is obviously the VM/filesystem of the future, but at the same time I realize that the roles of the individual parts in the big picture are getting mixed up. Am I the only one with the impression that ZFS will sooner or later evolve into a SAN OS, and the zfs and zpool commands will become just lightweight interfaces to control the SAN frame? :-) (like Solution Enabler for EMC)

I don't see that evolution. But I've always contended that storage
arrays are just specialized servers which speak a limited set of
protocols.  After all, there is no such thing as "hardware RAID,"
all RAID is done in software. So my crystal ball says that such
limited server OSes will have a hard life ahead of them.

If you ask me, the pool concept always works more efficiently if (1) you have more capacity in the pool and (2) you have more systems sharing the pool; that's why I see a thin device pool as more rational in a SAN frame.

Anyway, I'm sorry if you were already aware of what I explained above, and I hope I didn't offend anyone with my views.

I have a much simpler view of VxFS and VxVM.  They are neither
open source nor free, but they are so last century :-)
 -- richard

_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
