Something occurs to me: how full is your current 4 vdev pool? I'm assuming it's not over 70% or so.

Yes, by adding another 3 vdevs, any writes will be biased towards the "empty" vdevs, but that applies to less-than-full-stripe-width writes (right, Richard?). That is, if I'm doing a write that would be full-stripe size, and I've got enough space on all vdevs (even if certain ones are much fuller than others), then it will be written across all vdevs.

So, while you can't get a virgin pool out of this, I think you can get things reasonably well balanced by copying and then deleting, say, 1TB (or less) of data at a time.
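A rough sketch of that copy-then-delete shuffle, with made-up pool/dataset names ("tank/data") and zfs send/recv as just one way of doing the copy:

    # Snapshot and copy a chunk of data; the copy's blocks get allocated
    # with the current (post-expansion) vdev weighting.
    zfs snapshot tank/data@rebalance
    zfs send tank/data@rebalance | zfs receive tank/data-rebalanced

    # Once you've verified the copy, drop the original so its old,
    # lopsided blocks are freed, then slot the copy into its place.
    zfs destroy -r tank/data
    zfs rename tank/data-rebalanced tank/data

    # Repeat in ~1TB chunks and watch per-vdev usage even out.
    zpool iostat -v tank

Obviously this works per dataset, and you need enough free space to hold each chunk twice while you shuffle it.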


Richard Elling wrote:
On Nov 20, 2009, at 12:14 PM, Jesse Stroik wrote:

There are, of course, job types where you use the same set of data for multiple jobs, but having even a small amount of extra memory seems to be very helpful in that case, as you'll have several nodes reading the same data at roughly the same time.
Yep. More, faster memory closer to the consumer is always better. You could buy machines with TBs of RAM, but high-end x86 boxes top out at 512 GB.


That was our previous approach. We're testing doing it with relatively cheap, consumer-level Sun hardware (i.e., machines with 64 or maybe 128 GB of memory today) that can be easily expanded as the pool's purpose changes.

I know what our options are for increasing performance if we want to increase the budget. My question isn't, "I have this data set, can you please tell me how to buy and configure a system." My question is, "how does ZFS balance pools during writes, and how can I force it to balance data I want balanced in the way I want it balanced?" And if the answer to that question is, "you can't reliably do this," then that is acceptable. It's something I would like to be able to plan around.
From a user's standpoint, you can't "force" ZFS to do the block layout in a manner you specify. The best you can do is understand what ZFS does in a given situation. There's no ability to TELL ZFS what to do.



Writes (allocations) are biased towards the freer (in the percentage sense) of the fully functional vdevs. However, diversity for copies and affinity for gang blocks are preserved. The starting point for understanding this in the code is metaslab.c: http://src.opensolaris.org/source/xref/onnv/onnv-gate/usr/src/uts/common/fs/zfs/metaslab.c
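(You can watch that bias from userland, too; "tank" below is just a placeholder pool name. zpool iostat -v breaks out capacity and I/O per top-level vdev, so after adding vdevs you can see new writes landing mostly on the emptier ones.)

    # Per-vdev capacity and free space for a hypothetical pool "tank".
    zpool iostat -v tank

    # Sample every 5 seconds while a write workload runs to see which
    # vdevs the write ops are actually being directed to.
    zpool iostat -v tank 5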


Right now, this storage node is very small (~100TB) and in testing. I want to know how I can solve problems like this as we scale it up into a full-fledged SAN that holds a lot more data and gets moved into production. Knowing the limitations of ZFS is a critical part of properly designing and expanding the system.
For a lot of reasons, I would consider creating NEW zpools when you add new disk space in large lots, rather than adding vdevs to existing zpools. It should prove no harder to manage, and it lets you get a virgin zpool, which will provide the best performance.
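For comparison, the two paths look roughly like this (pool and device names are purely illustrative):

    # Growing the existing pool: the new top-level vdev joins "tank",
    # existing data stays put, and only new writes favor the empty vdev.
    zpool add tank raidz2 c2t0d0 c2t1d0 c2t2d0 c2t3d0 c2t4d0 c2t5d0

    # Richard's suggestion: a separate, virgin pool for the new disks,
    # so everything written to it is spread across all of its vdevs
    # from day one.
    zpool create tank2 raidz2 c2t0d0 c2t1d0 c2t2d0 c2t3d0 c2t4d0 c2t5d0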




Sometimes, ignorance is bliss :-)
 -- richard
oooh, then I must be ecstatically happy!

--
Erik Trimble
Java System Support
Mailstop:  usca22-123
Phone:  x17195
Santa Clara, CA

