On 1/10/07, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote:
"Dick Davies" <[EMAIL PROTECTED]> wrote on 01/10/2007 05:26:45 AM:
> On 08/01/07, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote:
> > I think that in addition to lzjb compression, squishing blocks that
contain
> > the same data would buy a lot of space for administrators working in
many
> > common workflows.
>
> This idea has occurred to me too - I think there are definite
> advantages to 'block re-use'.
> When you start talking about multiple similar zones, I suspect
> substantial space savings could
> be made - and if you can re-use that saved storage to provide
> additional redundancy, everyone
> would be happy.

My favorite uses come to mind (I have spent a fair amount of time
wishing for this feature):

1) Zones that start out as ZFS clones will tend to diverge as the
system patches.   This will allow them to re-converge as the patches
roll through multiple zones.

2) Environments where each person starts with the same code base (hg
pull http://hg.intevation.org/mirrors/opensolaris.org/onnv-gate/) then
build it producing substantially similar object files.

3) Disk-based backup systems (de-duplication is a buzz word here)

That issue has already come up in the thread,  SHA256 is 2^128 for random,
2^80 for targeted collisions.  That is pretty darn good,  but it would also
make sense to perform a rsync like secondary check on match using a
dissimilar crypto hash.  If we hit very unlikely chance that 2 blocks match
both sha256 and whatever other secondary hash I think that block should be
lost (act of god). =)

Reading the full block and doing a full comparison is very cheap
(given the anticipated frequency) and makes you not have to explain
that the file system has a 2^512 chance of silent data corruption.  As
slim of a chance as that is, ZFS promises to not corrupt my data and
to tell on others that do.  ZFS cannot break that promise.

Mike

--
Mike Gerdts
http://mgerdts.blogspot.com/
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Reply via email to