Colin,

I think you mix up the filesystem layer (where the individual files as
maintained) and the block layer, where actual data is stored.

The analogue of deduplication on the filesystem layer would be to create hard
links of the files, where deleting one file does not remove the other link.

Block layer deduplication is a black box, see it simply as a compression,
which works in the background.

Thomas

* Colin Raven [Tue Dec 08, 2009 at 01:00:54PM +0100]:
  In reading this blog post:
  [1]http://blogs.sun.com/bobn/entry/taking_zfs_deduplication_for_a
  a question came to mind.....
  To understand the context of the question, consider the opening paragraph
  from the above post;

    Here is my test case: I have 2 directories of photos, totaling about
    90MB each. And here's the trick - they are almost complete duplicates of
    each other. I downloaded all of the photos from the same camera on 2
    different days. How many of you do that ? Yeah, me too.�

  OK, I consider myself in that category most certainly. Through just plain
  'ol sloppiness I must have multiple copies of some images. Sad self
  indictment...but anyway....
  What happens if, once dedup is on, I (or someone else with delete rights)
  open a photo management app containing that collection, and start deleting
  dupes - AND - happen to delete the original that all other references are
  pointing to. I know, I know, it doesn't matter - snapshots save the day -
  but in this instance that's not the point because I'm trying to properly
  understand the underlying dedup concept.
  Logically, if you delete what everything is pointing at, all the pointers
  are now null values, they are - in effect - pointing at nothing...an empty
  hole.
  I have the feeling the answer to this is; "no they don't, there is no
  spoon ("original") you're still OK". I suspect that, only because the
  people who thought this up couldn't possibly have missed such an "obvious"
  point. The problem I have is in trying to mentally frame this in such a
  way that I can subsequently explain it, if asked to do so (which I see
  coming for sure).
  Help in understanding this would be hugely helpful - anyone?
  Regards & TIA,
  -Me
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Reply via email to