On 01/28/11 02:38 PM, Igor P wrote:
I created a zfs pool with dedup with the following settings:
zpool create data c8t1d0
zfs create data/shared
zfs set dedup=on data/shared

The thing I was wondering about was it seems like ZFS only dedup at the file 
level and not the block. When I make multiple copies of a file to the store I 
see an increase in the deup ratio, but when I copy similar files the ratio 
stays at 1.00x.
Igor, ZFS does indeed perform dedup at the block level. Identical files have identical blocks, of course, but "similar" files may have differences such that data is inserted, deleted or changed so each block is different. Same data has to be on the same block alignment to have duplicate blocks. Also, it's important to have lots of RAM or high speed devices to quickly access metadata, or removing data will take a lot of time, so please use appropriately sized systems. That's been discussed a lot on this list.

See Jeff Bonwick's blog for a very good description: http://blogs.sun.com/bonwick/entry/zfs_dedup

I hope that's helpful,
  Jeff (a different Jeff)

_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Reply via email to