On 8/6/2020 11:09 AM, deloptes wrote:
Leslie Rhorer wrote:
And how useful is that? There are very few duplicate files on my
systems, because I use applications to eliminate duplicates.
Eliminating duplicates in a live data repository is far more important
than doing it on backup media.
May be you misunderstand deduplication or duplication. It is not about files
you have on your disk, but what is already in the backup repository, so you
avoid duplicating the backup of the same file for example in each full
backup. Because most of the files do not change, this is how you save that
much space.
That is the default for DAR for a given existing isolate*. Any backup
software that is aware of its own history can accomplish this, even TAR.
The command `tar -u` will append only new files or files that have
changed to the end of the archive. All you seem to be talking about is
a differential backup.
* - An isolate is essentially a listing of all the files associated with
an archive along with the history if its actions. DAR has the ability
to isolate a backup and work from it. The isolate is tiny.