[EMAIL PROTECTED] wrote:
> Hello,
> 
> Thanks for your reply.
> 
> Selon Jon Radel <[EMAIL PROTECTED]>:
> 
>> Rémi Boulle wrote:
> 
>>> For what I understood dirvish write only the changed parts of the
>>> changed files.
>> While, if backing up across the network, rsync transfers only the
>> changed portions of changed files, what is written to disk is the entire
>> changed file.
> 
> Ok, so dirvish "just" add the part sent by rsync with the unchanged part 
> already
> on the hard drive ?
> 

Actually, rsync handles all that for dirvish.  Dirvish, to be very vague
and high level, basically says to rsync "make this directory on the
local machine look just like that one over on the remote machine--and
here's the directory that contains the last copy of you made of the
remote directory so try to minimize the work and disk space you use."


>>> My first image is roughly 5 Go, so why the second one is 5Go too ? (no
>>> files were changed)
>> How did you measure this?
> 
> Right click/properties ;-) (on ubuntu)

Don't take this too personally, but sometime you're going to either
learn to take everything your GUI is doing on faith or learn what it's
really doing behind the scenes.  :-)  I suspect it's actually calling du.

> 
>> If you simply ran du against each image, then
>> yes, they would both show as 5 gig.  However, if you run du against both
>> images at the same time, it should still show about 5 gig and not 10
>> gig.
> 
> So individually they use 5 gig each and globally they use 5gig + epsilon ?
> 

If the files are all exactly alike (and we ignore any soft links you
might have) then epsilon consists essentially of a second set of
directory entries.


>> Each image is equal, in the sense that no image is the "full
>> backup" and other images are the "differential backup."  They're all
>> "full," some were just made earlier, but there is only one copy of each
>> unchanged, unmoved file physically taking up blocks on the hard drive.
> 
> Ok, so eg the second image as a 5 gig space *reserved* on the filesystem (but
> not used).

Nope.  The second image is a new set of directory entries pointing to
the same data blocks on disk that the first set of directory entries
point at.  None of this is likely to make much sense until you
understand how hard links work in Unix file systems, and you're better
off with an explanation in some competent book (or check
wikipedia--there's probably something decent there) than me trying to
write something quick and semi-coherent.

> 
>> This all assuming that you've not made any configuration errors, or run
>> dirvish with the --init option more than once, or changed critical
>> configuration choices between backups.  It *is* possible to create
>> backups that have no link with each other.  (Which many of us do
>> deliberately for extra safety.
> 
>>From a disk space point of view it is totally equal no ? The only difference 
>>is
> from network load point of view...
> 
> Am I right ? :-)
> 
> Hum, so, is it possible to make differential backups everyday day and, once a
> month, a total backup (I mean with no link) ?
> 

Hmmmm.... I *think* you're confused, but I'm not certain.  Learn about
hard links and it should all make more sense.

>>> Does it mean that all those images are "real" images ? eg I can delete
>>> last monday image without having any problems ?
>>>
>> Unless you want to restore a file that was captured only in that backup.
>>  :-)  If you delete the last successful backup, you'll have to run
>> dirvish with the --init option again.  Other than that, you can delete
>> any image without an issue.
> 
> So the "mustn't to be touched" backups are the first one and the last one ?

Not exactly.  Some of this depends on policy, but some general guidelines:

Last one:  Probably don't want to delete this, as it is, of course, your
more recent backup.

Recent ones:  Probably don't want to delete any that are recent enough
that there's a chance that you've not yet discovered the silly mistake
you made while editing your favorite file.  In other words, if you
destroy the contents of your file on Monday and don't discover this
until Friday, you'll really glad that you have Sunday's backup too and
not only Thursday's.

Last remaining successful backup:  As stated above, it is more
convenient to have at least one fully successful backup in place,
otherwise your automated scripts will probably fail and you'll have to
manually run dirvish with --init.  However, this does *not* have to be
the first backup.  It can be any backup.  So far as dirvish is
concerned, all the backups are of equivalent status and validity.  (This
is the reason that dirvish's own expire script will refuse to delete the
last successful backup.)

You'll see plenty of example configuration files where you'll note that
all backups are deleted by the time they're a year or two old.


--Jon Radel

Attachment: smime.p7s
Description: S/MIME Cryptographic Signature

_______________________________________________
Dirvish mailing list
Dirvish@dirvish.org
http://www.dirvish.org/mailman/listinfo/dirvish

Reply via email to