I'm replying inline because it is easier.

>>>>> On Mon, 9 Nov 2020 12:38:46 -0500, Brannon Smith said:
> 
> I suppose I misinterpreted this from the manual:
> 
> "In accurate mode, the File daemon knows exactly which files were present
> after the last backup. So it is able to handle deleted or renamed files.
> 
> When restoring a FileSet for a specified date (including "most recent"),
> Bacula is able to restore exactly the files and directories that existed at
> the time of the last backup prior to that date including ensuring that
> deleted files are actually deleted, and renamed directories are restored
> properly."
> 
> So what is the difference between an accurate and a non-accurate
> incremental backup, if I have a fileset where one of the files has been
> renamed? Is the idea that a renamed file won't have a modified mtime, so
> ordinarily it would not be seen as needing to be backed up?

Yes, that's almost correct.  The mtime of a renamed file is not changed, but
the ctime is changed so that would be included in a non-accurate backup.  The
problem is when you rename a directory, which doesn't change the mtime or
ctime of the files below it in the tree.


> I apologize for the basic questions, but let me explain what I think is
> happening with the accurate backup and maybe you can tell me where I have
> gone wrong.
> Bacula looks at the current file and the accurate options; if the md5
> (along with the other options) of the file matches the options and md5 of a
> file already stored for the fileset, then it isn't backed up (i.e., the
> file was renamed). Does Bacula not keep a record of the new file name?

That's not quite correct.  When doing an accurate diff/incr backup, the file
will be backed up using its current file name if either of the following is
true:

1) The current file name is not listed in the catalog (i.e. it is a new name,
either a completely new file or a renamed file).

2) One of the attibutes listed in the accurate options has changed from the
value listed in the catalog for the current file name.


> Thinking through it, I see why it would have to calculate the md5 for every
> file. I guess what I am really looking for is a way to check the ctime (to
> detect a rename) and then only backup the file if the md5 has changed. But
> that doesn't seem to be an option.

Yes, that's correct.

__Martin


> 
> Thanks again for all the help,
> 
> B. Smith
> 
> 
> On Mon, Nov 9, 2020 at 11:56 AM Martin Simmons <mar...@lispworks.com> wrote:
> 
> > Bacula doesn't have any code to match a renamed file with a previous
> > file.  It
> > just makes a backup of every unknown file or those that match the Accurate
> > options.
> >
> > Note that if you remove "5" from the Accurate options then you will need to
> > add "mc" to make it detect changes to existing files.
> >
> > __Martin
> >
> >
> > >>>>> On Sat, 7 Nov 2020 15:22:05 -0500, B Smith said:
> > >
> > > Hi,
> > >
> > > Thanks for the response. My (incorrect, I suppose) understanding was that
> > > Accurate=pins5 would just look at an unknown file and determine if it was
> > > an existing file that had been renamed. In that case, I'm fine with
> > taking
> > > the extra time to calculate the MD5 to verify the correspondence of the
> > > files. I wanted this option so that I wouldn't take up space on the tape
> > > with files that had just been renamed. Is there a way to do the same
> > thing
> > > without recalculating the MD5 for files that obviously haven't changed?
> > > Thank you,
> > >
> > > B. Smith
> > >
> > >
> > > On Sat, Nov 7, 2020 at 10:10 AM Eric Bollengier <e...@baculasystems.com>
> > > wrote:
> > >
> > > > Hello,
> > > >
> > > > On 2020-11-07 01:01, B. Smith wrote:
> > > > > Hi,
> > > > >
> > > > > When using accurate=yes and verify=pins5, does Bacula recalculate
> > the MD5
> > > > > for each file even when the atime/mtime/ctime of the file has not
> > > > changed?
> > > >
> > > > It is configurable, and from what I see in your fileset, this is what
> > > > you have set. "Accurate = pins5" (the character 5 at the end).
> > > >
> > > > To speed up things, you might want to not do it unless you data is very
> > > > critical.
> > > >
> > > > Hope it helps!
> > > >
> > > > Best Regards,
> > > > Eric
> > > >
> > > > > I have large files (~60GB) that rarely change in a directory; usually
> > > > just
> > > > > a new large file is added every couple of weeks... files are rarely
> > ever
> > > > > deleted and never modified. When I run the incremental, however, it
> > is
> > > > > taking hours to examine the files. Is there anything I can do to
> > speed
> > > > this
> > > > > up? I have what should be plenty of memory (32GB). Job and FileSet
> > > > > definitions are below.
> > > > >
> > > > > Job {
> > > > >         Name = "Big Files"
> > > > >         Write Bootstrap = "/var/db/bacula/bigfiles-%s-%e.bsr"
> > > > >         JobDefs = DefaultJob
> > > > >         Type = Backup
> > > > >         Level = Incremental
> > > > >         Client = FreeNAS-fd
> > > > >         FileSet = "Big Files"
> > > > >         Spool Data = yes
> > > > >         Storage = TapeLibrary
> > > > >         Pool = File
> > > > >         Accurate = yes
> > > > >         Schedule=LargeFileCycle
> > > > >         Priority = 12
> > > > > }
> > > > >
> > > > > FileSet {
> > > > >   Name = "Big Files"
> > > > >   Include {
> > > > >    Options {
> > > > >     signature = MD5
> > > > >     Accurate=pins5 #should allow keeping track of deleted/renamed
> > files.
> > > > >     verify = pins5 #compare permission bits, then inodes, number of
> > > > links,
> > > > > size, and finally MD5 changes
> > > > >     IgnoreCase = yes
> > > > >    }
> > > > >    File = "/media/DataSet1/Big Files/"
> > > > >   }
> > > > > }
> > > > >
> > > > > Thanks,
> > > > >
> > > > > B. Smith
> > > > >
> > > >
> > > >
> > > > _______________________________________________
> > > > Bacula-users mailing list
> > > > Bacula-users@lists.sourceforge.net
> > > > https://lists.sourceforge.net/lists/listinfo/bacula-users
> > > >
> > >
> >
> 


_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

Reply via email to