Hi Rob,
Am 30.12.2024 um 04:34 schrieb Rob Gerber:
Happy new year, everyone!
I have a testing environment that I'm assembling for a client. Bacula
15.0.2, running in a rocky linux vm on a synology DS423+.
I have tested backups with a windows FD using a windows VM running
windows 11. This went well. A few days ago I added one of my windows 11
workstations as an FD. This FD has about 1.44TB of space used. The FD
name is 'akita'.
Because I'd just added the FD client to akita, I started a full backup.
Overnight, the routinely scheduled incremental backup queued up behind
the full backup. Looking at the logs, the scheduled incremental waited
until 20 seconds after the manually started full backup of akita had
concluded. In such a situation I would expect an incremental to complete
with very few files. However, this is not what happened. The incremental
was upgraded to a full because 'no suitable full backup was found'.
The backup level to run the new backup with is determined when the job
gets put into the work queue, at which point the previous job was still
running, thus there really was no suitable full backup.
To avoid this can be very easy -- if the available controls for job
concurrency are used. See Allow Duplicate Jobs settings and related.
This
would be reasonable, except such full backup literally just finished.
Worse, the upgraded incremental had a number of VSS writer errors that
resulted in a backup that was incomplete. The first, correct full backup
clocked in at 1.3TB in size. The erroneously upgraded incremental backup
was about 300GB. Even worse, the second full finished with a T status,
If individual files or anything can not be backed up, Bacula does not
make this a job failure. The "with warnings" note at the Termination
code is quite relevenat. Also, not that the actual number of failures
reported by the FD is in the Job report. This is quite important
information that unfortunately does not make it into the single letter
code but can easily be derived from the catalog, the job report, or log
files.
indicating a successful backup. If I hadn't known better, I would have
thought that this was a good backup based on the status.
What on earth is going on here? Why didn't bacula recognize the
previously ran full backup as valid? Any idea why many of the VSS
writers failed? It's particularly concerning that the job finished with
a T status despite so many VSS writers failing,
VSS is not something the FD controls. I would recommend checking
Windows' logs for anything interesting.
Also, as you already concluded, Accurate mode is a pretty useful thing,
although it does have its drawbacks.
...
I do wonder if the whole problem was kicked off
because bacula didn't have time to properly quantify the client's backup
state before the scheduled incremental backup started,
I doubt that.
but I can't
imagine how to avoid such a problem occurring again if a backup runs
long enough. I have had backups run for WEEKS and haven't had this
problem before, on a linux system.
Duplicate job control, as I mentioned above.
...
I note that many of the items listed as having issues are in c:\backups
regarding a files and folders dump from a previous windows machine
(minerva-w10). I briefly wondered if maybe the issue is because there is
a parallel windows system in that folder, but I see also that it failed
to back up an mrimg image file containing that same system. the system
should certainly be unaware of the contents of the image file.
May be worth checking ACLs and the account your file daemon runs under;
Windows can be very particular and tricky in this respect (I'm currently
working with a few Windows admins who are trying to get their Windows
user and permission setup sorted out because it has developed into
something rather messy, and seeing some serious experts being scared of
Windows' way to inherit, override, modify and otherwise work with its
complex account and permissions system is interesting).
One thing I found helpful diagnosing permissions issues on Windows was
to add a Run Script which calls
whoami /all to a job so you see the effective permissions of the running
FD in a job report.
There
were also issues with a number of other, unrelated files and folders,
including c:\games and c:\boot. Of course, the first full backup had no
issues with any of this.
I would welcome input from anyone with knowledge about this strange
behavior. Thank you in advance.
You're welcome :-)
Although I think only one issue is strange here, and that is all
Windows, so even the term "strange" could be disputed ;-)
Cheers,
Arno
Regards,
Robert Gerber
402-237-8692
r...@craeon.net <mailto:r...@craeon.net>
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users
--
Arno Lehmann
IT-Service Lehmann
Sandstr. 6, 49080 Osnabrück
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users