On Mon, Jun 5, 2023 at 3:07 AM Nick Holland <n...@holland-consulting.net> wrote:
>
> =-------------------------
> PREVIOUS=(find previous backup)
> TODAY=(today's date)
> OLDEST=(find oldest backup in the set)
> REMOTE=(machine we are backing up)
>
> # remove oldest backup
> rm -r $OLDEST &
>
> mkdir $TODAY
>
> # make new backup
> rsync --link-dest $PREVIOUS $REMOTE $TODAY
> =-------------------------
>
> [REDACTED]
>
> Here's where it gets weird -- removing the '&' after the rm -r $OLDEST
> line seems to have FIXED THE PROBLEM. No problems in 18 days, which is
> a pretty good record.
>
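For reference, here is roughly what the quoted script looks like with the
'&' dropped. This is only a sketch: the function name rotate_backup, its
four arguments, the RSYNC override and the -a flag are my own placeholders,
not anything taken from the real script, and the "find previous/oldest
backup" steps are left to the caller because the original elides them.

```shell
#!/bin/sh
# Sketch only. rotate_backup and its arguments are hypothetical stand-ins
# for the PREVIOUS/TODAY/OLDEST/REMOTE placeholders in the quoted script.
# RSYNC may be overridden in the environment (an assumption, handy for
# dry runs); it defaults to plain "rsync".

rotate_backup() {
    previous=$1   # most recent existing backup (absolute path is safest)
    today=$2      # directory to create for the new backup
    oldest=$3     # backup directory to discard
    remote=$4     # rsync source, e.g. host:/path/

    # Remove the oldest backup in the foreground: with no trailing '&'
    # the shell does not move on until rm has finished.
    rm -r -- "$oldest" || return 1

    mkdir -- "$today" || return 1

    # Files unchanged since the previous backup are hard-linked to it
    # instead of being copied again. The -a flag is an assumption.
    "${RSYNC:-rsync}" -a --link-dest="$previous" "$remote" "$today"
}
```

Run this way, the delete and the rsync never overlap, most likely at the
cost of a longer total run time.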
Just spitballing here... you were running the removal of the oldest
backup in the background while bringing in new data for the new one.
Maybe it was hitting an I/O ceiling of some kind under those conditions?
It may still warrant investigation, since it could hit this "ceiling"
again under a big enough I/O load.

-- 
Aaron Mason - Programmer, open source addict
I've taken my software vows - for beta or for worse