Hi, 08.08.2007 05:22,, Robert LeBlanc wrote:: > > > On 8/7/07 5:32 PM, "Charles Sprickman" <[EMAIL PROTECTED]> wrote: > >> Hi all, >> >> I'm having some trouble figuring out how to "catch up" when someone has >> forgotten to put a tape in or if I manually schedule a job that requires a >> different pool than what is in the tape. >> >> I think a real-world example is in order. My fulls are on the first >> weekend of the month, diffs each subsequent weekend, then incrementals on >> weekdays. No one is in the office sat/sun to change tapes. >> >> This past weekend I mistakenly asked for a tape from the weekly pool to be >> inserted. Unfortunately, I had forgotten this was a new month. So on >> Sunday afternoon when bacula was going to do a run, it wanted to do a Full >> and it wanted a tape from the Monthly pool. No one was around, so the >> jobs did not start. >> >> Monday I asked for someone to put in the next Monthly tape, but then that >> night bacula wanted a Daily. >> >> This is where I get confused. If a job fails simply due to the wrong >> tape, how do I make bacula re-run the job and run it to the appropriate >> pool? If I let this slide, is bacula simply going to wait until the first >> weekend in September to do a full run? I'd really like to get one in >> ASAP. > > I've just run the jobs manually and modified the job to be the right level > and the right pool. Kind of a pain sometimes when a lot of jobs fail (we > have almost 30 clients). I would be interested in a batch restart too.
Usually, Bacula simply waits until it can run the stalled jobs. There is a hard-coded maximum wait time, though, and a high number of failing mounts will also end the job with an error. What this means is that, unless you configured things like maximum wait or run times or automatic polling / mounting, you can load the right tape and Bacula will run the jobs that are held. If you configured Bacula to fail stalled jobs after some time, the automatic rescheduling and "Rerun Failed Levels" can help a lot in your scenario. If you can't run full backups during the week, it's better to simply skip missed jobs IMO. >> This sort of mishandling of tapes will likely not be a one-time occurence, >> plus there's issues of people going on vacation and similar where there >> will be no operator on site to swap tapes. How do other people deal with >> this? What happens to these failed jobs in the catalog? Should they be >> deleted? Is there a way to reschedule them all? There are no means to retry a bunch of failed jobs, but it would be possible by scripting some console commands... like get a list of all failed jobs, take the highest backup level for each job, and feed the commands to run these jobs to bconsole. The better solution, IMO, is to define your backup procedure in a way that prevents these scenarios. Necessary tape changes during the weekend are a problem best solved by investing into an autochanger, though. If that's not an option, I prefer to have the days that require tape changing to be Tuesdays and Wednesdays - experience tells me that these are the days where it's least likely that no operator is present. Of course, having a single sheet of instructions for your operators is helpful, so they can easily look up which tapes (from which pool) to load on which day, and so on. I have a more or less sophisticated perl script that decides which tapes need to be unloaded and inserted and which can be used to produce clear instructions for the operators, but it's only really usable for autochangers. And it's still missing some important features... Anyway, my personal impression is that these problems are typically ones that should be tackled by writing the necessary operations manual. All attempts I know to solve these issues by complex configurations tend to break in some cases, and require intervention by an administrator anyway. >> Another thing that I have not figured out is how to see what bacula thinks >> it's next run will be (what hosts, what level, what pool). I'd like to >> know this for troubleshooting purposes as well as to try and script >> something to give people an advance warning about what tape should be in >> the drive each night. > > You can do a status on the director and it will tell you all that info in > the top portion of the screen except the pool (it does tell you the tape it > thinks it will use which can change if the tape fills up) > >> And lastly, any plans to have the spool act like it does in Amanda? >> Meaning that if you have the space and you don't have the right tape in, >> bacula will spool all the jobs until the right tape ends up in the drive. >> Or perhaps it is possible in some way that I'm not seeing. > > That is a cool feature that would be pretty nifty. We have a changer so it's > not as big a del, but it sounds like a great feature I would recommend to investigate if migration is the right approach for you, then - set up a disk to disk to tape backup. In the first pass, backup from your clients to the disk based volumes, and then migrate these backups to tape later. The migration can be done during the work hours, so that an operator can easily change tapes or ask for help. In this case, you'd need the disk space to hold all or most of your backups, though. >> Any help is appreciated, we're very happy so far with bacula but for this >> little issue of our sneakernet changer not being 100% reliable. :) Well, my usual advice is to get most of your backups run automatically without intervention, and don't use too much time and effort to get the remaining problems solved by Bacula but rather by educating your operators :-) Arno >> Thanks, >> >> Charles >> >> ------------------------------------------------------------------------- >> This SF.net email is sponsored by: Splunk Inc. >> Still grepping through log files to find problems? Stop. >> Now Search log events and configuration files using AJAX and a browser. >> Download your FREE copy of Splunk now >> http://get.splunk.com/ >> _______________________________________________ >> Bacula-users mailing list >> Bacula-users@lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/bacula-users >> > > > Robert LeBlanc > College of Life Sciences Computer Support > Brigham Young University > [EMAIL PROTECTED] > (801)422-1882 > > > > ------------------------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. > Still grepping through log files to find problems? Stop. > Now Search log events and configuration files using AJAX and a browser. > Download your FREE copy of Splunk now >> http://get.splunk.com/ > _______________________________________________ > Bacula-users mailing list > Bacula-users@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/bacula-users -- Arno Lehmann IT-Service Lehmann www.its-lehmann.de ------------------------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ _______________________________________________ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users