Hi,

08.08.2007 05:22,, Robert LeBlanc wrote::
> 
> 
> On 8/7/07 5:32 PM, "Charles Sprickman" <[EMAIL PROTECTED]> wrote:
> 
>> Hi all,
>>
>> I'm having some trouble figuring out how to "catch up" when someone has
>> forgotten to put a tape in or if I manually schedule a job that requires a
>> different pool than what is in the tape.
>>
>> I think a real-world example is in order.  My fulls are on the first
>> weekend of the month, diffs each subsequent weekend, then incrementals on
>> weekdays.  No one is in the office sat/sun to change tapes.
>>
>> This past weekend I mistakenly asked for a tape from the weekly pool to be
>> inserted.  Unfortunately, I had forgotten this was a new month.  So on
>> Sunday afternoon when bacula was going to do a run, it wanted to do a Full
>> and it wanted a tape from the Monthly pool.  No one was around, so the
>> jobs did not start.
>>
>> Monday I asked for someone to put in the next Monthly tape, but then that
>> night bacula wanted a Daily.
>>
>> This is where I get confused.  If a job fails simply due to the wrong
>> tape, how do I make bacula re-run the job and run it to the appropriate
>> pool?  If I let this slide, is bacula simply going to wait until the first
>> weekend in September to do a full run?  I'd really like to get one in
>> ASAP.
> 
> I've just run the jobs manually and modified the job to be the right level
> and the right pool. Kind of a pain sometimes when a lot of jobs fail (we
> have almost 30 clients). I would be interested in a batch restart too.

Usually, Bacula simply waits until it can run the stalled jobs. There 
is a hard-coded maximum wait time, though, and a high number of 
failing mounts will also end the job with an error.

What this means is that, unless you configured things like maximum 
wait or run times or automatic polling / mounting, you can load the 
right tape and Bacula will run the jobs that are held.

If you configured Bacula to fail stalled jobs after some time, the 
automatic rescheduling and "Rerun Failed Levels" can help a lot in 
your scenario. If you can't run full backups during the week, it's 
better to simply skip missed jobs IMO.

>> This sort of mishandling of tapes will likely not be a one-time occurence,
>> plus there's issues of people going on vacation and similar where there
>> will be no operator on site to swap tapes.  How do other people deal with
>> this?  What happens to these failed jobs in the catalog?  Should they be
>> deleted?  Is there a way to reschedule them all?

There are no means to retry a bunch of failed jobs, but it would be 
possible by scripting some console commands... like get a list of all 
failed jobs, take the highest backup level for each job, and feed the 
commands to run these jobs to bconsole.

The better solution, IMO, is to define your backup procedure in a way 
that prevents these scenarios. Necessary tape changes during the 
weekend are a problem best solved by investing into an autochanger, 
though.

If that's not an option, I prefer to have the days that require tape 
changing to be Tuesdays and Wednesdays - experience tells me that 
these are the days where it's least likely that no operator is present.

Of course, having a single sheet of instructions for your operators is 
helpful, so they can easily look up which tapes (from which pool) to 
load on which day, and so on.

I have a more or less sophisticated perl script that decides which 
tapes need to be unloaded and inserted and which can be used to 
produce clear instructions for the operators, but it's only really 
usable for autochangers. And it's still missing some important features...

Anyway, my personal impression is that these problems are typically 
ones that should be tackled by writing the necessary operations 
manual. All attempts I know to solve these issues by complex 
configurations tend to break in some cases, and require intervention 
by an administrator anyway.

>> Another thing that I have not figured out is how to see what bacula thinks
>> it's next run will be (what hosts, what level, what pool).  I'd like to
>> know this for troubleshooting purposes as well as to try and script
>> something to give people an advance warning about what tape should be in
>> the drive each night.
> 
> You can do a status on the director and it will tell you all that info in
> the top portion of the screen except the pool (it does tell you the tape it
> thinks it will use which can change if the tape fills up)
> 
>> And lastly, any plans to have the spool act like it does in Amanda?
>> Meaning that if you have the space and you don't have the right tape in,
>> bacula will spool all the jobs until the right tape ends up in the drive.
>> Or perhaps it is possible in some way that I'm not seeing.
> 
> That is a cool feature that would be pretty nifty. We have a changer so it's
> not as big a del, but it sounds like a great feature

I would recommend to investigate if migration is the right approach 
for you, then - set up a disk to disk to tape backup.

In the first pass, backup from your clients to the disk based volumes, 
and then migrate these backups to tape later. The migration can be 
done during the work hours, so that an operator can easily change 
tapes or ask for help.

In this case, you'd need the disk space to hold all or most of your 
backups, though.

>> Any help is appreciated, we're very happy so far with bacula but for this
>> little issue of our sneakernet changer not being 100% reliable. :)

Well, my usual advice is to get most of your backups run automatically 
without intervention, and don't use too much time and effort to get 
the remaining problems solved by Bacula but rather by educating your 
operators :-)

Arno

>> Thanks,
>>
>> Charles
>>
>> -------------------------------------------------------------------------
>> This SF.net email is sponsored by: Splunk Inc.
>> Still grepping through log files to find problems?  Stop.
>> Now Search log events and configuration files using AJAX and a browser.
>> Download your FREE copy of Splunk now >>  http://get.splunk.com/
>> _______________________________________________
>> Bacula-users mailing list
>> Bacula-users@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/bacula-users
>>
> 
>  
> Robert LeBlanc
> College of Life Sciences Computer Support
> Brigham Young University
> [EMAIL PROTECTED]
> (801)422-1882
> 
> 
> 
> -------------------------------------------------------------------------
> This SF.net email is sponsored by: Splunk Inc.
> Still grepping through log files to find problems?  Stop.
> Now Search log events and configuration files using AJAX and a browser.
> Download your FREE copy of Splunk now >>  http://get.splunk.com/
> _______________________________________________
> Bacula-users mailing list
> Bacula-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/bacula-users

-- 
Arno Lehmann
IT-Service Lehmann
www.its-lehmann.de

-------------------------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >>  http://get.splunk.com/
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

Reply via email to