Hi all, I'm currently backing up mostly remote hosts. Generally our connectivity is good, but sometimes it's not. Especially at night, I find that our backup window coincides with someone's maintenance window.
How does Bacula deal with this? It didn't look like it retried the job. That wouldn't really bother me on a daily incremental run, but a full run needs to happen overnight or over the weekend, when no one is in the office. Putting the job off until the next run, and hoping that someone will remove the daily tape and put in a tape from the full backup pool, could be problematic. What's the best way to deal with this?

And on a slightly different topic: currently I have one last full job that needs to run before tomorrow. I scheduled it manually after it failed two nights in a row, but it never seems to actually kick off. I get the following messages:

15-Jul 22:48 devel2-dir: No prior Full backup Job record found.
15-Jul 22:48 devel2-dir: No prior or suitable Full backup found in catalog. Doing FULL backup.
15-Jul 22:48 devel2-dir: Start Backup JobId 165, Job=b02.2007-07-15_22.48.33
15-Jul 22:48 devel2-sd: Volume "Monthly0001" previously written, moving to end of data.

And then it sits there, for hours. I see no traffic to the client during this time, no sign of the spool file being created, and no real db activity. I can't look at the tape status, since the SD seems to block "mt" from talking to the drive. This is an LTO-2 drive, so I wouldn't expect it to take hours to find the last file on the tape...

If I look at the status of the director, I see the job listed as "running":

Running Jobs:
 JobId Level   Name                       Status
======================================================================
   165 Full    b02.2007-07-15_22.48.33 is running
====

However, if I do "status storage", it sits there and then eventually returns this error:

----
Connecting to Storage daemon Tape at devel2:9103
Failed to connect to Storage daemon Tape.
====
You have messages.
*mess
15-Jul 22:44 devel2-dir: *Console*.2007-07-15_22.15.30 Fatal error: authenticate.c:113 Director unable to authenticate with Storage daemon on "devel2:9103". Possible causes:
Passwords or names not the same or
Maximum Concurrent Jobs exceeded on the SD or
SD networking messed up (restart daemon).
Please see http://www.bacula.org/rel-manual/faq.html#AuthorizationErrors for help.
----

I do not run concurrent jobs (though the SD is set to take up to 30), and the password is correct.

After restarting the SD, this is in the message queue:

----
15-Jul 22:44 devel2-dir: *Console*.2007-07-15_22.15.30 Error: bnet.c:439 Write error sending 23 bytes to Storage daemon:devel2.biglist.com:9103: ERR=Broken pipe
15-Jul 22:44 b02-fd: b02.2007-07-15_22.18.52 Fatal error: job.c:1749 Comm error with SD. bad response to Append Data. ERR=Resource temporarily unavailable
----

Any ideas?

Thanks for your patience,
Charles