Hi,

I added "Client Run Before/After Job" scripts to two backup jobs in
order to stop/start mysql.  But the "After" script seems to hang the
bacula-fd. Using 'status dir', the bacula console shows a terminated
status for these jobs, and the command 'list jobs' shows their status as
'R'.

The script being called is the FreeBSD rc script used to start/stop
mysql: /usr/local/etc/rc.d/mysql-server [start|stop].  I've ensured that
this script works when run manually, and that it returns 0.

When I restart the offending bacula-fd's (kill doesn't work; requires
kill -9), the offending jobs finish with errors and the rest of my
queued jobs begin to run.

Any ideas what's causing this?  Suggestions for fixing it?

Here are some details:

Director OS: FreeBSD 6.2-RELEASE
File Daemon OS: FreeBSD 6.1-RELEASE
bacula-dir/sd: 2.2.4
bacula-fd: 2.2.4
catalog: MySQL 5.0

Total # of jobs: 13, of which...
   7 "Priority = 10"
   5 "Priority = 12"
   1 "Priority = 20"

bacula-dir.conf: Director{Maximum Concurrent Jobs=10;...}
bacula-sd.conf: Storage{Maximum Concurrent Jobs=20;...}

Offending Client Run Before/After Job scripts:
  Client Run Before Job = "/usr/local/etc/rc.d/mysql-server stop"
  Client Run After Job = "/usr/local/etc/rc.d/mysql-server start"

logged output on the director while job is hung:
   zeus-dir: sql_find.c:134 No Job record found: ERR=
   CMD=SELECT StartTime FROM Job WHERE JobStatus='T' AND Type='B' AND Level='F' 
AND Name='ritin' AND ClientId=5 AND FileSetId=5 ORDER BY StartTime DESC LIMIT 1
   zeus-dir: No prior or suitable Full backup found in catalog. Doing FULL 
backup.
   zeus-dir: Start Backup JobId 31, Job=ritin.2007-10-30_05.05.04
   zeus-dir: Created new Volume "ritin-Full-0002" in catalog.
   zeus-dir: Using Device "ritinFileStorage"
   ritin-fd: ClientRunBeforeJob: run command "/usr/local/etc/rc.d/mysql-server 
stop"
   ritin-fd: ClientRunBeforeJob: Stopping mysql.
   ritin-fd: ClientRunBeforeJob: Waiting for PIDS: 96261.
   zeus-sd: Labeled new Volume "ritin-Full-0002" on device "ritinFileStorage" 
(/bacula/disk2).
   zeus-sd: Wrote label to prelabeled Volume "ritin-Full-0002" on device 
"ritinFileStorage" (/bacula/disk2)
   zeus-dir: Max Volume jobs exceeded. Marking Volume "ritin-Full-0002" as Used.
   ritin-fd:      Disallowed filesystem. Will not descend from / into /dev
   zeus-sd: Job write elapsed time = 00:57:03, Transfer rate = 9.499 M 
bytes/second
   ritin-fd: ClientAfterJob: run command "/usr/local/etc/rc.d/mysql-server 
start"
   ritin-fd: ClientAfterJob: Starting mysql.


'status dir' output while job on 'ritin' is hung (truncated):

Running Jobs:
 JobId Level   Name                       Status
======================================================================
    31 Full    ritin.2007-10-30_05.05.04 has terminated
    34 Full    comdev.2007-10-30_05.05.07 is waiting for higher priority jobs 
to finish
    35 Full    comstag.2007-10-30_05.05.08 is waiting execution


Dane
-- 
Dane Miller
Systems Administrator
Great Schools, Inc
http://greatschools.net


-------------------------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

Reply via email to