Hi all,

I had a longtime running bacula 1.38.11 environment running on FreeBSD  
hosts that worked flawlessly.  Recently I had a drive failure on one  
of the clients, rebuilt it from scratch and have been unable to get a  
successful backup since.

Ironically, I started having issues with the server and decided it was  
time for a fresh reinstall of bacula, including an upgrade to 2.2.7 on  
all servers and clients.  This was a fresh install, new database and  
everything.

The client in question has 3 separate backup jobs assigned to it.  2  
of 3 work flawlessly.  The third, also the largest, appears to run  
successfully until the very end when a 'status director' spits out the  
following:

*st dir
ducati-dir Version: 2.2.7 (24 December 2007) i386-unknown-freebsd6.2  
freebsd 6.2-RELEASE-p8
Daemon started 03-Jan-08 15:45, 10 Jobs run since started.
  Heap: heap=1,126,400 smbytes=211,929 max_bytes=212,400 bufs=1,013  
max_bufs=1,020

Scheduled Jobs:
Level          Type     Pri  Scheduled          Name                
Volume
= 
= 
= 
= 
= 
= 
= 
= 
= 
= 
= 
========================================================================
Incremental    Backup    10  09-Jan-08 01:05    Ducati Root         
*unknown*
Incremental    Backup    10  09-Jan-08 01:05    Benelli DB          
*unknown*
Incremental    Backup    10  09-Jan-08 01:05    Benelli Root        
*unknown*
Incremental    Backup    10  09-Jan-08 01:05    Astoria Web         
*unknown*
Incremental    Backup    10  09-Jan-08 01:05    Astoria IMAP        
*unknown*
Incremental    Backup    10  09-Jan-08 01:05    Astoria Root        
*unknown*
Full           Backup    11  09-Jan-08 01:15    BackupCatalog       
*unknown*
====

Running Jobs:
  JobId Level   Name                       Status
======================================================================
     43 Full    BackupCatalog.2008-01-04_01.15.09 is waiting for  
higher priority jobs to finish
     47 Full    Astoria_Web.2008-01-05_01.05.13 has terminated
     ...
====

Terminated Jobs:
  JobId  Level    Files      Bytes   Status   Finished        Name
====================================================================
     37  Incr      1,204    223.7 M  OK       04-Jan-08 01:06  
Ducati_Root
     38  Incr         12    4.120 M  OK       04-Jan-08 01:07  
Astoria_Root
     41  Incr          6    13.30 K  OK       04-Jan-08 01:07  
Benelli_Root
     39  Incr        250    47.12 M  OK       04-Jan-08 01:09  
Astoria_IMAP
     42  Incr          0         0   OK       04-Jan-08 01:09 Benelli_DB
     40  Full     44,265    3.425 G  Cancel   06-Jan-08 11:22  
Astoria_Web
     44  Incr      1,110    203.9 M  OK       06-Jan-08 11:24  
Ducati_Root
     45  Incr         71    4.791 M  OK       06-Jan-08 11:25  
Astoria_Root
     46  Incr        182    45.71 M  OK       06-Jan-08 11:26  
Astoria_IMAP
     48  Incr          6    13.53 K  OK       06-Jan-08 11:26  
Benelli_Root

====

Please notice under the Running Jobs: section, the "Astoria_Web" Job  
is indicating "terminated."

Now, querying the status of the client itself:

*st client=astoria-fd
Connecting to Client astoria-fd at astoria.monville.net:9102

astoria-fd Version: 2.2.7 (24 December 2007)  i386-unknown-freebsd6.2  
freebsd 6.2-RELEASE-p9
Daemon started 06-Jan-08 11:24, 2 Jobs run since started.
  Heap: heap=815,104 smbytes=145,440 max_bytes=484,217 bufs=102  
max_bufs=168
  Sizeof: boffset_t=8 size_t=4 debug=0 trace=0

Running Jobs:
JobId 47 Job Astoria_Web.2008-01-05_01.05.13 is running.
     Backup Job started: 06-Jan-08 11:28
     Files=44,934 Bytes=3,426,818,210 Bytes/sec=18,871 Errors=0
     Files Examined=44,934
     Processing file: /wwwroot
     SDReadSeqNo=9 fd=5
Director connected at: 08-Jan-08 13:54
====

Terminated Jobs:
  JobId  Level    Files      Bytes   Status   Finished        Name
======================================================================
     45  Incr         71    4.791 M  OK       06-Jan-08 11:26  
Astoria_Root
     46  Incr        182    45.71 M  OK       06-Jan-08 11:28  
Astoria_IMAP
====

The client appears to believe that the job is still running, but  
always fails while processing file "/wwwroot" even though that is the  
only filesystem in the FileSet. (And I believe everything in it has  
already been backed up.)

The client lives outside my firewall and I am using TLS encryption,  
although I have tested with it disabled.  I currently have the  
"Heartbeat Interval" set to 60 seconds on both the client and the  
storage daemon.

The only way to "free" up the storage daemon so other jobs can run is  
to restart the client fd, then cancel the job at the director, but  
then the next time it runs it does another Full backup, upgraded from  
Incr.

I'm at my wits end as to why this is failing in this manner and am  
hoping someone can point me in the right direction.

Thanks,
Ken

--
Ken Monville
[EMAIL PROTECTED]




-------------------------------------------------------------------------
Check out the new SourceForge.net Marketplace.
It's the best place to buy or sell services for
just about anything Open Source.
http://ad.doubleclick.net/clk;164216239;13503038;w?http://sf.net/marketplace
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

Reply via email to