On Jan 8, 2008, at 2:36 PM, Dan Langille wrote:
>
> On Jan 8, 2008, at 2:30 PM, Ken Monville wrote:
>> On Jan 8, 2008, at 2:12 PM, Dan Langille wrote:
>>
>>> Ken Monville wrote:
>>>> Hi all,
>>>> I had a longtime running bacula 1.38.11 environment running on
>>>> FreeBSD hosts that worked flawlessly. Recently I had a drive
>>>> failure on one of the clients, rebuilt it from scratch and have
>>>> been unable to get a successful backup since.
>>>> Ironically, I started having issues with the server and decided it
>>>> was time for a fresh reinstall of bacula, including an upgrade to
>>>> 2.2.7 on all servers and clients. This was a fresh install, new
>>>> database and everything.
>>>> The client in question has 3 separate backup jobs assigned to it.
>>>> 2 of 3 work flawlessly. The third, also the largest, appears to
>>>> run successfully until the very end when a 'status director' spits
>>>> out the following:
>>>> *st dir
>>>> ducati-dir Version: 2.2.7 (24 December 2007) i386-unknown-freebsd6.2 freebsd 6.2-RELEASE-p8
>>>> Daemon started 03-Jan-08 15:45, 10 Jobs run since started.
>>>> Heap: heap=1,126,400 smbytes=211,929 max_bytes=212,400 bufs=1,013 max_bufs=1,020
>>>> Scheduled Jobs:
>>>> Level        Type    Pri  Scheduled        Name           Volume
>>>> ===================================================================
>>>> Incremental  Backup   10  09-Jan-08 01:05  Ducati Root    *unknown*
>>>> Incremental  Backup   10  09-Jan-08 01:05  Benelli DB     *unknown*
>>>> Incremental  Backup   10  09-Jan-08 01:05  Benelli Root   *unknown*
>>>> Incremental  Backup   10  09-Jan-08 01:05  Astoria Web    *unknown*
>>>> Incremental  Backup   10  09-Jan-08 01:05  Astoria IMAP   *unknown*
>>>> Incremental  Backup   10  09-Jan-08 01:05  Astoria Root   *unknown*
>>>> Full         Backup   11  09-Jan-08 01:15  BackupCatalog  *unknown*
>>>> ====
>>>> Running Jobs:
>>>> JobId  Level  Name                               Status
>>>> ===================================================================
>>>>    43  Full   BackupCatalog.2008-01-04_01.15.09  is waiting for higher priority jobs to finish
>>>>    47  Full   Astoria_Web.2008-01-05_01.05.13    has terminated
>>>> ...
>>>> ====
>>>> Terminated Jobs:
>>>> JobId  Level   Files  Bytes    Status  Finished         Name
>>>> ===================================================================
>>>>    37  Incr    1,204  223.7 M  OK      04-Jan-08 01:06  Ducati_Root
>>>>    38  Incr       12  4.120 M  OK      04-Jan-08 01:07  Astoria_Root
>>>>    41  Incr        6  13.30 K  OK      04-Jan-08 01:07  Benelli_Root
>>>>    39  Incr      250  47.12 M  OK      04-Jan-08 01:09  Astoria_IMAP
>>>>    42  Incr        0  0        OK      04-Jan-08 01:09  Benelli_DB
>>>>    40  Full   44,265  3.425 G  Cancel  06-Jan-08 11:22  Astoria_Web
>>>>    44  Incr    1,110  203.9 M  OK      06-Jan-08 11:24  Ducati_Root
>>>>    45  Incr       71  4.791 M  OK      06-Jan-08 11:25  Astoria_Root
>>>>    46  Incr      182  45.71 M  OK      06-Jan-08 11:26  Astoria_IMAP
>>>>    48  Incr        6  13.53 K  OK      06-Jan-08 11:26  Benelli_Root
>>>> ====
>>>> Please notice under the Running Jobs: section, the "Astoria_Web"
>>>> Job is indicating "terminated."
>>>> Now, querying the status of the client itself:
>>>> *st client=astoria-fd
>>>> Connecting to Client astoria-fd at astoria.monville.net:9102
>>>> astoria-fd Version: 2.2.7 (24 December 2007) i386-unknown-freebsd6.2 freebsd 6.2-RELEASE-p9
>>>> Daemon started 06-Jan-08 11:24, 2 Jobs run since started.
>>>> Heap: heap=815,104 smbytes=145,440 max_bytes=484,217 bufs=102 max_bufs=168
>>>> Sizeof: boffset_t=8 size_t=4 debug=0 trace=0
>>>> Running Jobs:
>>>> JobId 47 Job Astoria_Web.2008-01-05_01.05.13 is running.
>>>>     Backup Job started: 06-Jan-08 11:28
>>>>     Files=44,934 Bytes=3,426,818,210 Bytes/sec=18,871 Errors=0
>>>>     Files Examined=44,934
>>>>     Processing file: /wwwroot
>>>>     SDReadSeqNo=9 fd=5
>>>> Director connected at: 08-Jan-08 13:54
>>>> ====
>>>> Terminated Jobs:
>>>> JobId  Level   Files  Bytes    Status  Finished         Name
>>>> ===================================================================
>>>>    45  Incr       71  4.791 M  OK      06-Jan-08 11:26  Astoria_Root
>>>>    46  Incr      182  45.71 M  OK      06-Jan-08 11:28  Astoria_IMAP
>>>> ====
>>>> The client appears to believe that the job is still running, but
>>>> always fails while processing file "/wwwroot" even though that is
>>>> the only filesystem in the FileSet. (And I believe everything in
>>>> it has already been backed up.)
>>>
>>> I suspect the job has finished backing up, and is now spooling file
>>> attributes to the database. Check your database server load to
>>> verify.
>>>
>>> Are your jobs set up to spool file attributes? If so, this occurs
>>> at the end of the job.
>>>
>>>> The client lives outside my firewall and I am using TLS
>>>> encryption, although I have tested with it disabled. I currently
>>>> have the "Heartbeat Interval" set to 60 seconds on both the client
>>>> and the storage daemon.
>>>> The only way to "free" up the storage daemon so other jobs can run
>>>> is to restart the client fd, then cancel the job at the director,
>>>> but then the next time it runs it does another Full backup,
>>>> upgraded from Incr.
>>>> I'm at my wits end as to why this is failing in this manner and am
>>>> hoping someone can point me in the right direction.
>>>
>>> Perhaps nothing is wrong at all. :)
>
>> Hi Dan,
>>
>> Double checking my configuration, there is no "Spool Attributes"
>> parameter set anywhere, so I assume its off. The database server has
>> virtually no load and I don't notice anything when the job is hung.
>> Also, it stays in this state indefinitely until I restart the fd as I
>> mentioned in this first post...
>>
>> Thanks again,
>> Ken
>
> Is spooling mentioned at all. If so, spooling of file attributes
> will be done.
>
Nope, no spooling is enabled. I am backing up to disk, so I didn't
enable spooling.

> http://www.bacula.org/rel-manual/Data_Spooling.html
>
> Do you have the emailed job report from this job? If so, please
> paste.
>

05-Jan 01:05 ducati-dir JobId 47: No prior Full backup Job record found.
05-Jan 01:05 ducati-dir JobId 47: No prior or suitable Full backup found in catalog. Doing FULL backup.
06-Jan 11:27 ducati-dir JobId 47: Start Backup JobId 47, Job=Astoria_Web.2008-01-05_01.05.13
06-Jan 11:27 ducati-dir JobId 47: Created new Volume "Full-0012" in catalog.
06-Jan 11:27 ducati-dir JobId 47: Using Device "FileStorage"
06-Jan 11:28 astoria-fd JobId 47: ClientBeforeJob: run command "/usr/local/etc/rc.d/mysql stop"
06-Jan 11:28 astoria-fd JobId 47: ClientBeforeJob: mysqld
06-Jan 11:27 ducati-sd JobId 47: Labeled new Volume "Full-0012" on device "FileStorage" (/bacula).
06-Jan 11:27 ducati-sd JobId 47: Wrote label to prelabeled Volume "Full-0012" on device "FileStorage" (/bacula)
06-Jan 13:36 ducati-sd JobId 47: Job write elapsed time = 02:09:11, Transfer rate = 442.9 K bytes/second
06-Jan 13:37 astoria-fd JobId 47: ClientAfterJob: run command "/usr/local/etc/rc.d/mysql start"
08-Jan 14:09 ducati-dir JobId 47: Fatal error: Network error with FD during Backup: ERR=Broken pipe
08-Jan 14:09 ducati-dir JobId 47: Error: Bacula ducati-dir 2.2.7 (24Dec07): 08-Jan-2008 14:09:54
  Build OS:               i386-unknown-freebsd6.2 freebsd 6.2-RELEASE-p8
  JobId:                  47
  Job:                    Astoria_Web.2008-01-05_01.05.13
  Backup Level:           Full (upgraded from Incremental)
  Client:                 "astoria-fd" 2.2.7 (24Dec07) i386-unknown-freebsd6.2,freebsd,6.2-RELEASE-p9
  FileSet:                "Astoria Web" 2008-01-01 10:39:25
  Pool:                   "Full-Pool" (From Job FullPool override)
  Storage:                "OutsideFile" (From Job resource)
  Scheduled time:         05-Jan-2008 01:05:00
  Start time:             06-Jan-2008 11:27:08
  End time:               08-Jan-2008 14:09:54
  Elapsed time:           2 days 2 hours 42 mins 46 secs
  Priority:               10
  FD Files Written:       44,934
  SD Files Written:       44,934
  FD Bytes Written:       3,426,818,210 (3.426 GB)
  SD Bytes Written:       3,432,986,097 (3.432 GB)
  Rate:                   18.8 KB/s
  Software Compression:   None
  VSS:                    no
  Encryption:             no
  Volume name(s):         Full-0012
  Volume Session Id:      11
  Volume Session Time:    1199393132
  Last Volume Bytes:      3,436,447,127 (3.436 GB)
  Non-fatal FD errors:    0
  SD Errors:              0
  FD termination status:  Error
  SD termination status:  OK
  Termination:            *** Backup Error ***

When I saw the "Broken Pipe" error, I searched the archives and came up
with the "Heartbeat Interval" setting, which has had no effect.

Ken

--
Ken Monville
[EMAIL PROTECTED]

_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users