After some more testing it seems that the backup also stops when only running alone.
I cannot see any pattern now. It seems thats jobs with small amounts and large amounts of data/files have problems running alone or concurrent with other jobs. The client in this job is the same server that are "master-bacula" so the problem have nothing to do with the network. -- 13-Sep 20:20 bacula1-dir: Start Backup JobId xxJob=CLIENT-Job.2007-09-13_20.20.39 13-Sep 20:20 bacula1-dir: There are no more Jobs associated with Volume "CLIENT-Daily-0001". Marking it purged. 13-Sep 20:20 bacula1-dir: All records pruned from Volume "CLIENT-Daily-0001"; marking it "Purged" 13-Sep 20:20 bacula1-dir: Recycled volume "CLIENT-Daily-0001" 13-Sep 20:20 bacula1-dir: Using Device "CLIENT-Device" 13-Sep 20:20 bacula1-sd: Recycled volume "CLIENT-Daily-0001" on device "CLIENT-Device" (/PATH/CLIENT), all previous data lost. 13-Sep 20:20 bacula1-dir: Volume used once. Marking Volume "CLIENT-Daily-0001" as Used. 13-Sep 21:19 bacula1-dir: CLIENT-Job.2007-09-13_20.20.39 Fatal error: Max wait time exceeded. Job canceled. 13-Sep 21:19 bacula1-fd: CLIENT-Job.2007-09-13_20.20.39 Fatal error: backup.c:892 Network send error to SD. ERR=Success 13-Sep 21:19 bacula1-sd: Job CLIENT-Job.2007-09-13_20.20.39 marked to be canceled. 13-Sep 21:19 bacula1-sd: CLIENT-Job.2007-09-13_20.20.39 Fatal error: append.c:259 Network error on data channel. ERR=No data available 13-Sep 21:19 bacula1-sd: Job write elapsed time = 00:58:43, Transfer rate = 2.400 M bytes/second 13-Sep 21:19 bacula1-dir: Bacula bacula1-dir 2.2.3 (09Sep07): 13-Sep-2007 21:19:26 Build OS: x86_64-unknown-linux-gnu redhat JobId: xx Job: CLIENT-Job.2007-09-13_20.20.39 Backup Level: Full Client: "CLIENT" 2.2.3 (09Sep07) x86_64-unknown-linux-gnu,redhat, FileSet: "CLIENT-FileSet" 2007-09-13 06:00:02 Pool: "CLIENT-Pool-Daily" (From Job resource) Storage: "CLIENT-Storage" (From Job resource) Scheduled time: 13-Sep-2007 20:20:37 Start time: 13-Sep-2007 20:20:41 End time: 13-Sep-2007 21:19:26 Elapsed time: 58 mins 45 secs Priority: 10 FD Files Written: 192 SD Files Written: 192 FD Bytes Written: 8,457,397,606 (8.457 GB) SD Bytes Written: 8,457,430,790 (8.457 GB) Rate: 2399.3 KB/s Software Compression: 74.6 % VSS: no Encryption: no Volume name(s): CLIENT-Daily-0001 Volume Session Id: 1 Volume Session Time: 1189666879 Last Volume Bytes: 8,468,272,411 (8.468 GB) Non-fatal FD errors: 0 SD Errors: 0 FD termination status: Canceled SD termination status: Canceled Termination: Backup Canceled -- Regards, Christian Sakshaug Christian Sakshaug skrev: > Hey > > Have tested some more now, it seems that jobs that I was getting error > after "normal schedule" is working when I have a small number of > concurrent jobs running. > > I wonder what have been change since 2.0.3 that could have this affects? > > I have been reading the changelog and documentation and I can not find > anything special except the "critical bug fixed in 2.2.3" that trigger > my upgrade at the first place. > > Actually I have 3 different bacula site-location that have been in used > for some times now without troubles. I have upgrade two of those with > 2.2.3 and I having trouble with both. But with the third (running 2.0.3) > everything is working great/normal. > > > Regards > Christian Sakshaug > > > > Christoff van Zyl skrev: >> On this subject, I have upgraded from 2.0.3 to 2.2.3 and had the following >> problem. >> >> If you start a manual run, the first job start and when you start a second >> run >> the console freeze up for about 2 minutes and then everything is normal >> again, is this normal. >> >> Thanks >> Christoff >> >> >> On Thursday 13 September 2007 08:50:21 Christian Sakshaug wrote: >>> Hey >>> >>> After upgrading to 2.2.3 i'm getting a lot of problems with backup. With >>> 2.0.3 everything worked just great. >>> >>> Have around XX jobs running from around 00:30 to 06:00 to different >>> harddrive devices. The storage is against a enterprise DAS's with a fast >>> frontend server with lots of memory/cpu juice and the network is with >>> 1GBIT enterprise infrastructure all the way to the clients. >>> >>> This settings have been running without (i'm loving bacula) problem now >>> for some years now.. >>> >>> It seems for me that backup is running correctly but when finishing up >>> something happen with the communication to the FD and I think has >>> something do with last "critical patch" released 10 sept. >>> >>> Anyone knows if this is a bug or could be bad configuration (something >>> changes I have to do with the 2.0.3 configuration) ? >>> >>> >>> Regards, Christian >>> >>> >>> -- error message #1 against a local SD -- >>> >>> 13-Sep 06:00 bacula1-dir: Start Backup JobId xx, >>> Job=JOB-NAME.2007-09-13_06.00.03 >>> 13-Sep 06:00 bacula1-dir: Created new Volume "VOLUME-NAME" in catalog. >>> 13-Sep 06:00 bacula1-dir: Using Device "DEVICE-NAME" >>> 13-Sep 06:00 bacula1-sd: Labeled new Volume "VOLUME-NAME" on device >>> "DEVICE-NAME" (PATH/DEVICE-NAME). >>> 13-Sep 06:00 bacula1-sd: Wrote label to prelabeled Volume "VOLUME-NAME" >>> on device "DEVICE-NAME" (PATH/DEVICE-NAME) >>> 13-Sep 06:00 bacula1-dir: Volume used once. Marking Volume "VOLUME-NAME" >>> as Used. >>> 13-Sep 06:59 bacula1-dir: JOB-NAME.2007-09-13_06.00.03 Fatal error: Max >>> wait time exceeded. Job canceled. >>> 13-Sep 06:59 bacula1-fd: JOB-NAME.2007-09-13_06.00.03 Fatal error: >>> backup.c:892 Network send error to SD. ERR=Success >>> 13-Sep 06:59 bacula1-sd: Job JOB-NAME.2007-09-13_06.00.03 marked to be >>> canceled. >>> 13-Sep 06:59 bacula1-sd: JOB-NAME.2007-09-13_06.00.03 Fatal error: >>> append.c:259 Network error on data channel. ERR=No data available >>> 13-Sep 06:59 bacula1-sd: Job write elapsed time = 00:59:20, Transfer >>> rate = 2.846 M bytes/second >>> 13-Sep 06:59 bacula1-dir: Bacula bacula1-dir 2.2.3 (09Sep07): >>> 13-Sep-2007 06:59:24 >>> Build OS: x86_64-unknown-linux-gnu redhat >>> JobId: xx >>> Job: JOB-NAME.2007-09-13_06.00.03 >>> Backup Level: Full >>> Client: "CLIENT-NAME" 2.2.3 (09Sep07) >>> x86_64-unknown-linux-gnu,redhat, >>> FileSet: "CLIENT-NAME-FileSet" 2007-09-13 06:00:03 >>> Pool: "CLIENT-NAME-Pool-Daily" (From Run pool >>> override) Storage: "CLIENT-NAME-Storage" (From Job resource) >>> Scheduled time: 13-Sep-2007 06:00:02 >>> Start time: 13-Sep-2007 06:00:04 >>> End time: 13-Sep-2007 06:59:24 >>> Elapsed time: 59 mins 20 secs >>> Priority: 10 >>> FD Files Written: 1 >>> SD Files Written: 1 >>> FD Bytes Written: 10,131,928,399 (10.13 GB) >>> SD Bytes Written: 10,131,928,542 (10.13 GB) >>> Rate: 2846.0 KB/s >>> Software Compression: 22.1 % >>> VSS: no >>> Encryption: no >>> Volume name(s): VOLUME-NAME >>> Volume Session Id: 32 >>> Volume Session Time: 1189578554 >>> Last Volume Bytes: 10,139,969,387 (10.13 GB) >>> Non-fatal FD errors: 0 >>> SD Errors: 0 >>> FD termination status: Canceled >>> SD termination status: Canceled >>> Termination: Backup Canceled >>> >>> -- >>> >>> -- error message #2 against a windows client -- >>> >>> 13-Sep 04:00 bacula1-dir: Start Backup JobId xx, >>> Job=CLIENT-NAME2-Job.2007-09-13_04.00.00 >>> 13-Sep 04:00 bacula1-dir: Created new Volume "CLIENT-NAME2-Daily-0001" >>> in catalog. >>> 13-Sep 04:00 bacula1-dir: Using Device "CLIENT-NAME2-Device" >>> 13-Sep 04:00 bacula1-sd: Labeled new Volume "CLIENT-NAME2-Daily-0001" on >>> device "CLIENT-NAME2-Device" (PATH/CLIENT-NAME2). >>> 13-Sep 04:00 bacula1-sd: Wrote label to prelabeled Volume >>> "CLIENT-NAME2-Daily-0001" on device "CLIENT-NAME2-Device" >>> (PATH/CLIENT-NAME2) >>> 13-Sep 04:00 bacula1-dir: Volume used once. Marking Volume >>> "CLIENT-NAME2-Daily-0001" as Used. >>> 13-Sep 04:00 CLIENT-NAME2-fd: Generate VSS snapshots. Driver="VSS Win >>> 2003", Drive(s)="CG" >>> 13-Sep 04:59 bacula1-dir: CLIENT-NAME2-Job.2007-09-13_04.00.00 Fatal >>> error: Max wait time exceeded. Job canceled. >>> 13-Sep 04:59 CLIENT-NAME2-fd: CLIENT-NAME2-Job.2007-09-13_04.00.00 Fatal >>> error: ../../filed/backup.c:892 Network send error to SD. ERR=No error >>> 13-Sep 04:59 bacula1-sd: Job CLIENT-NAME2-Job.2007-09-13_04.00.00 marked >>> to be canceled. >>> 13-Sep 04:59 CLIENT-NAME2-fd: VSS Writer (BackupComplete): "System >>> Writer", State: 0x1 (VSS_WS_STABLE) >>> 13-Sep 04:59 CLIENT-NAME2-fd: VSS Writer (BackupComplete): "Microsoft >>> Exchange Writer", State: 0x1 (VSS_WS_STABLE) >>> 13-Sep 04:59 CLIENT-NAME2-fd: VSS Writer (BackupComplete): "IIS Metabase >>> Writer", State: 0x1 (VSS_WS_STABLE) >>> 13-Sep 04:59 CLIENT-NAME2-fd: VSS Writer (BackupComplete): "MSDEWriter", >>> State: 0x1 (VSS_WS_STABLE) >>> 13-Sep 04:59 CLIENT-NAME2-fd: VSS Writer (BackupComplete): "Event Log >>> Writer", State: 0x1 (VSS_WS_STABLE) >>> 13-Sep 04:59 CLIENT-NAME2-fd: VSS Writer (BackupComplete): "WMI Writer", >>> State: 0x1 (VSS_WS_STABLE) >>> 13-Sep 04:59 CLIENT-NAME2-fd: VSS Writer (BackupComplete): "BITS >>> Writer", State: 0x1 (VSS_WS_STABLE) >>> 13-Sep 04:59 CLIENT-NAME2-fd: VSS Writer (BackupComplete): "Removable >>> Storage Manager", State: 0x1 (VSS_WS_STABLE) >>> 13-Sep 04:59 CLIENT-NAME2-fd: VSS Writer (BackupComplete): "Registry >>> Writer", State: 0x1 (VSS_WS_STABLE) >>> 13-Sep 04:59 CLIENT-NAME2-fd: VSS Writer (BackupComplete): "COM+ REGDB >>> Writer", State: 0x1 (VSS_WS_STABLE) >>> 13-Sep 04:59 bacula1-sd: CLIENT-NAME2-Job.2007-09-13_04.00.00 Fatal >>> error: append.c:259 Network error on data channel. ERR=Connection reset >>> by peer >>> 13-Sep 04:59 bacula1-sd: Job write elapsed time = 00:59:20, Transfer >>> rate = 2.485 M bytes/second >>> 13-Sep 04:59 bacula1-dir: Bacula bacula1-dir 2.2.3 (09Sep07): >>> 13-Sep-2007 04:59:25 >>> Build OS: x86_64-unknown-linux-gnu redhat >>> JobId: xx >>> Job: CLIENT-NAME2-Job.2007-09-13_04.00.00 >>> Backup Level: Full >>> Client: "CLIENT-NAME2" 2.2.3 (09Sep07) >>> Linux,Cross-compile,Win32 >>> FileSet: "CLIENT-NAME2-FileSet" 2007-09-13 04:00:00 >>> Pool: "CLIENT-NAME2-Pool-Daily" (From Run pool >>> override) >>> Storage: "CLIENT-NAME2-Storage" (From Job resource) >>> Scheduled time: 13-Sep-2007 04:00:00 >>> Start time: 13-Sep-2007 04:00:04 >>> End time: 13-Sep-2007 04:59:25 >>> Elapsed time: 59 mins 21 secs >>> Priority: 10 >>> FD Files Written: 9 >>> SD Files Written: 9 >>> FD Bytes Written: 8,848,250,993 (8.848 GB) >>> SD Bytes Written: 8,848,252,310 (8.848 GB) >>> Rate: 2484.8 KB/s >>> Software Compression: 28.3 % >>> VSS: yes >>> Encryption: no >>> Volume name(s): CLIENT-NAME2-Daily-0001 >>> Volume Session Id: 27 >>> Volume Session Time: 1189578554 >>> Last Volume Bytes: 8,855,455,711 (8.855 GB) >>> Non-fatal FD errors: 0 >>> SD Errors: 0 >>> FD termination status: Canceled >>> SD termination status: Canceled >>> Termination: Backup Canceled >>> >>> >>> >>> ------------------------------------------------------------------------- >>> This SF.net email is sponsored by: Microsoft >>> Defy all challenges. Microsoft(R) Visual Studio 2005. >>> http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/ >>> _______________________________________________ >>> Bacula-users mailing list >>> Bacula-users@lists.sourceforge.net >>> https://lists.sourceforge.net/lists/listinfo/bacula-users >> >> >> -- >> The information contained in this communication is confidential and may be >> legally privileged. It is intended solely for the use of the individual or >> entity to whom it is addressed. If you are not the intended recipient you >> are hereby notified that any disclosure, copying, distribution or any action >> taken or omitted in reliance on the contents of this information is strictly >> prohibited and may be unlawful. Whilst all reasonable steps are taken to >> ensure the accuracy and integrity of information and data transmitted >> electronically and to preserve the confidentiality thereof, the Berco Group >> and its associated business entities and/or units accept no liability or >> responsibility whatsoever if information or data is, for whatever reason, >> corrupted or does not reach its intended destination. >> >> ------------------------------------------------------------------------- >> This SF.net email is sponsored by: Microsoft >> Defy all challenges. Microsoft(R) Visual Studio 2005. >> http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/ >> _______________________________________________ >> Bacula-users mailing list >> Bacula-users@lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/bacula-users > > ------------------------------------------------------------------------- > This SF.net email is sponsored by: Microsoft > Defy all challenges. Microsoft(R) Visual Studio 2005. > http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/ > _______________________________________________ > Bacula-users mailing list > Bacula-users@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/bacula-users ------------------------------------------------------------------------- This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2005. http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/ _______________________________________________ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users