Hello there, I have a problem while backing up two windows servers in two different installations. The scenarios are almost equal:
- Bacula-dir (v5.0.1) on Ubuntu 10.04.3 virtualized with VMware vSphere 5 Hypervisor - Bacula-sd (v5.0.1) on same server with file storage on a NAS, mounted via iSCSI. - Windows Server with problem in installation 1: Windows 2003 32bit SP1 virtulized on different VMware vSphere 5 Hypervisor, Bacula-fd 5.2.3 - Windows Server with problem in installation 2: Windows 2008R2 SP1 64bit virtulized on same hypervisor, Bacula-fd 5.2.6 - Many other Servers (virtualized and physical), Windos or Linux, without any problems. In both installations occur the errors shown below two to three times a week (not in all backups). I have three jobs per server and it is not always the same job, but always the same server. What I have done: - Use a newer FD version - Just one concurrent job per server ("Maximum Concurrent Jobs = 1" on all FDs) - Set "Heartbeat Interval = 60" on FD, SD and Dir I hope, someone has a clue for me or a hint, where to troubleshoot further. Best regards Michael Neuendorf The logs of the two server: 2012-09-19 22:58:45 bacula-dir JobId 13962: Start Backup JobId 13962, Job=nina_systemstate.2012-09-19_21.50.01_31 2012-09-19 22:58:46 bacula-dir JobId 13962: Using Device "FileStorageLocal" 2012-09-19 23:02:41 nina-fd JobId 13962: DIR and FD clocks differ by 233 seconds, FD automatically compensating. 2012-09-19 23:02:41 nina-fd JobId 13962: DIR and FD clocks differ by 233 seconds, FD automatically compensating. 2012-09-19 23:02:45 nina-fd JobId 13962: shell command: run ClientRunBeforeJob "C:/backup/bacula/systemstate.cmd" 2012-09-19 23:02:45 nina-fd JobId 13962: shell command: run ClientRunBeforeJob "C:/backup/bacula/systemstate.cmd" 2012-09-19 23:03:40 bacula-dir JobId 13962: Sending Accurate information. 2012-09-19 23:05:12 bacula-dir-sd JobId 13962: Job write elapsed time = 00:01:21, Transfer rate = 2.517 M Bytes/second 2012-09-19 23:09:06 nina-fd JobId 13962: shell command: run ClientAfterJob "C:/backup/bacula/systemstate.cmd cleanup" 2012-09-19 23:09:06 nina-fd JobId 13962: shell command: run ClientAfterJob "C:/backup/bacula/systemstate.cmd cleanup" 2012-09-19 23:05:17 bacula-dir JobId 13962: Fatal error: Network error with FD during Backup: ERR=Connection reset by peer 2012-09-19 23:05:18 bacula-dir-sd JobId 13962: JobId=13962 Job="nina_systemstate.2012-09-19_21.50.01_31" marked to be canceled. 2012-09-19 23:05:19 bacula-dir JobId 13962: Error: Bacula bacula-dir 5.0.1 (24Feb10): 19-Sep-2012 23:05:19 Build OS: i486-pc-linux-gnu ubuntu 10.04 JobId: 13962 Job: nina_systemstate.2012-09-19_21.50.01_31 Backup Level: Incremental, since=2012-09-18 23:07:02 Client: "nina-fd" 5.2.3 (16Dec11) Microsoft Windows Home ServerStandard Edition Service Pack 1 (build 3790),Cross-compile,Win32 FileSet: "nina_systemstate-set" 2012-04-30 21:50:01 Pool: "DailyLocal" (From Job IncPool override) Catalog: "MyCatalog" (From Client resource) Storage: "Local" (From Pool resource) Scheduled time: 19-Sep-2012 21:50:01 Start time: 19-Sep-2012 23:03:37 End time: 19-Sep-2012 23:05:19 Elapsed time: 1 min 42 secs Priority: 10 FD Files Written: 4 SD Files Written: 0 FD Bytes Written: 203,957,393 (203.9 MB) SD Bytes Written: 0 (0 B) Rate: 1999.6 KB/s Software Compression: 59.0 % VSS: no Encryption: no Accurate: yes Volume name(s): DailyLocal-0117 Volume Session Id: 674 Volume Session Time: 1346772020 Last Volume Bytes: 32,675,913,343 (32.67 GB) Non-fatal FD errors: 0 SD Errors: 0 FD termination status: Error SD termination status: Running Termination: *** Backup Error *** and 2012-09-24 23:50:02 bacula-dir JobId 4017: Start Sicherung JobId 4017, Job=BuHaSrv1_datev.2012-09-24_23.50.00_33 2012-09-24 23:50:02 bacula-dir JobId 4017: There are no more Jobs associated with Volume "DailyLocal-0005". Marking it purged. 2012-09-24 23:50:02 bacula-dir JobId 4017: All records pruned from Volume "DailyLocal-0005"; marking it "Purged" 2012-09-24 23:50:02 bacula-dir JobId 4017: Recycled volume "DailyLocal-0005" 2012-09-24 23:50:02 bacula-dir JobId 4017: Using Device "FileStorageLocal" 2012-09-24 23:50:03 buhasrv1-fd JobId 4017: shell command: run ClientRunBeforeJob "c:/backup/stopsql.cmd" 2012-09-24 23:50:03 buhasrv1-fd JobId 4017: ClientRunBeforeJob: 2012-09-24 23:50:03 buhasrv1-fd JobId 4017: ClientRunBeforeJob: C:\Windows\system32>net stop MSSQL$DATEV_SV_SE01 2012-09-24 23:50:08 buhasrv1-fd JobId 4017: ClientRunBeforeJob: SQL Server (DATEV_SV_SE01) wird beendet.. 2012-09-24 23:50:08 buhasrv1-fd JobId 4017: ClientRunBeforeJob: SQL Server (DATEV_SV_SE01) wurde erfolgreich beendet. 2012-09-24 23:50:08 buhasrv1-fd JobId 4017: ClientRunBeforeJob: 2012-09-24 23:50:08 bacula-dir JobId 4017: Sending Accurate information. 2012-09-24 23:50:10 bacula-sd JobId 4017: Recycled volume "DailyLocal-0005" on device "FileStorageLocal" (/var/spool/bacula/local), all previous data lost. 2012-09-24 23:50:11 buhasrv1-fd JobId 4017: Generate VSS snapshots. Driver="Win64 VSS", Drive(s)="F" 2012-09-24 23:55:21 buhasrv1-fd JobId 4017: VSS Writer (BackupComplete): "Task Scheduler Writer", State: 0x1 (VSS_WS_STABLE) 2012-09-24 23:55:21 buhasrv1-fd JobId 4017: VSS Writer (BackupComplete): "VSS Metadata Store Writer", State: 0x1 (VSS_WS_STABLE) 2012-09-24 23:55:21 buhasrv1-fd JobId 4017: VSS Writer (BackupComplete): "Performance Counters Writer", State: 0x1 (VSS_WS_STABLE) 2012-09-24 23:55:21 buhasrv1-fd JobId 4017: VSS Writer (BackupComplete): "System Writer", State: 0x1 (VSS_WS_STABLE) 2012-09-24 23:55:21 buhasrv1-fd JobId 4017: VSS Writer (BackupComplete): "ASR Writer", State: 0x1 (VSS_WS_STABLE) 2012-09-24 23:55:21 buhasrv1-fd JobId 4017: VSS Writer (BackupComplete): "COM+ REGDB Writer", State: 0x1 (VSS_WS_STABLE) 2012-09-24 23:55:21 buhasrv1-fd JobId 4017: VSS Writer (BackupComplete): "SqlServerWriter", State: 0x1 (VSS_WS_STABLE) 2012-09-24 23:55:21 buhasrv1-fd JobId 4017: VSS Writer (BackupComplete): "Shadow Copy Optimization Writer", State: 0x1 (VSS_WS_STABLE) 2012-09-24 23:55:21 buhasrv1-fd JobId 4017: VSS Writer (BackupComplete): "WMI Writer", State: 0x1 (VSS_WS_STABLE) 2012-09-24 23:55:21 buhasrv1-fd JobId 4017: VSS Writer (BackupComplete): "TermServLicensing", State: 0x1 (VSS_WS_STABLE) 2012-09-24 23:55:21 buhasrv1-fd JobId 4017: VSS Writer (BackupComplete): "Registry Writer", State: 0x1 (VSS_WS_STABLE) 2012-09-24 23:55:22 buhasrv1-fd JobId 4017: shell command: run ClientAfterJob "c:/backup/startsql.cmd" 2012-09-24 23:55:22 bacula-sd JobId 4017: Job write elapsed time = 00:05:12, Transfer rate = 41.31 M Bytes/second 2012-09-24 23:55:22 buhasrv1-fd JobId 4017: ClientAfterJob: 2012-09-24 23:55:22 buhasrv1-fd JobId 4017: ClientAfterJob: C:\Windows\system32>net start MSSQL$DATEV_SV_SE01 2012-09-24 23:55:26 buhasrv1-fd JobId 4017: ClientAfterJob: SQL Server (DATEV_SV_SE01) wird gestartet.. 2012-09-24 23:55:26 buhasrv1-fd JobId 4017: ClientAfterJob: SQL Server (DATEV_SV_SE01) wurde erfolgreich gestartet. 2012-09-24 23:55:26 buhasrv1-fd JobId 4017: ClientAfterJob: 2012-09-24 23:55:26 bacula-dir JobId 4017: Fatal error: Netzwerkfehler mit FD bei Backup: ERR=Die Verbindung wurde vom Kommunikationspartner zurückgesetzt 2012-09-24 23:55:26 bacula-dir JobId 4017: Error: Bacula bacula-dir 5.0.1 (24Feb10): 24-Sep-2012 23:55:26 Build OS: i486-pc-linux-gnu ubuntu 10.04 JobId: 4017 Job: BuHaSrv1_datev.2012-09-24_23.50.00_33 Backup Level: Incremental, seit=2012-09-22 19:10:06 Client: "BuHaSrv1-fd" 5.2.6 (21Feb12) Microsoft Windows Server 2008 R2 Standard Edition Service Pack 1 (build 7601), 64-bit,Cross-compile,Win64 FileSet: "BuHaSrv1_datev-set" 2011-11-30 21:50:00 Pool: "DailyLocal" (From Job IncPool override) Catalog: "MyCatalog" (From Client resource) Storage: "Local" (From Pool Ressource) Scheduled time: 24-Sep-2012 23:50:00 Start time: 24-Sep-2012 23:50:08 End time: 24-Sep-2012 23:55:26 Elapsed time: 5 mins 18 secs Priority: 5 FD Files Written: 141 SD Files Written: 141 FD Bytes Written: 12,888,814,964 (12.88 GB) SD Bytes Written: 12,888,841,033 (12.88 GB) Rate: 40530.9 KB/s Software Compression: None VSS: yes Encryption: no Accurate: yes Volume name(s): DailyLocal-0005 Volume Session Id: 274 Volume Session Time: 1346153192 Last Volume Bytes: 12,898,403,662 (12.89 GB) Non-fatal FD errors: 0 SD Errors: 0 FD termination status: Error SD termination status: OK Termination: *** Sicherungsfehler *** NovaNet GmbH Kupferstr. 65 44532 Lünen Telefon: 02306/202100 FAX: 02306/202109 WEB: www.novanetgmbh.de Firmensitz: Lünen Amtsgericht Dortmund HRB 17273 USt-ID DE 124793480, St.-Nr. 316/5759/0318 Geschäftsführerin: Dipl. Informatikerin (FH) Desiree Wünsche ------------------------------------------------------------------------------ Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ _______________________________________________ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users