And yet more info (long, with lots of job reports). This error actually happened with *two* different tapes, and in both cases, when they errored, they had the same "last volume bytes" value.
This seems to be tied to network errors somehow. Observer: The job before this one loaded tape 0009. It writes to tape nine, and has a last volume bytes value of 15,747,641,980. The next job runs, but errors out due to a network glitch. 02-Oct 01:05 herodotus-dir: Start Backup JobId 6870, Job=fiscalpro.2005-10-02_01.05.20 02-Oct 01:05 herodotus-sd: Spooling data ... 02-Oct 03:02 herodotus-sd: Writing spooled data to Volume. Despooling 71,612,997,031 bytes ... 02-Oct 05:16 herodotus-dir: fiscalpro.2005-10-02_01.05.20 Fatal error: Network error with FD during Backup: ERR=Connection timed out 02-Oct 05:17 herodotus-dir: fiscalpro.2005-10-02_01.05.20 Fatal error: No Job status returned from FD. 02-Oct 05:27 herodotus-dir: fiscalpro.2005-10-02_01.05.20 Error: Bacula 1.36.1 (26Nov04): 02-Oct-2005 05:17:09 JobId: 6870 Job: fiscalpro.2005-10-02_01.05.20 Backup Level: Full Client: fiscalpro-fd FileSet: "wxpDefault" 2005-01-10 15:27:47 Pool: "Default" Storage: "herodotus-sd" Start time: 02-Oct-2005 01:05:21 End time: 02-Oct-2005 05:17:09 FD Files Written: 0 SD Files Written: 0 FD Bytes Written: 0 SD Bytes Written: 0 Rate: 0.0 KB/s Software Compression: None Volume name(s): Volume Session Id: 674 Volume Session Time: 1126137632 Last Volume Bytes: 51,747,561,167 Non-fatal FD errors: 0 SD Errors: 0 FD termination status: Error SD termination status: Running Termination: *** Backup Error *** Notice the last volume bytes value. Then, the next job (also errors, but with the EOM error message): 02-Oct 01:05 herodotus-dir: Start Backup JobId 6874, Job=webdev1.2005-10-02_01.05.24 02-Oct 01:05 herodotus-sd: Spooling data ... 02-Oct 03:02 herodotus-sd: Writing spooled data to Volume. Despooling 71,612,545,363 bytes ... 02-Oct 04:48 herodotus-sd: Writing spooled data to Volume. Despooling 71,613,118,263 bytes ... 02-Oct 06:15 herodotus-sd: webdev1.2005-10-02_01.05.24 Fatal error: Cannot write block. Device at EOM. 02-Oct 06:15 herodotus-sd: webdev1.2005-10-02_01.05.24 Fatal error: Fatal despooling error.02-Oct 06:15 herodotus-sd: webdev1.2005-10-02_01.05.24 Fatal error: Fatal device error: ERR= 02-Oct 06:15 webdev1-fd: webdev1.2005-10-02_01.05.24 Fatal error: backup.c:472 Network send error 32768 to SD. ERR=Broken pipe 02-Oct 06:15 herodotus-dir: webdev1.2005-10-02_01.05.24 Error: Bacula 1.36.1 (26Nov04): 02-Oct-2005 06:15:51 JobId: 6874 Job: webdev1.2005-10-02_01.05.24 Backup Level: Full Client: webdev1-fd FileSet: "webdev1-fileset" 2005-03-11 11:28:01 Pool: "Default" Storage: "herodotus-sd" Start time: 02-Oct-2005 01:05:25 End time: 02-Oct-2005 06:15:51 FD Files Written: 3,424 SD Files Written: 3,424 FD Bytes Written: 37,355,359,925 SD Bytes Written: 37,355,770,896 Rate: 2005.5 KB/s Software Compression: None Volume name(s): BaculaTape-0009-20041213 Volume Session Id: 678 Volume Session Time: 1126137632 Last Volume Bytes: 12,999,102,895 Non-fatal FD errors: 0 SD Errors: 0 FD termination status: Error SD termination status: Error Termination: *** Backup Error *** Fatal error, and Last Volume Bytes (LVB) less than before. Next job: 02-Oct 01:05 herodotus-dir: Start Backup JobId 6867, Job=textbook3.2005-10-02_01.05.17 02-Oct 01:05 herodotus-sd: Spooling data ... 02-Oct 03:02 herodotus-sd: Writing spooled data to Volume. Despooling 71,612,803,459 bytes ... 02-Oct 06:15 herodotus-sd: textbook3.2005-10-02_01.05.17 Fatal error: Cannot write block. Device at EOM. 02-Oct 06:15 herodotus-sd: textbook3.2005-10-02_01.05.17 Fatal error: Fatal despooling error.02-Oct 06:15 herodotus-sd: textbook3.2005-10-02_01.05.17 Fatal error: Fatal device error: ERR= 02-Oct 06:15 textbook3-fd: textbook3.2005-10-02_01.05.17 Fatal error: .. \filed\../../filed/backup.c:472 Network send error 32768 to SD. ERR=An existing connection was forcibly closed by the remote host. And same LVB. Another job with EOM error, and same LVB. Then a job that succeeds, and writes, but same LVB. Then I get a job that changes tape (0009 to 0010), but then fails with the EOM error, and ends up with the same LVB value. 02-Oct 01:05 herodotus-dir: Start Backup JobId 6868, Job=textbook4.2005-10-02_01.05.18 02-Oct 01:05 herodotus-sd: Spooling data ... 02-Oct 03:02 herodotus-sd: Writing spooled data to Volume. Despooling 71,612,674,411 bytes ... 02-Oct 05:13 herodotus-sd: End of Volume "BaculaTape-0009-20041213" at 47:819 on device /dev/nst0. Write of 64512 bytes got -1. 02-Oct 05:14 herodotus-sd: Re-read of last block succeeded. 02-Oct 05:14 herodotus-sd: End of medium on Volume "BaculaTape-0009-20041213" Bytes=47,049,606,471 Blocks=729,319 at 02-Oct-2005 05:14. 02-Oct 05:14 herodotus-dir: Purging oldest volume "BaculaTape-0010-20041213" 02-Oct 05:27 herodotus-dir: 215 Files on Volume "BaculaTape-0010-20041213" purged from catalog. 02-Oct 05:27 herodotus-dir: There are no more Jobs associated with Volume "BaculaTape-0010-20041213". Marking it purged. 02-Oct 05:27 herodotus-sd: 3301 Issuing autochanger "loaded drive 0" command. 02-Oct 05:27 herodotus-sd: 3302 Autochanger "loaded drive 0", result is Slot 9. 02-Oct 05:27 herodotus-sd: 3303 Issuing autochanger "unload slot 9, drive 0" command. 02-Oct 05:27 herodotus-sd: 3304 Issuing autochanger "load slot 10, drive 0" command. 02-Oct 05:30 herodotus-sd: 3305 Autochanger "load slot 10, drive 0", status is OK. 02-Oct 05:30 herodotus-sd: Recycled volume "BaculaTape-0010-20041213" on device "/dev/nst0", all previous data lost. 02-Oct 05:30 herodotus-sd: New volume "BaculaTape-0010-20041213" mounted on device /dev/nst0 at 02-Oct-2005 05:30. 02-Oct 05:36 textbook4-fd: Could not stat c:/Documents and Settings/Michael Lovecraft/My Documents/My Music/iTunes/iTunes Music/Ozzy Osbourne/Prince Of Darkness [Disc 2]: ERR=The system cannot find the file specified. 02-Oct 05:44 herodotus-sd: Committing spooled data to Volume. Despooling 70,261,901,515 bytes ... 02-Oct 06:16 herodotus-sd: textbook4.2005-10-02_01.05.18 Fatal error: Cannot write block. Device at EOM. 02-Oct 06:16 herodotus-sd: Sending spooled attrs to the Director. Despooling 1,590,123 bytes ... 02-Oct 06:17 textbook4-fd: textbook4.2005-10-02_01.05.18 Fatal error: Bad status 102 returned from Storage Daemon. 02-Oct 06:17 herodotus-dir: textbook4.2005-10-02_01.05.18 Error: Bacula 1.36.1 (26Nov04): 02-Oct-2005 06:17:24 JobId: 6868 Job: textbook4.2005-10-02_01.05.18 Backup Level: Full Client: textbook4-fd FileSet: "wxpDefault" 2005-01-10 15:27:47 Pool: "Default" Storage: "herodotus-sd" Start time: 02-Oct-2005 01:05:19 End time: 02-Oct-2005 06:17:24 FD Files Written: 4,023 SD Files Written: 4,023 FD Bytes Written: 8,942,960,505 SD Bytes Written: 8,943,908,548 Rate: 477.6 KB/s Software Compression: None Volume name(s): BaculaTape-0010-20041213|BaculaTape-0009-20041213 Volume Session Id: 672 Volume Session Time: 1126137632 Last Volume Bytes: 12,999,102,895 Non-fatal FD errors: 1 SD Errors: 0 FD termination status: Error SD termination status: Error Termination: *** Backup Error *** Then I have a few more jobs that succeed, but the LVB value stays the same, Then a few more jobs that fail, interspersed by success (LVB always the same, mind you). Then, today, I ejected 0010, put in 0011, ran a backup. The server got 0010 again, ran the backup, and succeeded. And LVB incremented too. Any ideas, tips, suggestions, etc? j----- k----- -- Joshua Kugler CDE System Administrator http://distance.uaf.edu/ ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users