On Monday 03 July 2006 23:12, Wolfgang Denk wrote: > I ran into a strange failure mode, which I don't understand: > > This is one of the typical reports I get when backing up the catalog: > > JobId: 2974 > Job: BackupCatalog.2006-06-30_01.00.00 > Backup Level: Full > Client: "xxxxxx-fd" i686-pc-linux-gnu,redhat,(Tettnang) > FileSet: "Catalog" 2005-12-21 10:55:54 > Pool: "Default" > Storage: "File" > Scheduled time: 30-Jun-2006 01:00:00 > Start time: 30-Jun-2006 03:52:50 > End time: 30-Jun-2006 04:13:49 > Priority: 99 > FD Files Written: 1 > SD Files Written: 1 > FD Bytes Written: 1,084,190,632 > SD Bytes Written: 1,084,190,739 > Rate: 861.2 KB/s > Software Compression: 73.7 % > Volume name(s): xxxxxx.denx.de-024 > Volume Session Id: 235 > Volume Session Time: 1144017997 > Last Volume Bytes: 3,840,609,011 > Non-fatal FD errors: 0 > SD Errors: 0 > FD termination status: OK > SD termination status: OK > Termination: Backup OK > > Looks perfectly OK to me. > > A "query" gives: > > Choose a query (1-17): 15 > Enter Volume name: xxxxxx.denx.de-024 > +-------+---------------+---------------------+------+-------+-------+----- >----------+--------+ > > | JobId | Name | StartTime | Type | Level | Files | > | Bytes | Status | > > +-------+---------------+---------------------+------+-------+-------+----- >----------+--------+ > > | 2,926 | BackupCatalog | 2006-06-27 01:13:43 | B | F | 1 | > | 1,081,685,712 | T | 2,942 | BackupCatalog | 2006-06-28 01:00:30 | B > | | F | 1 | 1,035,180,351 | T | 2,958 | BackupCatalog | > | 2006-06-29 04:01:32 | B | F | 1 | 1,080,280,688 | T | > | 2,974 | BackupCatalog | 2006-06-30 03:52:50 | B | F | 1 | > | 1,084,190,632 | T | 2,990 | BackupCatalog | 2006-07-01 01:15:30 | B > | | F | 1 | 1,082,035,271 | T | 3,006 | BackupCatalog | > | 2006-07-03 15:33:48 | B | F | 1 | 1,578,373,106 | T | > | 3,022 | BackupCatalog | 2006-07-03 15:54:33 | B | F | 1 | > | 1,494,147,110 | A | > > +-------+---------------+---------------------+------+-------+-------+----- >----------+--------+ > > Looks OK to me, too. > > However: > > # ls -l > ... > -rw-r----- 1 root root 7999988140 Apr 11 02:33 xxxxxx.denx.de-012 > -rw-r----- 1 root root 7999967042 Apr 16 03:45 xxxxxx.denx.de-013 > -rw-r----- 1 root root 4426780672 Apr 21 02:21 xxxxxx.denx.de-014 > -rw-r----- 1 root root 20746240 Apr 27 01:08 xxxxxx.denx.de-015 > -rw-r----- 1 root root 4096 May 4 01:48 xxxxxx.denx.de-016 > -rw-r----- 1 root root 4096 May 16 01:10 xxxxxx.denx.de-017 > -rw-r----- 1 root root 4096 May 24 01:26 xxxxxx.denx.de-018 > -rw-r----- 1 root root 4096 May 31 03:25 xxxxxx.denx.de-019 > -rw-r----- 1 root root 4096 Jun 7 02:03 xxxxxx.denx.de-020 > -rw-r----- 1 root root 16384 Jun 12 01:23 xxxxxx.denx.de-021 > -rw-r----- 1 root root 4096 Jun 19 01:16 xxxxxx.denx.de-022 > -rw-r----- 1 root root 4096 Jun 27 01:23 xxxxxx.denx.de-023 > -rw-r----- 1 root root 4096 Jul 3 16:15 xxxxxx.denx.de-024 > ... > > There is a size limit of 8 GB per volume which worked fine for a long > time, but the something happened - the problem was that I ran out of > disk space on the partition where the backup volumes are stored. > Actually a long time ago, but without noticing. > > What I don't understand is: neither bacula-dir nor bacula-sd gave > *any* indication of the problem. > > > dir Version: 1.38.5 (18 January 2006) i686-pc-linux-gnu redhat (Tettnang) > sd Version: 1.38.5 (18 January 2006) i686-pc-linux-gnu redhat (Tettnang) > > The Archive Device is on a NFS mounted partition. I don't see any > error messages in any of the system logs, either. > > > Any ideas how to get aware of such a problem in the future?
Unless your OS is broken, Bacula always detects and reports when a volume gets a write error, which is the case when your partition fills up. I would classify the fact that Bacula continues attempting to create Volumes on a harddisk after a full partition as missing code (a bug if you will). However, before making any definitive statement, I would like to see the "error" messages that Bacula produced for you. They should be in the log. Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642 _______________________________________________ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users