from the email:
FAILURE DUMP SUMMARY:
coyote /GenesAmandaHelper-0.61/config-bak lev 0 partial taper: source
server crc (6158b8f5:29861110032) and input server crc
(4f171223:29861110032) differ)
coyote /GenesAmandaHelper-0.61/config-bak lev 0 FAILED [data timeout]
coyote /GenesAmandaHelper-0.61/config-bak lev 0 partial taper:
successfully taped a partial dump
coyote /opt lev 0 partial taper: source server crc
(1f484cf4:15602100421) and input server crc (a7bc8db8:15602100421)
differ)
coyote /opt lev 0 FAILED [data timeout]
coyote /opt lev 0 FAILED [failed to set shm-ring]
coyote /usr/movies lev 0 partial taper: source server crc
(55338628:13370009600) and input server crc (feecb660:13370009600)
differ)
coyote /usr/movies lev 0 was successfully retried
And:
FAILED DUMP DETAILS:
/-- coyote /GenesAmandaHelper-0.61/config-bak lev 0 FAILED [data
timeout]
sendbackup: info BACKUP=APPLICATION
sendbackup: info APPLICATION=amgtar
sendbackup: info
RECOVER_CMD=/bin/gzip -dc |/usr/local/libexec/amanda/application/amgtar
restore [./file-to-restore]+
sendbackup: info COMPRESS_SUFFIX=.gz
sendbackup: info end
\--------
/-- coyote /opt lev 0 FAILED [data timeout]
sendbackup: info BACKUP=APPLICATION
sendbackup: info APPLICATION=amgtar
sendbackup: info
RECOVER_CMD=/bin/gzip -dc |/usr/local/libexec/amanda/application/amgtar
restore [./file-to-restore]+
sendbackup: info COMPRESS_SUFFIX=.gz
sendbackup: info end
\--------
NOTES:
planner: Incremental of coyote:/usr/local bumped to level 2.
driver: coyote /GenesAmandaHelper-0.61/config-bak 20210122020104 0
[Will retry dump because of holding disk error: source server crc
(6158b8f5:29861110032) and input server crc (4f171223:29861110032)
differ)]
driver: coyote /opt 20210122020104 0 [Will retry dump because of
holding disk error: source server crc (1f484cf4:15602100421) and input
server crc (a7bc8db8:15602100421) differ)]
driver: coyote /usr/movies 20210122020104 0 [Will retry dump because of
holding disk error: source server crc (55338628:13370009600) and input
server crc (feecb660:13370009600) differ)]
taper: tape Dailys-2 kb 73355394 fm 79 [OK]
big estimate: coyote /GenesAmandaHelper-0.61/config-bak 0
est: 28000M out 0M
left in the holding disk:
root@coyote:data$ ls -l /sdb/dumps/20210122020104/
total 57456372
-rw------- 1 amanda amanda 1048576000 Jan 22 02:31
coyote._GenesAmandaHelper-0.61_config-bak.0
-rw------- 1 amanda amanda 1048576000 Jan 22 02:08
coyote._GenesAmandaHelper-0.61_config-bak.0.1
-rw------- 1 amanda amanda 1048576000 Jan 22 02:16
coyote._GenesAmandaHelper-0.61_config-bak.0.10
-rw------- 1 amanda amanda 1048576000 Jan 22 02:17
coyote._GenesAmandaHelper-0.61_config-bak.0.11
-rw------- 1 amanda amanda 1048576000 Jan 22 02:18
coyote._GenesAmandaHelper-0.61_config-bak.0.12
-rw------- 1 amanda amanda 1048576000 Jan 22 02:19
coyote._GenesAmandaHelper-0.61_config-bak.0.13
-rw------- 1 amanda amanda 1048576000 Jan 22 02:20
coyote._GenesAmandaHelper-0.61_config-bak.0.14
-rw------- 1 amanda amanda 1048576000 Jan 22 02:21
coyote._GenesAmandaHelper-0.61_config-bak.0.15
-rw------- 1 amanda amanda 1048576000 Jan 22 02:22
coyote._GenesAmandaHelper-0.61_config-bak.0.16
-rw------- 1 amanda amanda 1048576000 Jan 22 02:23
coyote._GenesAmandaHelper-0.61_config-bak.0.17
-rw------- 1 amanda amanda 1048576000 Jan 22 02:23
coyote._GenesAmandaHelper-0.61_config-bak.0.18
-rw------- 1 amanda amanda 1048576000 Jan 22 02:24
coyote._GenesAmandaHelper-0.61_config-bak.0.19
-rw------- 1 amanda amanda 1048576000 Jan 22 02:09
coyote._GenesAmandaHelper-0.61_config-bak.0.2
-rw------- 1 amanda amanda 1048576000 Jan 22 02:25
coyote._GenesAmandaHelper-0.61_config-bak.0.20
-rw------- 1 amanda amanda 1048576000 Jan 22 02:26
coyote._GenesAmandaHelper-0.61_config-bak.0.21
-rw------- 1 amanda amanda 1048576000 Jan 22 02:27
coyote._GenesAmandaHelper-0.61_config-bak.0.22
-rw------- 1 amanda amanda 1048576000 Jan 22 02:28
coyote._GenesAmandaHelper-0.61_config-bak.0.23
-rw------- 1 amanda amanda 1048576000 Jan 22 02:28
coyote._GenesAmandaHelper-0.61_config-bak.0.24
-rw------- 1 amanda amanda 1048576000 Jan 22 02:29
coyote._GenesAmandaHelper-0.61_config-bak.0.25
-rw------- 1 amanda amanda 1048576000 Jan 22 02:30
coyote._GenesAmandaHelper-0.61_config-bak.0.26
-rw------- 1 amanda amanda 1048576000 Jan 22 02:31
coyote._GenesAmandaHelper-0.61_config-bak.0.27
-rw------- 1 amanda amanda 501932304 Jan 22 02:31
coyote._GenesAmandaHelper-0.61_config-bak.0.28
-rw------- 1 amanda amanda 1048576000 Jan 22 02:10
coyote._GenesAmandaHelper-0.61_config-bak.0.3
-rw------- 1 amanda amanda 1048576000 Jan 22 02:11
coyote._GenesAmandaHelper-0.61_config-bak.0.4
-rw------- 1 amanda amanda 1048576000 Jan 22 02:12
coyote._GenesAmandaHelper-0.61_config-bak.0.5
-rw------- 1 amanda amanda 1048576000 Jan 22 02:13
coyote._GenesAmandaHelper-0.61_config-bak.0.6
-rw------- 1 amanda amanda 1048576000 Jan 22 02:14
coyote._GenesAmandaHelper-0.61_config-bak.0.7
-rw------- 1 amanda amanda 1048576000 Jan 22 02:15
coyote._GenesAmandaHelper-0.61_config-bak.0.8
-rw------- 1 amanda amanda 1048576000 Jan 22 02:16
coyote._GenesAmandaHelper-0.61_config-bak.0.9
-rw------- 1 amanda amanda 1048576000 Jan 22 02:54 coyote._opt.0
-rw------- 1 amanda amanda 1048576000 Jan 22 02:33 coyote._opt.0.1
-rw------- 1 amanda amanda 1048576000 Jan 22 02:49 coyote._opt.0.10
-rw------- 1 amanda amanda 1048576000 Jan 22 02:49 coyote._opt.0.11
-rw------- 1 amanda amanda 1048576000 Jan 22 02:50 coyote._opt.0.12
-rw------- 1 amanda amanda 1048576000 Jan 22 02:51 coyote._opt.0.13
-rw------- 1 amanda amanda 922527941 Jan 22 02:54 coyote._opt.0.14
-rw------- 1 amanda amanda 1048576000 Jan 22 02:34 coyote._opt.0.2
-rw------- 1 amanda amanda 1048576000 Jan 22 02:35 coyote._opt.0.3
-rw------- 1 amanda amanda 1048576000 Jan 22 02:38 coyote._opt.0.4
-rw------- 1 amanda amanda 1048576000 Jan 22 02:41 coyote._opt.0.5
-rw------- 1 amanda amanda 1048576000 Jan 22 02:43 coyote._opt.0.6
-rw------- 1 amanda amanda 1048576000 Jan 22 02:45 coyote._opt.0.7
-rw------- 1 amanda amanda 1048576000 Jan 22 02:46 coyote._opt.0.8
-rw------- 1 amanda amanda 1048576000 Jan 22 02:47 coyote._opt.0.9
-rw------- 1 amanda amanda 1048576000 Jan 22 03:59 coyote._usr_movies.0
-rw------- 1 amanda amanda 1048576000 Jan 22 03:55 coyote._usr_movies.0.1
-rw------- 1 amanda amanda 1048576000 Jan 22 03:58
coyote._usr_movies.0.10
-rw------- 1 amanda amanda 1048576000 Jan 22 03:58
coyote._usr_movies.0.11
-rw------- 1 amanda amanda 787523584 Jan 22 03:59
coyote._usr_movies.0.12
-rw------- 1 amanda amanda 1048576000 Jan 22 03:55 coyote._usr_movies.0.2
-rw------- 1 amanda amanda 1048576000 Jan 22 03:56 coyote._usr_movies.0.3
-rw------- 1 amanda amanda 1048576000 Jan 22 03:56 coyote._usr_movies.0.4
-rw------- 1 amanda amanda 1048576000 Jan 22 03:56 coyote._usr_movies.0.5
-rw------- 1 amanda amanda 1048576000 Jan 22 03:56 coyote._usr_movies.0.6
-rw------- 1 amanda amanda 1048576000 Jan 22 03:56 coyote._usr_movies.0.7
-rw------- 1 amanda amanda 1048576000 Jan 22 03:56 coyote._usr_movies.0.8
-rw------- 1 amanda amanda 1048576000 Jan 22 03:57 coyote._usr_movies.0.9
root@coyote:data$
And I was asked to show the amstatus output when it failed:
root@coyote:data$ cat /home/amanda/log/amstat.d/amstat-210122-0507
Using: /usr/local/var/amanda/Daily/amdump.1
Thats it, normally its about 10k of stuff
One of the failed ones header:
root@coyote:data$ dd if=00046.coyote._opt.0 bs=32k count=1
AMANDA: SPLIT_FILE 20210122020104 coyote /opt part 1/-1 lev 0 comp .gz
program APPLICATION
APPLICATION=amgtar
ORIGSIZE=24681320
NATIVE-CRC=b5751dc0:25273671680
CLIENT-CRC=a7bc8db8:15602100421
SERVER-CRC=a7bc8db8:15602100421
DLE=<<ENDDLE
<dle>
<program>APPLICATION</program>
<disk>/opt</disk>
<level>0</level>
<auth>bsdtcp</auth>
<compress>BEST</compress>
<record>YES</record>
<index>YES</index>
<datapath>AMANDA</datapath>
<exclude>
<list>/GenesAmandaHelper-0.61/excludes</list>
</exclude>
<backup-program>
<plugin>amgtar</plugin>
<property>
<name>ignore</name>
<value encoding="raw"
raw="OiBzb2NrZXQgaWdub3JlZCQ=">:_socket_ignored$</value> <value
encoding="raw"
raw="ZmlsZSBjaGFuZ2VkIGFzIHdlIHJlYWQgaXQk">file_changed_as_we_read_it$</value>
</property>
<property>
<name>one-file-system</name>
<value>yes</value>
</property>
<property>
<name>check-device</name>
<value>no</value>
</property>
</backup-program>
</dle>
ENDDLE
To restore, position tape at start of file and run:
dd if=<tape> bs=32k
skip=1 | /bin/gzip -dc | /usr/local/libexec/amanda/application/amgtar
restore [./file-to-restore]+
1+0 records in
1+0 records out
32768 bytes (33 kB, 32 KiB) copied, 0.0179666 s, 1.8 MB/s
If I hunt down a succesfully retried dle and dump its header, there will
not be any crc reports in it.
Conclusions/clues:
1. it only blows up on a big, but randomly selected level 0
that may be from any of the 5 machines being backed up.
2. its nearly always because of a CRC error in the holding disk.
3. the holding disk has been swapped out twice now. Original was spinning
rust, 2 replacements are SSD's and are much faster.
4. amstatus always fails, IMO the failure that starts all this.
5. the error messages are not the least illuminating IMNSHO. the dle
for /GenesAmandelper/config-bak is 40Gigs but its the last 60 copys of
the configs that made these backup, and a copy of amanda's own database
for the last 60 backups, and the last 60 reports generated by amanda's
activities.
Obviously I need help. Many thanks to those who try.
Cheers, Gene Heskett
--
"There are four boxes to be used in defense of liberty:
soap, ballot, jury, and ammo. Please use in that order."
-Ed Howdershelt (Author)
If we desire respect for the law, we must first make the law respectable.
- Louis D. Brandeis
Genes Web page <http://geneslinuxbox.net:6309/gene>