I'm getting a little frustrated trying to bring my new backup server online.
I'm running Debian Stretch on a Dell R730, hooked to a Dell TL200 library over
SAS. The library has a single LTO6 drive. Bacula 7.4.4 is installed, and as
far as I can tell is correctly set up and configured. It passes all the btape
diagnostics with no trouble. However, nothing I try from inside bacula itself
seems able to access the tape drive itself. It *is* successful in manipulating
the robotics however:
*mount storage=Library2 slot=1 drive=0
3304 Issuing autochanger "load slot 1, drive 0" command for vol .
3305 Autochanger "load slot 1, drive 0", status is OK for vol .
3001 Mounted Volume: 000015L6
3001 Device ""Drive-1" (/dev/nst0)" is already mounted with Volume "000015L6"
This is my second bacula installation. My previous backup server (currently
running on Debian Jessie) has been successfully backing up half a dozen clients
for ten years on a different TL2000 using LTO3 tapes.
It started with an inability to label the tapes. Then I noticed that btape
allows you to do a label, so I went through and labeled each cartridge by hand
in btape, recognizing that I'd have to add them to the catalog by hand, hoping
that the problem was just with the label command. OK, wishful thinking...
All my volumes are now labeled and in the catalog:
*list volumes
Using Catalog "MyCatalog"
Pool: Default
+---------+------------+-----------+---------+----------+----------+--------------+---------+------+-----------+-----------+-------------+-----------+
| MediaId | VolumeName | VolStatus | Enabled | VolBytes | VolFiles |
VolRetention | Recycle | Slot | InChanger | MediaType | LastWritten | ExpiresIn
|
+---------+------------+-----------+---------+----------+----------+--------------+---------+------+-----------+-----------+-------------+-----------+
| 1 | CLNU00L1 | Cleaning | 1 | 1 | 0 |
31,536,000 | 1 | 4 | 1 | | NULL | NULL |
| 2 | CLN005L3 | Cleaning | 1 | 1 | 0 |
31,536,000 | 1 | 8 | 1 | | NULL | NULL |
| 3 | 000015L6 | Append | 1 | 0 | 0 |
31,536,000 | 1 | 1 | 1 | LTO6 | NULL | NULL |
| 5 | 000021L6 | Append | 1 | 0 | 0 |
31,536,000 | 1 | 3 | 1 | LTO6 | NULL | NULL |
| 6 | 000018L6 | Append | 1 | 0 | 0 |
31,536,000 | 1 | 2 | 1 | LTO6 | NULL | NULL |
| 7 | 000014L6 | Append | 1 | 0 | 0 |
31,536,000 | 1 | 5 | 1 | LTO6 | NULL | NULL |
| 8 | 000017L6 | Append | 1 | 0 | 0 |
31,536,000 | 1 | 6 | 1 | LTO6 | NULL | NULL |
| 9 | 000020L6 | Append | 1 | 0 | 0 |
31,536,000 | 1 | 7 | 1 | LTO6 | NULL | NULL |
| 10 | 000013L6 | Append | 1 | 0 | 0 |
31,536,000 | 1 | 9 | 1 | LTO6 | NULL | NULL |
| 11 | 000016L6 | Append | 1 | 0 | 0 |
31,536,000 | 1 | 10 | 1 | LTO6 | NULL | NULL |
| 12 | 000019L6 | Append | 1 | 0 | 0 |
31,536,000 | 1 | 11 | 1 | LTO6 | NULL | NULL |
| 13 | 000012L6 | Append | 1 | 0 | 0 |
31,536,000 | 1 | 12 | 1 | LTO6 | NULL | NULL |
| 14 | 000009L6 | Append | 1 | 0 | 0 |
31,536,000 | 1 | 13 | 1 | LTO6 | NULL | NULL |
| 15 | 000006L6 | Append | 1 | 0 | 0 |
31,536,000 | 1 | 14 | 1 | LTO6 | NULL | NULL |
| 16 | 000003L6 | Append | 1 | 0 | 0 |
31,536,000 | 1 | 15 | 1 | LTO6 | NULL | NULL |
| 17 | 000011L6 | Append | 1 | 0 | 0 |
31,536,000 | 1 | 16 | 1 | LTO6 | NULL | NULL |
| 18 | 000008L6 | Append | 1 | 0 | 0 |
31,536,000 | 1 | 17 | 1 | LTO6 | NULL | NULL |
| 19 | 000005L6 | Append | 1 | 0 | 0 |
31,536,000 | 1 | 18 | 1 | LTO6 | NULL | NULL |
| 20 | 000002L6 | Append | 1 | 0 | 0 |
31,536,000 | 1 | 19 | 1 | LTO6 | NULL | NULL |
| 23 | 000004L6 | Append | 1 | 0 | 0 |
31,536,000 | 1 | 22 | 1 | LTO6 | NULL | NULL |
| 24 | 000001L6 | Append | 1 | 0 | 0 |
31,536,000 | 1 | 23 | 1 | LTO6 | NULL | NULL |
| 25 | 000010L6 | Append | 1 | 0 | 0 |
31,536,000 | 1 | 20 | 1 | LTO6 | NULL | NULL |
| 26 | 000007L6 | Append | 1 | 0 | 0 |
31,536,000 | 1 | 21 | 1 | LTO6 | NULL | NULL |
+---------+------------+-----------+---------+----------+----------+--------------+---------+------+-----------+-----------+-------------+-----------+
Pool: File
No results to list.
Pool: Scratch
No results to list.
Still, when I try to do a backup, I still get *immediate* errors. No pause for
the system to go the the hardware, which is a hint that it's probably a
configuration error.
*run
A job name must be specified.
The defined Job resources are:
1: Odin
2: BackupCatalog
3: RestoreFiles
Select Job resource (1-3): 1
Run Backup job
JobName: Odin
Level: Incremental
Client: odin-fd
FileSet: Full Set Odin
Pool: Default (From Job resource)
Storage: Library2 (From Pool resource)
When: 2017-12-18 10:23:58
Priority: 10
OK to run? (yes/mod/no): yes
Job queued. JobId=46
You have messages.
*messages
18-Dec 10:24 odin-dir JobId 46: No prior Full backup Job record found.
18-Dec 10:24 odin-dir JobId 46: No prior or suitable Full backup found in
catalog. Doing FULL backup.
18-Dec 10:24 odin-dir JobId 46: Start Backup JobId 46,
Job=Odin.2017-12-18_10.24.00_08
18-Dec 10:24 odin-sd JobId 46: Fatal error: Device reservation failed for
JobId=46: 1000 OK VolName=000015L6 VolJobs=0 VolFiles=0 VolBlocks=0 VolBytes=0
VolABytes=0 VolHoleBytes=0 VolHoles=0 VolMounts=0 VolErrors=0 VolWrites=0
MaxVolBytes=0 VolCapacityBytes=0 VolStatus=Append Slot=1 MaxVolJobs=0
MaxVolFiles=0 InChanger=1 VolReadTime=0 VolWriteTime=0 EndFile=0 EndBlock=0
VolType=0 LabelType=0 MediaId=3 ScratchPoolId=0
18-Dec 10:24 odin-dir JobId 46: Fatal error:
Storage daemon didn't accept Device "AutochangerOdin" because:
3924 Device "AutochangerOdin" not in SD Device resources or no matching
Media Type.
18-Dec 10:24 odin-dir JobId 46: Error: Bacula odin-dir 7.4.4 (202Sep16):
Build OS: x86_64-pc-linux-gnu debian 9.0
JobId: 46
Job: Odin.2017-12-18_10.24.00_08
Backup Level: Full (upgraded from Incremental)
Client: "odin-fd"
FileSet: "Full Set Odin" 2017-08-09 23:05:00
Pool: "Default" (From Job resource)
Catalog: "MyCatalog" (From Client resource)
Storage: "Library2" (From Pool resource)
Scheduled time: 18-Dec-2017 10:23:58
Start time: 18-Dec-2017 10:24:02
End time: 18-Dec-2017 10:24:02
Elapsed time: 0 secs
Priority: 10
FD Files Written: 0
SD Files Written: 0
FD Bytes Written: 0 (0 B)
SD Bytes Written: 0 (0 B)
Rate: 0.0 KB/s
Software Compression: None
Snapshot/VSS: no
Encryption: no
Accurate: no
Volume name(s):
Volume Session Id: 2
Volume Session Time: 1513619722
Last Volume Bytes: 0 (0 B)
Non-fatal FD errors: 1
SD Errors: 0
FD termination status:
SD termination status:
Termination: *** Backup Error ***
What the error message seems to be telling me is that there is no device
"AutochangerOdin" defined in the storage daemon. The following is the stanza
for the autochanger in bacula-dir.conf:
#From bacula-dir.conf
# Definition of tape library device
Storage {
Name = Library2
# Do not use "localhost" here
Address = odin # N.B. Use a fully qualified name here
SDPort = 9103
Password = "mypassword"
Device = AutochangerOdin
Media Type = LTO6
Autochanger = yes # enable for autochanger device
}
Here is the section in bacula-sd.conf:
#From bacula-sd.conf
#
# An autochanger device with
# ONE drive!
Autochanger {
Name = AutochangerOdin
Device = Drive-1
Changer Command = "/etc/bacula/scripts/mtx-changer %c %o %S %a %d"
Changer Device = /dev/autochanger1
# Changer Device = /dev/tape/by-id/scsi-1IBM_3573-TL_00X2U78BZ022_LL0
}
Device {
Name = Drive-1 #
Description = "LT06 inside Dell TL2000 Library"
Drive Index = 0
Media Type = LT06
Archive Device = /dev/nst0
# Archive Device = /dev/tape/by-id/scsi-35000e11164c42001-nst
# Archive Device = /dev/tape/by-path/pci-0000:05:00.0-sas-phy2-lun-0-nst
AutomaticMount = yes; # when device opened, read it
AlwaysOpen = yes;
RemovableMedia = yes;
RandomAccess = no;
AutoChanger = yes
SpoolDirectory = "/var/spool/bacula"
MaximumSpoolSize = 485G
Maximum Network Buffer Size = 65536
Offline On Unmount = no
# #
# # Enable the Alert command only if you have the mtx package loaded
# # Note, apparently on some systems, tapeinfo resets the SCSI controller
# # thus if you turn this on, make sure it does not reset your SCSI
# # controller. I have never had any problems, and smartctl does
# # not seem to cause such problems.
# #
#Alert Command = "sh -c 'tapeinfo -f %c |grep TapeAlert|cat'"
# If you have smartctl, enable this, it has more info than tapeinfo
Alert Command = "sh -c 'smartctl -H -l error %c'"
}
My previous run at this (last fall) I got the advice to use by-id or by-path
devices, but that is irrelevent, because those are just symlinks to the real
devices, so I went back to the real devices. It might make a difference if I
was rebooting the system regularly and if I didn't have udev rules in place to
keep the devices constant, but it's not a factor here.
What I'm hoping for is that there *is* a misconfiguration that I'm just not
seeing. When I look at it, it seems fine to me, but I'm hoping someone else
can see what's not obvious to me. Why would the director think the device
"AutochangerOdin" does not exist? Or is that a red herring? Is it a problem
with the definition of the Media Type? The catalog shows a MediaType of LTO6,
but the error message from the failed backup shows a lot of parameters, but no
MediaType. Could this be related? How would I fix that?
Thanks for any advice.
Steve Garcia
Ignorance killed the cat, curiosity was framed.
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Bacula-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/bacula-users