On 10/28/2017 10:10 AM, Phil Stracchino wrote:
> On 10/28/17 04:15, Kern Sibbald wrote:
>> Hello,
>>
>> Thanks for the feedback.  Can you confirm that your Bacula signs on with 
>> version 9.0.5?  If so, it means that some recent patches that I have 
>> made for this problem (3-4 bug reports) solve the problem :-)
> 
> I will definitely download and test.

Hi Kern,

Version 9.0.5 from git seems to mitigate this issue. (with a caveat or two).

In my quick test here, I set up a restore job knowing full well that the
storage array with the volumes required to do the restore was powered off -
hoping to force Bacula to ask the operator for a volume and then wait. :)

Bacula attempted to reserve, and then access each of the 6 disk devices in the
autochanger. Of course it could not open any of them because the array was
off/dismounted. It properly warned me for each device, then tried to loop
through the 6 drives 3 more times (for a total of 4 loops), and then it marked
the job as:

"is waiting on Storage "aoe-file"

So far so good.

However, a few things:

1. It performed this loop every 30 seconds generating a lot of logging
2. It never mailed the operator to ask for a volume.
3. After pretty much exactly 10 minutes, the job was failed and a normal "job
failure" email was sent to the admin.

This second one might be the correct behavior since it is not trying to find a
volume, it simply cannot access the defined drive devices -- something to
think about... Should an operator be notified when a device cannot be opened
by the SD?

Attached is the first and last loop and job summary since it would just wrap
horribly in this email. :)


Kern, I understand that this test I just did may be a corner case and may just
be throwing a monkey wrench into the mix, but similar scenarios have been seen
in BEE Support, so it might be sensible to trap for this.

Best regards,

Bill



-- 
Bill Arlofski
http://www.revpol.com/bacula
-- Not responsible for anything below this line --
*llist joblog jobid=17968
    time: 2017-10-28 10:27:30
 logtext: bacula-dir JobId 17968: Start Restore Job 
Restore.2017-10-28_10.27.28_53

    time: 2017-10-28 10:27:30
 logtext: bacula-sd JobId 17968: Error: init_dev.c:123 [SE0001] Unable to stat 
device AoE-Changer0_drive0 at /mnt/Coraid/AoE-File0: ERR=Permission denied

    time: 2017-10-28 10:27:30
 logtext: bacula-sd JobId 17968: Warning: 
     Device "AoE-Changer0_drive0" in changer "AoE-Changer0" requested by DIR 
could not be opened or does not exist.

    time: 2017-10-28 10:27:30
 logtext: bacula-sd JobId 17968: Error: init_dev.c:123 [SE0001] Unable to stat 
device AoE-Changer0_drive1 at /mnt/Coraid/AoE-File0: ERR=Permission denied

    time: 2017-10-28 10:27:30
 logtext: bacula-sd JobId 17968: Warning: 
     Device "AoE-Changer0_drive1" in changer "AoE-Changer0" requested by DIR 
could not be opened or does not exist.

    time: 2017-10-28 10:27:30
 logtext: bacula-sd JobId 17968: Error: init_dev.c:123 [SE0001] Unable to stat 
device AoE-Changer0_drive2 at /mnt/Coraid/AoE-File0: ERR=Permission denied

    time: 2017-10-28 10:27:30
 logtext: bacula-sd JobId 17968: Warning: 
     Device "AoE-Changer0_drive2" in changer "AoE-Changer0" requested by DIR 
could not be opened or does not exist.

    time: 2017-10-28 10:27:30
 logtext: bacula-sd JobId 17968: Error: init_dev.c:123 [SE0001] Unable to stat 
device AoE-Changer0_drive3 at /mnt/Coraid/AoE-File0: ERR=Permission denied

    time: 2017-10-28 10:27:30
 logtext: bacula-sd JobId 17968: Warning: 
     Device "AoE-Changer0_drive3" in changer "AoE-Changer0" requested by DIR 
could not be opened or does not exist.

    time: 2017-10-28 10:27:30
 logtext: bacula-sd JobId 17968: Error: init_dev.c:123 [SE0001] Unable to stat 
device AoE-Changer0_drive4 at /mnt/Coraid/AoE-File0: ERR=Permission denied

    time: 2017-10-28 10:27:30
 logtext: bacula-sd JobId 17968: Warning: 
     Device "AoE-Changer0_drive4" in changer "AoE-Changer0" requested by DIR 
could not be opened or does not exist.

    time: 2017-10-28 10:27:30
 logtext: bacula-sd JobId 17968: Error: init_dev.c:123 [SE0001] Unable to stat 
device AoE-Changer0_drive5 at /mnt/Coraid/AoE-File0: ERR=Permission denied

    time: 2017-10-28 10:27:30
 logtext: bacula-sd JobId 17968: Warning: 
     Device "AoE-Changer0_drive5" in changer "AoE-Changer0" requested by DIR 
could not be opened or does not exist.

(about 3600 more log entries.... snipped)



(last loop and job summary)

    time: 2017-10-28 10:37:30
 logtext: bacula-sd JobId 17968: Error: init_dev.c:123 [SE0001] Unable to stat 
device AoE-Changer0_drive0 at /mnt/Coraid/AoE-File0: ERR=Permission denied

    time: 2017-10-28 10:37:30
 logtext: bacula-sd JobId 17968: Warning: 
     Device "AoE-Changer0_drive0" in changer "AoE-Changer0" requested by DIR 
could not be opened or does not exist.

    time: 2017-10-28 10:37:30
 logtext: bacula-sd JobId 17968: Error: init_dev.c:123 [SE0001] Unable to stat 
device AoE-Changer0_drive1 at /mnt/Coraid/AoE-File0: ERR=Permission denied

    time: 2017-10-28 10:37:30
 logtext: bacula-sd JobId 17968: Warning: 
     Device "AoE-Changer0_drive1" in changer "AoE-Changer0" requested by DIR 
could not be opened or does not exist.

    time: 2017-10-28 10:37:30
 logtext: bacula-sd JobId 17968: Error: init_dev.c:123 [SE0001] Unable to stat 
device AoE-Changer0_drive2 at /mnt/Coraid/AoE-File0: ERR=Permission denied

    time: 2017-10-28 10:37:30
 logtext: bacula-sd JobId 17968: Warning: 
     Device "AoE-Changer0_drive2" in changer "AoE-Changer0" requested by DIR 
could not be opened or does not exist.

    time: 2017-10-28 10:37:30
 logtext: bacula-sd JobId 17968: Error: init_dev.c:123 [SE0001] Unable to stat 
device AoE-Changer0_drive3 at /mnt/Coraid/AoE-File0: ERR=Permission denied

    time: 2017-10-28 10:37:30
 logtext: bacula-sd JobId 17968: Warning: 
     Device "AoE-Changer0_drive3" in changer "AoE-Changer0" requested by DIR 
could not be opened or does not exist.

    time: 2017-10-28 10:37:30
 logtext: bacula-sd JobId 17968: Error: init_dev.c:123 [SE0001] Unable to stat 
device AoE-Changer0_drive4 at /mnt/Coraid/AoE-File0: ERR=Permission denied

    time: 2017-10-28 10:37:30
 logtext: bacula-sd JobId 17968: Warning: 
     Device "AoE-Changer0_drive4" in changer "AoE-Changer0" requested by DIR 
could not be opened or does not exist.

    time: 2017-10-28 10:37:30
 logtext: bacula-sd JobId 17968: Error: init_dev.c:123 [SE0001] Unable to stat 
device AoE-Changer0_drive5 at /mnt/Coraid/AoE-File0: ERR=Permission denied

    time: 2017-10-28 10:37:30
 logtext: bacula-sd JobId 17968: Warning: 
     Device "AoE-Changer0_drive5" in changer "AoE-Changer0" requested by DIR 
could not be opened or does not exist.

    time: 2017-10-28 10:37:30
 logtext: bacula-sd JobId 17968: Fatal error: Device reservation failed for 
JobId=17968: Jmsg JobId=17968 type=5 level=1509208650 bacula-sd JobId 17968: 
Warning: 
     Device "AoE-Changer0_drive5" in changer "AoE-Changer0" requested by DIR 
could not be opened or does not exist.

    time: 2017-10-28 10:37:30
 logtext: bacula-dir JobId 17968: Fatal error: 
     Storage daemon didn't accept Device "AoE-Changer0" because:
     3924 Device "AoE-Changer0" not in SD Device resources or no matching Media 
Type or is disabled.

    time: 2017-10-28 10:37:31
 logtext: bacula-dir JobId 17968: Error: Bacula bacula-dir 9.0.5 (23Oct17):
  Build OS:               x86_64-pc-linux-gnu gentoo 
  JobId:                  17968
  Job:                    Restore.2017-10-28_10.27.28_53
  Restore Client:         speedy-fd
  Start time:             28-Oct-2017 10:27:30
  End time:               28-Oct-2017 10:37:30
  Files Expected:         632,217
  Files Restored:         0
  Bytes Restored:         0
  Rate:                   0.0 KB/s
  FD Errors:              1
  FD termination status:  
  SD termination status:  
  Termination:            *** Restore Error ***

    time: 2017-10-28 10:37:31
 logtext: bacula-dir JobId 17968: Error: Bacula bacula-dir 9.0.5 (23Oct17):
  Build OS:               x86_64-pc-linux-gnu gentoo 
  JobId:                  17968
  Job:                    Restore.2017-10-28_10.27.28_53
  Restore Client:         speedy-fd
  Start time:             28-Oct-2017 10:27:30
  End time:               28-Oct-2017 10:37:31
  Files Expected:         632,217
  Files Restored:         0
  Bytes Restored:         0
  Rate:                   0.0 KB/s
  FD Errors:              2
  FD termination status:  
  SD termination status:  
  Termination:            *** Restore Error ***

           jobid: 17,968
             job: Restore.2017-10-28_10.27.28_53
            name: Restore
     purgedfiles: 0
            type: R
           level: F
        clientid: 2
      clientname: speedy-fd
       jobstatus: f
       schedtime: 2017-10-28 10:27:25
       starttime: 2017-10-28 10:27:30
         endtime: 2017-10-28 10:37:31
     realendtime: 2017-10-28 10:37:31
        jobtdate: 1,509,208,651
    volsessionid: 1
  volsessiontime: 1,508,817,356
        jobfiles: 0
        jobbytes: 0
       readbytes: 0
       joberrors: 2
 jobmissingfiles: 0
          poolid: 0
        poolname: 
      priorjobid: 0
       filesetid: 0
         fileset: 
         hasbase: 0
        hascache: 0
         comment: 

*@tall
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

Reply via email to