Kern Sibbald wrote:
On Thursday 17 November 2005 11:01, Julien Cigar wrote:
  
I think I found the problem.
I changed the scsi id of the tape drive from ID 0 to ID 6 and it *seems*
to work better (I did a full backup and no problems occured yet). I read
that apparently ID 0 and ID 1 should not be used for tape drives (it's
for booting devices only ...)
    

Interesting!  I'm pleased to hear that you solved the problem, and it only 
goes to show how complicated hardware can be at times ...

  
The SCSI IDs, in order of priority, are: 07,06, ..., 00, 15, 14,..., 08, where 07 is the highest. Think of it as interrupt levels in a real-time OS, which is pretty much what it is. The controller itself should be the highest priority device so that it always responds to commands. Tape drives should be on high priority IDs so that commands to the tape drives are not queued for too long, else the device driver will see it as a SCSI timeout error. It all depends on what the device driver does when a SCSI command times out. For block random access devices, sd can easily just retry the command some number of times. The st driver has the problem that it is not always so easy to just retry a command to a streaming device, so perhaps it is not as forgiving as the sd driver. Easier to avoid problems by putting the streaming devices at the higher priority IDs and random access devices at the lower.

  
Kern Sibbald wrote:
    
On Monday 07 November 2005 15:15, Julien Cigar wrote:
      
You think I can ignore the problem ?
When I list media you can see that the tape Weekly-0003 has not been
filled up completely (it was the tape with write errors ...)
        
Well, that answers the question I had: the drive could not recover from
the error, and it *did* notify Bacula.  In principle, Bacula stopped
writing when the tape/drive got the error, so your backup should be good,
but personally, I would be very worried whether the data on the tape was
readable or not.

At a minimum, I would start doing Verifys of all my tape jobs to ensure
that the tape can be read -- they are slow, but at least you will know. 
I would also try doing a btape "fill" on the tape that got the errors
(after ensuring that all the files are backed up elsewhere). If it fails,
I would trash the tape.

The best solution is to get a DLT or SDLT drive.

If you cannot, one thing that I noticed here with my HP drive is that
tapes from some manufacturers fail all the time, and tapes from others
last two years before failing (under heavy use).  For example, if I
remember right, I have no problem with Sony tapes in my drive, but
Imation tapes cause me problems (or vise-versa). I forget because I only
use the drive for testing, and I frequently need to throw out tapes.  The
only way you can know is to try tapes from different manufacturers in
your drive, and the only thing I can say for sure, is that for my drive,
there is a lot of difference between tapes from different manufacturers.

      
*list media
Pool: WeeklyFullBackupPool
+---------+-------------+-----------+-------------+----------+-----------
-- -+---------+------+-----------+-----------+---------------------+

| MediaId | VolumeName  | VolStatus | VolBytes    | VolFiles |

VolRetention | Recycle | Slot | InChanger | MediaType |
LastWritten         |
+---------+-------------+-----------+-------------+----------+-----------
-- -+---------+------+-----------+-----------+---------------------+

| 1       | Weekly-0001 | Full      | 71563809646 | 73       |

2592000      | 1       | 0    | 0         | DAT       | 2005-10-28
10:38:32 |

| 2       | Weekly-0002 | Full      | 79126531739 | 79       |

2592000      | 1       | 0    | 0         | DAT       | 2005-10-30
11:28:16 |

| 3       | Weekly-0003 | Full      | 45914843881 | 47       |

2592000      | 1       | 0    | 0         | DAT       | 2005-11-07
06:01:46 |

| 4       | Weekly-0004 | Full      | 81241762264 | 81       |

2592000      | 1       | 0    | 0         | DAT       | 2005-11-07
12:17:00 |

However, the jobs terminate with a "Backup OK" termination ... strange

Kern Sibbald wrote:
        
On Monday 07 November 2005 09:04, Julien Cigar wrote:
          
Hello,

I writed a mail some weeks ago for a problem with my Sony drive (write
errors).
The drive is a Sony SDX-500C on a Adaptec 2940 Ultra SCSI.
Someone told me to upgrade the bios of the drive, which I did. It
worked fine for a week, but now I'm still having writing error
messages :

Nov  7 06:01:46 localhost kernel: st0: Error with sense data: <6>st0:
Current: sense key: Medium Error
Nov  7 06:01:46 localhost kernel:     Additional sense: Write append
error Nov  7 06:01:46 localhost kernel: Info fld=0xfc00
Nov  7 06:01:46 localhost kernel: st0: Error with sense data: <6>st0:
Current: sense key: Medium Error
Nov  7 06:01:46 localhost kernel:     Additional sense: Write append
error Nov  7 06:01:46 localhost kernel: Info fld=0x1
Nov  7 08:42:26 localhost kernel: st0: Error with sense data: <6>st0:
Current: sense key: Unit Attention
Nov  7 08:42:26 localhost kernel:     Additional sense: Not ready to
ready change, medium may have changed
Nov  7 08:47:37 localhost kernel: st0: MTSETDRVBUFFER only allowed for
root.

What happens is, it backs up perfectly fine for a while backup,
restoring from tape works too! Then all of a sudden it starts giving an
error ...
I don't think it's tape related, the tapes are new and the problem
occurs only randomly ...
I've checked the cable too

Any idea welcomed ...
            
All the errors at 06:01 appear to me to be typical write errors that one
often gets on DDS tapes. From what I see, the drive recovered and
continued. I doubt that Bacula was even aware of the problem -- though
you can check in the Job report.

The errors are 8:42 are a bit worrysome. I would attempt to correlate
them with what Bacula was doing.  It may only be that Bacula asked for
a new tape and tried to read while there was nothing in the drive, or
it may be some real failure.

The "error" at 8:47 is apparently after you restarted the FD. It
attempts to set the tape to variable tape format, and your kernel
forbids the request (ioctl) if the SD is not being run as root. This
can be ignored if your tape drive mode is set properly before starting
the SD.

          
----------
Here's the output of /proc/scsi/scsi :

phoenix:/home/jcigar# cat /proc/scsi/scsi
Attached devices:
Host: scsi0 Channel: 00 Id: 01 Lun: 00
Vendor: SONY     Model: SDX-500C         Rev: 0204
Type:   Sequential-Access                ANSI SCSI revision: 02

----------
Here's the output of /proc/scsi/aic7xxx/0 :

phoenix:/home/jcigar# cat /proc/scsi/aic7xxx/0
Adaptec AIC7xxx driver version: 6.2.36
Adaptec 2940 Ultra SCSI adapter
aic7880: Ultra Wide Channel A, SCSI Id=0, 16/253 SCBs
Allocated SCBs: 4, SG List Length: 128

Serial EEPROM:
0x0238 0x0238 0x0238 0x0238 0x0238 0x0238 0x0238 0x0238
0x0238 0x0238 0x0238 0x0238 0x0238 0x0238 0x0238 0x0238
0x18b6 0x005d 0x2800 0x0010 0xff00 0xffff 0xffff 0xffff
0xffff 0xffff 0xffff 0xffff 0xffff 0xffff 0x00ff 0x6499

Target 0 Negotiation Settings
      User: 20.000MB/s transfers (10.000MHz, offset 127, 16bit)
Target 1 Negotiation Settings
      User: 20.000MB/s transfers (10.000MHz, offset 127, 16bit)
      Goal: 40.000MB/s transfers (20.000MHz, offset 8, 16bit)
      Curr: 40.000MB/s transfers (20.000MHz, offset 8, 16bit)
      Channel A Target 1 Lun 0 Settings
              Commands Queued 3051621
              Commands Active 0
              Command Openings 1
              Max Tagged Openings 0
              Device Queue Frozen Count 0
Target 2 Negotiation Settings
      User: 20.000MB/s transfers (10.000MHz, offset 127, 16bit)
Target 3 Negotiation Settings
      User: 20.000MB/s transfers (10.000MHz, offset 127, 16bit)
Target 4 Negotiation Settings
      User: 20.000MB/s transfers (10.000MHz, offset 127, 16bit)
Target 5 Negotiation Settings
      User: 20.000MB/s transfers (10.000MHz, offset 127, 16bit)
Target 6 Negotiation Settings
      User: 20.000MB/s transfers (10.000MHz, offset 127, 16bit)
Target 7 Negotiation Settings
      User: 20.000MB/s transfers (10.000MHz, offset 127, 16bit)
Target 8 Negotiation Settings
      User: 20.000MB/s transfers (10.000MHz, offset 127, 16bit)
Target 9 Negotiation Settings
      User: 20.000MB/s transfers (10.000MHz, offset 127, 16bit)
Target 10 Negotiation Settings
      User: 20.000MB/s transfers (10.000MHz, offset 127, 16bit)
Target 11 Negotiation Settings
      User: 20.000MB/s transfers (10.000MHz, offset 127, 16bit)
Target 12 Negotiation Settings
      User: 20.000MB/s transfers (10.000MHz, offset 127, 16bit)
Target 13 Negotiation Settings
      User: 20.000MB/s transfers (10.000MHz, offset 127, 16bit)
Target 14 Negotiation Settings
      User: 20.000MB/s transfers (10.000MHz, offset 127, 16bit)
Target 15 Negotiation Settings
      User: 20.000MB/s transfers (10.000MHz, offset 127, 16bit)



-------------------------------------------------------
SF.Net email is sponsored by:
Tame your development challenges with Apache's Geronimo App Server.
Download it for free - -and be entered to win a 42" plasma tv or your
very own Sony(tm)PSP.  Click here to play:
http://sourceforge.net/geronimo.php
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users
            
-------------------------------------------------------
SF.Net email is sponsored by:
Tame your development challenges with Apache's Geronimo App Server.
Download it for free - -and be entered to win a 42" plasma tv or your
very own Sony(tm)PSP.  Click here to play:
http://sourceforge.net/geronimo.php
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users
        
-------------------------------------------------------
This SF.Net email is sponsored by the JBoss Inc.  Get Certified Today
Register for a JBoss Training Course.  Free Certification Exam
for All Training Attendees Through End of 2005. For more info visit:
http://ads.osdn.com/?ad_id=7628&alloc_id=16845&op=click
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users
    


-------------------------------------------------------
This SF.Net email is sponsored by the JBoss Inc.  Get Certified Today
Register for a JBoss Training Course.  Free Certification Exam
for All Training Attendees Through End of 2005. For more info visit:
http://ads.osdn.com/?ad_id=7628&alloc_id=16845&op=click
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users
  

Reply via email to