Hi everybody,

I’ve got the following environment:

TSM server 5.1.1.6 on Windows 2000 SP2 (IBM Netfinity server xSeries 232)
IBM LTO library 3583, 2 LTO drives and 18 slots
The library is connected through two Adaptec 29160 Ultra160 SCSI controllers 
(driver name: Adaptec, version 6.1.530.201, date 5/14/2002). The first controller goes 
to one drive, and the other controller to the robot arm and the second drive.
The IBM LTO device drivers are from IBM Corporation, version 5.0.3.2.

I get a failure when I back up the primary storage pool on LTO to 
the copy storage pool. The copy stops when large files (larger than 2 GB) need to be 
transferred from tape to tape. A write error appears in the activity log, 
the copy storage pool tape is set to read-only, and another scratch volume 
is allocated to the copy storage pool. The backup continues for a while until the next 
large file is encountered. The errors in the activity log are:

12/20/2002 09:55:43   ANR8302E I/O error on drive DRIVE1 (mt0.0.0.3) (OP=WRITE,
                       Error Number=121, CC=0, KEY=00, ASC=00, ASCQ=00,
                       SENSE=**NONE**, Description=An undetermined error has
                       occurred).  Refer to Appendix D in the 'Messages' manual
                       for recommended action.
12/20/2002 09:55:43   ANR1411W Access mode for volume 000020L1 now set to
                       "read-only" due to write error.
12/20/2002 10:04:31   ANR8302E I/O error on drive DRIVE1 (mt0.0.0.3) (OP=LOCATE,
                       Error Number=1104, CC=0, KEY=08, ASC=14, ASCQ=03,
                       SENSE=70.00.08.00.00.00.00.1C.00.00.00.00.14.03.00.00.20-
                       .76.00.00.00.00.00.00.00.00.00.00.00.05.00.00.9A.8D.00.0-
                       0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.-
                       00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00-
                       .00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0-
                       0.00.00.00.00, Description=An undetermined error has
                       occurred).  Refer to Appendix D in the 'Messages' manual
                       for recommended action.
12/20/2002 10:04:31   ANR1411W Access mode for volume 000010L1 now set to
                      "read-only" due to write error.

12/20/2002 10:44:24   ANR8302E I/O error on drive DRIVE1 (mt0.0.0.3) (OP=WRITE,
                       Error Number=121, CC=0, KEY=00, ASC=00, ASCQ=00,
                       SENSE=**NONE**, Description=An undetermined error has
                       occurred).  Refer to Appendix D in the 'Messages' manual
                       for recommended action.
12/20/2002 10:44:24   ANR1411W Access mode for volume LA0014L1 now set to
                       "read-only" due to write error.

12/20/2002 11:11:13   ANR8302E I/O error on drive DRIVE1 (mt0.0.0.3) (OP=WRITE,
                       Error Number=121, CC=0, KEY=00, ASC=00, ASCQ=00,
                       SENSE=**NONE**, Description=An undetermined error has
                       occurred).  Refer to Appendix D in the 'Messages' manual
                       for recommended action.
12/20/2002 11:11:13   ANR1411W Access mode for volume LA0015L1 now set to
                       "read-only" due to write error.
12/20/2002 11:31:06   ANR2017I Administrator ADMIN issued command: QUERY ACTLOG
                       begindate=today-1 begintime=08:00 enddate=today
                       endtime=now search=error
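
As a reading aid for the entries above, here is a minimal Python sketch that decodes the numbers in the ANR8302E messages. It rests on two assumptions that are not confirmed by the log itself: that the SENSE string is SCSI fixed-format sense data (response code 0x70), and that "Error Number" is a Win32 system error code. The function name and the lookup tables are only illustrative and list just the values relevant here.

```python
# Sketch only: assumes fixed-format SCSI sense data and Win32 error codes.

# Subset of SCSI sense keys (low nibble of sense byte 2).
SENSE_KEYS = {
    0x0: "NO SENSE",
    0x3: "MEDIUM ERROR",
    0x4: "HARDWARE ERROR",
    0x8: "BLANK CHECK",
    0xB: "ABORTED COMMAND",
}

# Subset of Win32 system error codes seen in the log above.
WIN32_ERRORS = {
    121: "ERROR_SEM_TIMEOUT (the semaphore timeout period has expired)",
    1104: "ERROR_NO_DATA_DETECTED (no more data is on the tape)",
}


def decode_fixed_sense(dotted):
    """Decode a dotted hex string such as '70.00.08...' into key/ASC/ASCQ."""
    data = bytes(int(part, 16) for part in dotted.split("."))
    if len(data) < 14:
        raise ValueError("need at least 14 sense bytes")
    if data[0] & 0x7F not in (0x70, 0x71):
        raise ValueError("not fixed-format sense data")
    key = data[2] & 0x0F
    return {
        "key": key,
        "key_name": SENSE_KEYS.get(key, "unknown"),
        "asc": data[12],   # additional sense code
        "ascq": data[13],  # additional sense code qualifier
    }


if __name__ == "__main__":
    # First 16 sense bytes of the 10:04:31 LOCATE error.
    sense = decode_fixed_sense("70.00.08.00.00.00.00.1C.00.00.00.00.14.03.00.00")
    print(sense)
    print(WIN32_ERRORS[121])
    print(WIN32_ERRORS[1104])
```

Under those assumptions, the WRITE failures (Error Number=121, no sense data) are timeouts reported by Windows rather than errors reported by the drive, while the LOCATE failure decodes to the same KEY=08, ASC=14, ASCQ=03 that the message already shows.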

In the event viewer of the TSM server, I’ve got the following error:

Source: adpu160m
Type: Error
Category: None
Event ID: 9
Description: The device, \Device\Scsi\adpu160m2, did not respond within the timeout 
period.

So there seems to be a timeout somewhere during the tape-to-tape copy of the large files.

Does anybody know how to solve this problem? How can the timeout be increased? The 
backup of the clients to the disk storage pool is fine, and the migration of the disk 
storage pool to the LTO pool works without any problems as well. Is this a hardware 
problem or a TSM problem?

Any help/input would be greatly appreciated,

Kurt
