ok, i'd like to revisit this issue. i changed scsi cards and i still get scsi crashes from the btape test command. the new card is an adaptec 29320:

03:06.0 SCSI storage controller: Adaptec ASC-29320A U320 (rev 10)

i spent the morning running tar onto and off the LTO-4 drive:

[5:0:15:0] tape IBM ULTRIUM-TD4 7950 /dev/st0

with no issues. then i erased the tape and started with btape again:

./btape -c bacula-sd.conf /dev/nst0
Tape block granularity is 1024 bytes.
btape: butil.c:285 Using device: "/dev/nst0" for writing.
btape: btape.c:368 open device "LTO4" (/dev/nst0): OK
*test

=== Write, rewind, and re-read test ===

I'm going to write 1000 records and an EOF
then write 1000 records and an EOF, then rewind,
and re-read the data to verify that it is correct.

This is an *essential* feature ...

btape: btape.c:827 Wrote 1000 blocks of 64412 bytes.
btape: btape.c:501 Wrote 1 EOF to "LTO4" (/dev/nst0)
btape: btape.c:843 Wrote 1000 blocks of 64412 bytes.
btape: btape.c:501 Wrote 1 EOF to "LTO4" (/dev/nst0)
btape: btape.c:852 Rewind OK.
1000 blocks re-read correctly.

it hangs there, with this in the dmesg log:

dmesg
scsi5:A:15: no active SCB for reconnecting target - issuing BUS DEVICE RESET
SAVED_SCSIID == 0xf7, SAVED_LUN == 0x0, REG0 == 0xffff ACCUM = 0xc0
SEQ_FLAGS == 0xc0, SCBPTR == 0xc0, BTT == 0xffff, SINDEX == 0x1c0
SELID == 0xf0, SCB_SCSIID == 0x0, SCB_LUN == 0x0, SCB_CONTROL == 0x0
SCSIBUS[0] == 0x2, SCSISIGI == 0xc6
SXFRCTL0 == 0x88
SEQCTL0 == 0x0
>>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
scsi5: Dumping Card State at program address 0x161 Mode 0x33
Card was paused
INTSTAT[0x0] SELOID[0xf] SELID[0xf0] HS_MAILBOX[0x0]
INTCTL[0x80] SEQINTSTAT[0x0] SAVED_MODE[0x11] DFFSTAT[0x33]
SCSISIGI[0xc6] SCSIPHASE[0x20] SCSIBUS[0x2] LASTPHASE[0xc0]
SCSISEQ0[0x0] SCSISEQ1[0x12] SEQCTL0[0x0] SEQINTCTL[0x0]
SEQ_FLAGS[0xc0] SEQ_FLAGS2[0x0] QFREEZE_COUNT[0xfe]
KERNEL_QFREEZE_COUNT[0xfe] MK_MESSAGE_SCB[0xff00]
MK_MESSAGE_SCSIID[0xff] SSTAT0[0x2] SSTAT1[0x19]
SSTAT2[0x0] SSTAT3[0x0] PERRDIAG[0x0] SIMODE1[0xac]
LQISTAT0[0x0] LQISTAT1[0x0] LQISTAT2[0x0] LQOSTAT0[0x0]
LQOSTAT1[0x0] LQOSTAT2[0x0]

SCB Count = 4 CMDS_PENDING = 1 LASTSCB 0xffff CURRSCB 0x3 NEXTSCB 0x0
qinstart = 56099 qinfifonext = 56099
QINFIFO:
WAITING_TID_QUEUES:
Pending list:
3 FIFO_USE[0x0] SCB_CONTROL[0x64] SCB_SCSIID[0xf7]
Total 1
Kernel Free SCB list: 2 1 0
Sequencer Complete DMA-inprog list:
Sequencer Complete list:
Sequencer DMA-Up and Complete list:
Sequencer On QFreeze and Complete list:

scsi5: FIFO0 Free, LONGJMP == 0x80ff, SCB 0x0
SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]
SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0] SOFFCNT[0x0]
MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]

scsi5: FIFO1 Free, LONGJMP == 0x81f2, SCB 0x3
SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x4] DFSTATUS[0x89]
SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0] SOFFCNT[0x0]
MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]

LQIN: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
scsi5: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE = 0x52
scsi5: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x0
scsi5: SAVED_SCSIID = 0x0 SAVED_LUN = 0x0
SIMODE0[0xc] CCSCBCTL[0x4]
scsi5: REG0 == 0xffff, SINDEX = 0x1c0, DINDEX = 0x1be
scsi5: SCBPTR == 0xc0, SCB_NEXT == 0xff00, SCB_NEXT2 == 0x0
CDB c0 0 0 0 0 0
STACK: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
<<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>

so, in the end, i cannot get a successful btape test run with this LTO-4 drive on either of two different scsi cards. i guess my question is: is this a bacula btape issue, or an LTO or spectralogic scsi issue?

-- michael

On Mon, Oct 29, 2007 at 09:43:09PM -0400, Michael Galloway wrote:
> seem to be having some scsi problems with btape test. this test is with
> a spectra T50/LTO-4 attached via an LSI LSIU320 controller. i ran 100GB
> of data onto the drive with tar with no issue.
> but when i run this:
>
> ./btape -c bacula-sd.conf /dev/nst0
>
> test
>
> i get:
>
> *test
>
> === Write, rewind, and re-read test ===
>
> I'm going to write 1000 records and an EOF
> then write 1000 records and an EOF, then rewind,
> and re-read the data to verify that it is correct.
>
> This is an *essential* feature ...
>
> btape: btape.c:827 Wrote 1000 blocks of 64412 bytes.
> btape: btape.c:501 Wrote 1 EOF to "LTO4" (/dev/nst0)
> btape: btape.c:843 Wrote 1000 blocks of 64412 bytes.
> btape: btape.c:501 Wrote 1 EOF to "LTO4" (/dev/nst0)
> btape: btape.c:852 Rewind OK.
> 1000 blocks re-read correctly.
> 29-Oct 21:27 btape JobId 0: Error: block.c:995 Read error on fd=3 at file:blk
> 0:1000 on device "LTO4" (/dev/nst0). ERR=No such device or address.
> btape: btape.c:864 Read block 1001 failed! ERR=No such device or address
>
> and in the kernel ring buffer log:
>
> st0: Block limits 1 - 16777215 bytes.
> mptbase: ioc0: LogInfo(0x11010f00): F/W: bug! MID not found
> mptbase: ioc0: LogInfo(0x11010f00): F/W: bug! MID not found
> mptbase: ioc0: IOCStatus(0x004b): SCSI IOC Terminated
> st0: Error 80000 (sugg. bt 0x0, driver bt 0x0, host bt 0x8).
> mptscsih: ioc0: attempting task abort! (sc=ffff81011bf35240)
> st 5:0:15:0:
> command: Read(6): 08 00 00 fc 00 00
> mptbase: Initiating ioc0 recovery
> mptscsih: ioc0: task abort: SUCCESS (sc=ffff81011bf35240)
> mptbase: ioc0: IOCStatus(0x0043): SCSI Device Not There
> mptscsih: ioc0: attempting target reset! (sc=ffff81011bf35240)
> st 5:0:15:0:
> command: Read(6): 08 00 00 fc 00 00
> mptscsih: ioc0: target reset: SUCCESS (sc=ffff81011bf35240)
> mptbase: ioc0: IOCStatus(0x0043): SCSI Device Not There
> mptscsih: ioc0: attempting bus reset! (sc=ffff81011bf35240)
> st 5:0:15:0:
> command: Read(6): 08 00 00 fc 00 00
> mptscsih: ioc0: bus reset: SUCCESS (sc=ffff81011bf35240)
> mptbase: ioc0: IOCStatus(0x0047): SCSI Protocol Error
> mptscsih: ioc0: Attempting host reset! (sc=ffff81011bf35240)
> mptbase: Initiating ioc0 recovery
> mptbase: ioc0: IOCStatus(0x0047): SCSI Protocol Error
> st 5:0:15:0: scsi: Device offlined - not ready after error recovery
> st0: Error 80000 (sugg. bt 0x0, driver bt 0x0, host bt 0x8).
> target5:0:15: Beginning Domain Validation
> target5:0:15: Domain Validation skipping write tests
> target5:0:15: Ending Domain Validation
> target5:0:15: FAST-80 WIDE SCSI 160.0 MB/s DT (12.5 ns, offset 126)
> target5:0:15: Beginning Domain Validation
> target5:0:15: Domain Validation skipping write tests
> target5:0:15: Ending Domain Validation
> target5:0:15: FAST-80 WIDE SCSI 160.0 MB/s DT (12.5 ns, offset 126)
> target5:0:15: Beginning Domain Validation
> target5:0:15: Domain Validation skipping write tests
> target5:0:15: Ending Domain Validation
> target5:0:15: FAST-80 WIDE SCSI 160.0 MB/s DT (12.5 ns, offset 126)
> target5:0:15: Beginning Domain Validation
> target5:0:15: Domain Validation skipping write tests
> target5:0:15: Ending Domain Validation
> target5:0:15: FAST-80 WIDE SCSI 160.0 MB/s DT (12.5 ns, offset 126)
>
> i've reseated my cables and terminator. reseated the scsi card. any idea
> where the problem is? this is centOS 5, kernel is:
>
> 2.6.18-8.1.14.el5 #1 SMP Thu Sep 27 19:05:32 EDT 2007 x86_64 x86_64 x86_64
> GNU/Linux
>
> -- michael
>
> -------------------------------------------------------------------------
> This SF.net email is sponsored by: Splunk Inc.
> Still grepping through log files to find problems? Stop.
> Now Search log events and configuration files using AJAX and a browser.
> Download your FREE copy of Splunk now >> http://get.splunk.com/
> _______________________________________________
> Bacula-users mailing list
> Bacula-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/bacula-users
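
For anyone reading the kernel messages in the quoted report, the failing command and the st error word can be unpacked by hand. Below is a minimal Python sketch of that decoding; it is not part of Bacula or the kernel, and the function names are hypothetical. It assumes the standard 6-byte READ layout for tape devices and assumes the Linux 2.6 convention of packing the SCSI result as status, message, host and driver bytes from low to high, with the DID_* host byte names as found in the kernel's SCSI headers.

```python
# Hypothetical helpers for reading the kernel log lines quoted above.
# Assumption: READ(6) for tape devices is opcode 0x08, with FIXED in bit 0
# and SILI in bit 1 of byte 1, and a 24-bit big-endian transfer length in
# bytes 2..4.
# Assumption: Linux 2.6 packs the SCSI result word as
#   status | msg << 8 | host << 16 | driver << 24.

HOST_BYTE_NAMES = {
    0x00: "DID_OK", 0x01: "DID_NO_CONNECT", 0x02: "DID_BUS_BUSY",
    0x03: "DID_TIME_OUT", 0x04: "DID_BAD_TARGET", 0x05: "DID_ABORT",
    0x06: "DID_PARITY", 0x07: "DID_ERROR", 0x08: "DID_RESET",
    0x09: "DID_BAD_INTR",
}


def decode_read6(cdb_hex):
    """Split a READ(6) CDB, given as hex text, into its fields."""
    cdb = bytes.fromhex(cdb_hex.replace(" ", ""))
    if len(cdb) != 6 or cdb[0] != 0x08:
        raise ValueError("not a 6-byte READ CDB")
    return {
        "fixed": bool(cdb[1] & 0x01),   # False means variable-block mode
        "sili": bool(cdb[1] & 0x02),
        "transfer_length": int.from_bytes(cdb[2:5], "big"),
        "control": cdb[5],
    }


def decode_result(result):
    """Split the st 'Error NNNNN' word into its four bytes."""
    return {
        "status": result & 0xFF,
        "msg": (result >> 8) & 0xFF,
        "host": (result >> 16) & 0xFF,
        "driver": (result >> 24) & 0xFF,
    }


if __name__ == "__main__":
    # Values copied from the quoted kernel log.
    print(decode_read6("08 00 00 fc 00 00"))
    res = decode_result(0x80000)
    print(res, HOST_BYTE_NAMES.get(res["host"], "unknown"))
```

With the values from the log, the READ is a variable-block request for up to 0x00fc00 = 64512 bytes, and host byte 0x8 maps to DID_RESET, which matches the resets shown in the log. Neither value by itself says whether the drive, the cabling, or the controller is at fault.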