I'm running 4.4-REL on a bunch of older systems with onboard Adaptec 7870
controllers and recently I've been getting lots of console output from one
of them.  I'm pretty sure it means that one of my drives is dying (which
I've suyspected for a while), but I'm just curious as to what the messages
really mean, and if I can determine the bad drive just from the messages.

The dmesg information (hardware probes):

ahc0: <Adaptec aic7870 SCSI adapter)> port 0xf800-0xf8ff mem
0xffbef000-0xffbeffff irq 11 at device 11.0 on pci0
aic7870: Wide Channel A, SCSI Id=7, 16/255 SCBs
da0 at ahc0 bus 0 target 0 lun 0
da0: <SEAGTE ST32430N 0510> Fixed Direct Access SCSI-2 device
da0: 10.000MB/s transfers (10.000MHz, offset 15), Tagged Queuing Enabled
da0: 2049MB (4197405 512 byte sectors: 64H 32S/T 2049C)
da1 at ahc0 bus 0 targer 4 lun 0
da1: <FUJITSU M2694E-512 8134> Fixed Direct Access SCSI-CCS device
da1: 3.300MB/s transfers
da1: 1033MB (2117025 512 byte sectors: 64H 32S/T 1033C)
da2 at ahc0 bus 0 target 6 lun 0
da2: <OEM DCRS04Z 0101> Fixed Direct Access SCSI-2 device
da2: 10.000MB/s transfer (10.000MHz, offset 15), Tagged Queueing Enabled
da2: 4340MB (8888543 512 byte sectors: 64H 32S/T 4340C)

The console error messages:

(da0:ahc0:0:0:0): BDR message in message buffer
(da0:ahc0:0:0:0): SCB 0xe - timed out
ahc0: Dumping Card State in Data-in phase, at SEQADDR 0x7a
< snip dump data >
(da0:ahc0:0:0:0): no longer in timeout, status = 34b
ahc0: Issued Channel A Bus Reset.  3 SCBs aborted

I get these kinds of errors on da0, da1 and da2.  However, I only see this
message on da1:

(da1:ahc0:0:4:0): Unexpected busfree in Data-in phase

Is this the error that triggers off all the bus reset (and subsequent
timeouts and aborts)?  Should i look at replacing da1 real soon now?

--
Matt Emmerton



To Unsubscribe: send mail to [EMAIL PROTECTED]
with "unsubscribe freebsd-questions" in the body of the message

Reply via email to