Jim,
Thank you for the explanation. I have 'discovered' that this is a typical
situation that makes the system unstable.
Just out of curiosity: this morning it happened again. Below you can check
the log output. This time it was an HBA with an LSI 1068E chip (mpt driver);
the previous one was an LSI 2008 (mpt_sas driver).
In this case ZFS 'discovered' the error and was able to self-heal,
and the system is running smoothly.
Antonio
May 31 10:48:11 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:11 seal.macc.unican.es mpt_handle_event_sync:
IOCStatus=0x8000, IOCLogInfo=0x31123000
May 31 10:48:11 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:11 seal.macc.unican.es mpt_handle_event:
IOCStatus=0x8000, IOCLogInfo=0x31123000
May 31 10:48:13 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:13 seal.macc.unican.es Log info 0x31123000 received for
target 12.
May 31 10:48:13 seal.macc.unican.es scsi_status=0x0,
ioc_status=0x804b, scsi_state=0xc
May 31 10:48:13 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:13 seal.macc.unican.es Log info 0x31123000 received for
target 12.
May 31 10:48:13 seal.macc.unican.es scsi_status=0x0,
ioc_status=0x804b, scsi_state=0xc
May 31 10:48:13 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:13 seal.macc.unican.es Log info 0x31123000 received for
target 12.
May 31 10:48:13 seal.macc.unican.es scsi_status=0x0,
ioc_status=0x804b, scsi_state=0xc
May 31 10:48:13 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:13 seal.macc.unican.es Log info 0x31123000 received for
target 12.
May 31 10:48:13 seal.macc.unican.es scsi_status=0x0,
ioc_status=0x804b, scsi_state=0xc
May 31 10:48:16 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:16 seal.macc.unican.es mpt_handle_event_sync:
IOCStatus=0x8000, IOCLogInfo=0x31111000
May 31 10:48:16 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:16 seal.macc.unican.es mpt_handle_event:
IOCStatus=0x8000, IOCLogInfo=0x31111000
May 31 10:48:16 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:16 seal.macc.unican.es mpt_handle_event_sync:
IOCStatus=0x8000, IOCLogInfo=0x31112000
May 31 10:48:16 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:16 seal.macc.unican.es mpt_handle_event:
IOCStatus=0x8000, IOCLogInfo=0x31112000
May 31 10:48:17 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:17 seal.macc.unican.es Log info 0x31111000 received for
target 12.
May 31 10:48:17 seal.macc.unican.es scsi_status=0x0,
ioc_status=0x804b, scsi_state=0xc
May 31 10:48:20 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:20 seal.macc.unican.es SAS Discovery Error on port 0.
DiscoveryStatus is DiscoveryStatus is |Unaddressable device found|
May 31 10:48:22 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:22 seal.macc.unican.es mpt_handle_event_sync:
IOCStatus=0x8000, IOCLogInfo=0x31123000
May 31 10:48:22 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:22 seal.macc.unican.es mpt_handle_event:
IOCStatus=0x8000, IOCLogInfo=0x31123000
May 31 10:48:27 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:27 seal.macc.unican.es mpt_handle_event_sync:
IOCStatus=0x8000, IOCLogInfo=0x31111000
May 31 10:48:27 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:27 seal.macc.unican.es mpt_handle_event:
IOCStatus=0x8000, IOCLogInfo=0x31111000
May 31 10:48:27 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:27 seal.macc.unican.es mpt_handle_event_sync:
IOCStatus=0x8000, IOCLogInfo=0x31112000
May 31 10:48:27 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:27 seal.macc.unican.es mpt_handle_event:
IOCStatus=0x8000, IOCLogInfo=0x31112000
May 31 10:48:28 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:28 seal.macc.unican.es Log info 0x31111000 received for
target 12.
May 31 10:48:28 seal.macc.unican.es scsi_status=0x0,
ioc_status=0x804b, scsi_state=0xc
May 31 10:48:31 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:31 seal.macc.unican.es SAS Discovery Error on port 0.
DiscoveryStatus is DiscoveryStatus is |Unaddressable device found|
May 31 10:48:34 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:34 seal.macc.unican.es mpt_handle_event_sync:
IOCStatus=0x8000, IOCLogInfo=0x31123000
May 31 10:48:34 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:34 seal.macc.unican.es mpt_handle_event:
IOCStatus=0x8000, IOCLogInfo=0x31123000
May 31 10:48:38 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:38 seal.macc.unican.es mpt_handle_event_sync:
IOCStatus=0x8000, IOCLogInfo=0x31111000
May 31 10:48:38 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:38 seal.macc.unican.es mpt_handle_event:
IOCStatus=0x8000, IOCLogInfo=0x31111000
May 31 10:48:38 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:38 seal.macc.unican.es mpt_handle_event_sync:
IOCStatus=0x8000, IOCLogInfo=0x31112000
May 31 10:48:38 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:38 seal.macc.unican.es mpt_handle_event:
IOCStatus=0x8000, IOCLogInfo=0x31112000
May 31 10:48:40 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:40 seal.macc.unican.es Log info 0x31111000 received for
target 12.
May 31 10:48:40 seal.macc.unican.es scsi_status=0x0,
ioc_status=0x804b, scsi_state=0xc
May 31 10:48:43 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:43 seal.macc.unican.es SAS Discovery Error on port 0.
DiscoveryStatus is DiscoveryStatus is |Unaddressable device found|
May 31 10:48:45 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:45 seal.macc.unican.es mpt_handle_event_sync:
IOCStatus=0x8000, IOCLogInfo=0x31123000
May 31 10:48:45 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:45 seal.macc.unican.es mpt_handle_event:
IOCStatus=0x8000, IOCLogInfo=0x31123000
May 31 10:48:49 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:49 seal.macc.unican.es mpt_handle_event_sync:
IOCStatus=0x8000, IOCLogInfo=0x31111000
May 31 10:48:49 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:49 seal.macc.unican.es mpt_handle_event:
IOCStatus=0x8000, IOCLogInfo=0x31111000
May 31 10:48:49 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:49 seal.macc.unican.es mpt_handle_event_sync:
IOCStatus=0x8000, IOCLogInfo=0x31112000
May 31 10:48:49 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:49 seal.macc.unican.es mpt_handle_event:
IOCStatus=0x8000, IOCLogInfo=0x31112000
May 31 10:48:51 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:51 seal.macc.unican.es Log info 0x31111000 received for
target 12.
May 31 10:48:51 seal.macc.unican.es scsi_status=0x0,
ioc_status=0x804b, scsi_state=0xc
May 31 10:48:54 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:54 seal.macc.unican.es SAS Discovery Error on port 0.
DiscoveryStatus is DiscoveryStatus is |Unaddressable device found|
May 31 10:48:56 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:56 seal.macc.unican.es mpt_handle_event_sync:
IOCStatus=0x8000, IOCLogInfo=0x31123000
May 31 10:48:56 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:56 seal.macc.unican.es mpt_handle_event:
IOCStatus=0x8000, IOCLogInfo=0x31123000
May 31 10:48:59 seal.macc.unican.es scsi: [ID 107833 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:48:59 seal.macc.unican.es Disconnected command timeout for
Target 10
May 31 10:49:01 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:49:01 seal.macc.unican.es mpt_handle_event_sync:
IOCStatus=0x8000, IOCLogInfo=0x31111000
May 31 10:49:01 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:49:01 seal.macc.unican.es mpt_handle_event_sync:
IOCStatus=0x8000, IOCLogInfo=0x31112000
May 31 10:49:01 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:49:01 seal.macc.unican.es Log info 0x31140000 received for
target 10.
May 31 10:49:01 seal.macc.unican.es scsi_status=0x0,
ioc_status=0x8048, scsi_state=0xc
May 31 10:49:01 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:49:01 seal.macc.unican.es mpt_handle_event:
IOCStatus=0x8000, IOCLogInfo=0x31111000
May 31 10:49:01 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:49:01 seal.macc.unican.es mpt_handle_event:
IOCStatus=0x8000, IOCLogInfo=0x31112000
May 31 10:49:01 seal.macc.unican.es scsi: [ID 107833 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:49:01 seal.macc.unican.es passthrough command timeout
May 31 10:49:01 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:49:01 seal.macc.unican.es Rev. 8 LSI, Inc. 1068E found.
May 31 10:49:01 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:49:01 seal.macc.unican.es mpt2 supports power management.
May 31 10:49:02 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:49:02 seal.macc.unican.es mpt2: IOC Operational.
May 31 10:49:16 seal.macc.unican.es scsi: [ID 107833 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:49:16 seal.macc.unican.es Can only start 1 task management
command at a time
May 31 10:50:16 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:50:16 seal.macc.unican.es Rev. 8 LSI, Inc. 1068E found.
May 31 10:50:16 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:50:16 seal.macc.unican.es mpt2 supports power management.
May 31 10:50:16 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:50:16 seal.macc.unican.es mpt2: IOC Operational.
May 31 10:50:47 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:50:47 seal.macc.unican.es Rev. 8 LSI, Inc. 1068E found.
May 31 10:50:47 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:50:47 seal.macc.unican.es mpt2 supports power management.
May 31 10:50:50 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:50:50 seal.macc.unican.es mpt2: IOC Operational.
May 31 10:51:16 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:51:16 seal.macc.unican.es Rev. 8 LSI, Inc. 1068E found.
May 31 10:51:16 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:51:16 seal.macc.unican.es mpt2 supports power management.
May 31 10:51:20 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:51:20 seal.macc.unican.es mpt2: IOC Operational.
May 31 10:52:46 seal.macc.unican.es scsi: [ID 107833 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:52:46 seal.macc.unican.es Disconnected command timeout for
Target 11
May 31 10:52:47 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:52:47 seal.macc.unican.es Log info 0x31140000 received for
target 11.
May 31 10:52:47 seal.macc.unican.es scsi_status=0x0,
ioc_status=0x8048, scsi_state=0xc
May 31 10:52:47 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:52:47 seal.macc.unican.es Log info 0x31130000 received for
target 11.
May 31 10:52:47 seal.macc.unican.es scsi_status=0x0,
ioc_status=0x8048, scsi_state=0xc
May 31 10:52:47 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:52:47 seal.macc.unican.es Log info 0x31130000 received for
target 11.
May 31 10:52:47 seal.macc.unican.es scsi_status=0x0,
ioc_status=0x8048, scsi_state=0xc
May 31 10:52:47 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:52:47 seal.macc.unican.es Log info 0x31130000 received for
target 11.
May 31 10:52:47 seal.macc.unican.es scsi_status=0x0,
ioc_status=0x8048, scsi_state=0xc
May 31 10:52:47 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:52:47 seal.macc.unican.es Log info 0x31130000 received for
target 11.
May 31 10:52:47 seal.macc.unican.es scsi_status=0x0,
ioc_status=0x8048, scsi_state=0xc
May 31 10:52:51 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:52:51 seal.macc.unican.es mpt_handle_event_sync:
IOCStatus=0x8000, IOCLogInfo=0x31111000
May 31 10:52:51 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:52:51 seal.macc.unican.es mpt_handle_event:
IOCStatus=0x8000, IOCLogInfo=0x31111000
May 31 10:52:53 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:52:53 seal.macc.unican.es Log info 0x31111000 received for
target 11.
May 31 10:52:53 seal.macc.unican.es scsi_status=0x0,
ioc_status=0x804b, scsi_state=0xc
May 31 10:52:56 seal.macc.unican.es scsi: [ID 243001 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:52:56 seal.macc.unican.es SAS Discovery Error on port 0.
DiscoveryStatus is DiscoveryStatus is |Unaddressable device found|
May 31 10:53:37 seal.macc.unican.es scsi: [ID 107833 kern.warning]
WARNING: /pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:53:37 seal.macc.unican.es passthrough command timeout
May 31 10:53:37 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:53:37 seal.macc.unican.es Rev. 8 LSI, Inc. 1068E found.
May 31 10:53:37 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:53:37 seal.macc.unican.es mpt2 supports power management.
May 31 10:53:37 seal.macc.unican.es scsi: [ID 365881 kern.info]
/pci@7a,0/pci8086,3410@9/pci1000,3140@0 (mpt2):
May 31 10:53:37 seal.macc.unican.es mpt2: IOC Operational.
May 31 10:54:10 seal.macc.unican.es fmd: [ID 377184 daemon.error]
SUNW-MSG-ID: ZFS-8000-FD, TYPE: Fault, VER: 1, SEVERITY: Major
May 31 10:54:10 seal.macc.unican.es EVENT-TIME: Thu May 31 10:54:09 CEST
2012
May 31 10:54:10 seal.macc.unican.es PLATFORM: X8DTH-i-6-iF-6F, CSN:
1234567890, HOSTNAME: seal.macc.unican.es
May 31 10:54:10 seal.macc.unican.es SOURCE: zfs-diagnosis, REV: 1.0
May 31 10:54:10 seal.macc.unican.es EVENT-ID:
5d33a13b-61e3-cf16-86a7-e9587d510170
May 31 10:54:10 seal.macc.unican.es DESC: The number of I/O errors
associated with a ZFS device exceeded
May 31 10:54:10 seal.macc.unican.es acceptable levels. Refer
to http://sun.com/msg/ZFS-8000-FD for more information.
May 31 10:54:10 seal.macc.unican.es AUTO-RESPONSE: The device has been
offlined and marked as faulted. An attempt
May 31 10:54:10 seal.macc.unican.es will be made to activate a
hot spare if available.
May 31 10:54:10 seal.macc.unican.es IMPACT: Fault tolerance of the pool
may be compromised.
May 31 10:54:10 seal.macc.unican.es REC-ACTION: Run 'zpool status -x'
and replace the bad device.
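
To see at a glance which targets are involved, the "Log info" lines in a log
like the one above can be tallied per target. This is only a minimal sketch:
the function name count_log_info and the sample text are made up for
illustration, and the regular expression assumes the message wording shown in
this log, including the line wrap before "target N."

```python
import re
from collections import Counter

# Matches the "Log info" message even when the mail client wrapped the line
# before "target N." (assumption: wording exactly as in the log above).
LOG_INFO_RE = re.compile(r"Log info (0x[0-9a-fA-F]+) received for\s+target (\d+)")


def count_log_info(text):
    """Return a Counter keyed by (target, log_info_code)."""
    return Counter(
        (int(target), code.lower()) for code, target in LOG_INFO_RE.findall(text)
    )


# Hypothetical sample, copied in shape from the log above.
sample = """May 31 10:48:13 seal.macc.unican.es Log info 0x31123000 received for
target 12.
May 31 10:49:01 seal.macc.unican.es Log info 0x31140000 received for
target 10.
May 31 10:48:17 seal.macc.unican.es Log info 0x31111000 received for
target 12.
May 31 10:48:13 seal.macc.unican.es Log info 0x31123000 received for
target 12.
"""

if __name__ == "__main__":
    for (target, code), n in sorted(count_log_info(sample).items()):
        print(f"target {target}: {code} x{n}")
```

Feeding it the whole of /var/adm/messages (or wherever the log is kept) should
show whether the errors stay on one target or spread to its neighbours.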
--
Antonio S. Cofiño
Grupo de Meteorología de Santander
Dep. de Matemática Aplicada y
Ciencias de la Computación
Universidad de Cantabria
Escuela de Caminos
Avenida de los Castros, 44
39005 Santander, Spain
Tel: (+34) 942 20 1731
Fax: (+34) 942 20 1703
http://www.meteo.unican.es
mailto:antonio.cof...@unican.es
On 30/05/2012 18:52, Jim Klimov wrote:
2012-05-30 20:25, "Antonio S. Cofiño" wrote:
Dear All,
This may not be the correct mailing list, but I'm having a ZFS issue
when a disk is failing.
I hope other users might help more on specific details, but while
we're waiting for their answer - please search the list archives.
A similar description of the problem comes up every few months, and
it seems to be a fundamental flaw of (consumerish?) SATA drives
with backplanes, leading to reset storms.
I remember the mechanism being something like this: a problematic
disk is detected and the system tries to have it reset so that it
might stop causing problems. The SATA controller either ignores
the command or takes too long to complete/respond, so the system
goes up the stack and next resets the backplane or ultimately the
controller.
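
The escalation described above can be pictured with a toy model. This is only
an illustration of the idea as remembered here, not the mpt driver's actual
logic; the scope names and the reset_ok callback are hypothetical.

```python
# Toy model of the reset escalation: scopes are tried from narrowest to
# widest. All names are hypothetical, not taken from any driver.
SCOPES = ["disk", "backplane", "controller"]


def escalate(reset_ok, scopes=SCOPES):
    """Reset progressively wider scopes until one reset succeeds.

    reset_ok(scope) returns True if the reset at that scope completed in time.
    Returns the list of scopes that were reset, narrowest first.
    """
    attempted = []
    for scope in scopes:
        attempted.append(scope)
        if reset_ok(scope):
            break
    return attempted


if __name__ == "__main__":
    # A disk that ignores its own reset forces a backplane reset, which also
    # interrupts I/O to every healthy disk behind that backplane.
    print(escalate(lambda scope: scope != "disk"))  # ['disk', 'backplane']
```

In this picture, every step up the list widens the set of healthy disks whose
I/O is interrupted, which is how one bad drive can turn into a reset storm.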
I am not qualified to comment whether this issue is fundamental
(i.e. in SATA protocols) or incidental (cheap drives don't do
advanced stuff, while expensive SATAs might be ok in this regard).
There were discussions about using SATA-SAS interposers, but they
might not fit mechanically, add latency and instability, and raise
the system price to the point where native SAS disks would be
better...
Now, waiting for experts to chime in on whatever I missed ;)
HTH,
//Jim Klimov
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss