Do you have sata Native Command Queuing enabled?  I've experienced delays of 
just under one minute when NCQ is enabled, that do not occur when NCQ is 
disabled.  If all threads comprising the parallel zfs destroy hang for a 
minute, I bet  its the hang that causes "no more processes".  I have opened a 
trouble ticket on this issue, and am waiting for feedback.  In the mean time, 
I've disabled NCQ by adding the line below to /etc/system (and rebooting).

       set sata:sata_func_enable = 0x5

While this probably incurs some preformance penalty, its better than the one 
minute hangs.

The following is a typical log entry that appears at the conclusion of a one 
minute period
"stall".

Oct 21 07:56:09 host marvell88sx: [ID 670675 kern.info] NOTICE: marvell88sx1: 
device on port 0 reset: DMA command timeout
Oct 21 07:56:09 host sata: [ID 801593 kern.notice] NOTICE: /[EMAIL 
PROTECTED],0/pci1022,[EMAIL PROTECTED]/pci11ab,[EMAIL PROTECTED]:
Oct 21 07:56:09 host  port 0: device reset
Oct 21 07:56:09 host marvell88sx: [ID 670675 kern.info] NOTICE: marvell88sx1: 
device on port 0 reset: device disconnected or
 device error
Oct 21 07:56:09 host sata: [ID 801593 kern.notice] NOTICE: /[EMAIL 
PROTECTED],0/pci1022,[EMAIL PROTECTED]/pci11ab,[EMAIL PROTECTED]:
Oct 21 07:56:09 host  port 0: device reset
Oct 21 07:56:09 host sata: [ID 801593 kern.notice] NOTICE: /[EMAIL 
PROTECTED],0/pci1022,[EMAIL PROTECTED]/pci11ab,[EMAIL PROTECTED]:
Oct 21 07:56:09 host  port 0: link lost
Oct 21 07:56:09 host sata: [ID 801593 kern.notice] NOTICE: /[EMAIL 
PROTECTED],0/pci1022,[EMAIL PROTECTED]/pci11ab,[EMAIL PROTECTED]:
Oct 21 07:56:09 host  port 0: link established
Oct 21 07:56:09 host marvell88sx: [ID 812950 kern.warning] WARNING: 
marvell88sx1: error on port 0:
Oct 21 07:56:09 host marvell88sx: [ID 517869 kern.info]        device 
disconnected
Oct 21 07:56:09 host marvell88sx: [ID 517869 kern.info]        device connected
Oct 21 07:56:09 host scsi: [ID 107833 kern.warning] WARNING: /[EMAIL 
PROTECTED],0/pci1022,[EMAIL PROTECTED]/pci11ab,[EMAIL PROTECTED]/[EMAIL 
PROTECTED],0 (sd6):
Oct 21 07:56:09 host   Error for Command: write(10)               Error Level: 
Retryable
Oct 21 07:56:09 host scsi: [ID 107833 kern.notice]     Requested Block: 
376060962                 Error Block: 376060962
Oct 21 07:56:09 host scsi: [ID 107833 kern.notice]     Vendor: ATA              
                  Serial Number:
  
Oct 21 07:56:09 host scsi: [ID 107833 kern.notice]     Sense Key: No Additional 
Sense
Oct 21 07:56:09 host scsi: [ID 107833 kern.notice]     ASC: 0x0 (no additional 
sense info), ASCQ: 0x0, FRU: 0x0d
 
 
This message posted from opensolaris.org
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Reply via email to