On Tue, Apr 21, 2009 at 09:34:57AM -0500, Patrick Skerrett wrote:
> I'm fighting with an identical problem here & am very interested in this 
> thread.
> 
> Solaris 10 127112-11 boxes running ZFS on a fiberchannel raid5 device 
> (hardware raid).

You are about a year behind in kernel patches.  There is one patch
that addresses similar problems.  I'd recommend installing all of
the new patches.  This bug seems to be relevant:

    http://bugs.opensolaris.org/bugdatabase/view_bug.do?bug_id=6535160

> Randomly one lun on a machine will stop writing for about 10-15 minutes 
> (during a busy time of day), and then all of a sudden become active with 
> a burst of activity. Reads will continue to happen.

One thing that seems to have solved our hang and stall problems is
setting the kernel variable `pg_contig_disable=1'.  I believe that
only systems with Niagara CPUs are affected.  It has to do with the
kernel code that handles two different sizes of memory pages.  You
can find more information here:

    http://forums.sun.com/thread.jspa?threadID=5257060
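
For reference, a minimal sketch of how such a variable is usually
set on Solaris 10, assuming the standard /etc/system mechanism
(check it against the thread above before applying; a reboot is
needed for it to take effect):

    * /etc/system: workaround for the hang/stall problem
    set pg_contig_disable=1

After the reboot, the value can be read back with mdb as root:

    echo "pg_contig_disable/D" | mdb -k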

Also, open a support case with Sun if you haven't already.

-- 
-Gary Mills-    -Unix Support-    -U of M Academic Computing and Networking-
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
