Jay wrote:
> hi *,
>
> i'm currently playing around with the setup of an opensolaris server as
> home nas and am experiencing occasional read/write problems with the 
> zfs pool.
>
> the short version (details below/attached):
> * 6-disk raidz pool attached to the sata controller on an nvidia MCP78S
>   chipset
> * first scrub of the pool with some data on it marks device sd5 as faulted
>   due to "WARNING: ahci0: watchdog port 5 satapkt 0xffffff01c7d8d660 timed out"
>   and a plethora of "Error for Command: read(10)" (see attached messages)
>   

Jay,
if you search the bugs database for this error message,
    http://bugs.opensolaris.org
you will find a number of hits.  Many possibly related bugs
have been fixed as of build 101 (b101), but there may be more.

You should also ask this question on the drivers-discuss forum
as that is where the device driver writers hang out.
 -- richard

> * these messages appeared also for sd1, sd2 and sd3, but only sd5 failed in
>   the end
> * replaced the disk, resilvering started
> * the same timeouts appear for sd0 and sd1 while resilvering, to prevent the
>   pool from failing completely, i (rather brute force) rebooted the machine
> * resilvering ends eventually, data seems intact
> * everything seems normal for a few days, reading/writing is ok, no errors
>   show up, the data is accessible
>
> today, i saw the same errors reported for sd4 in the logfile and when trying
> a 'zpool status' it became unresponsive, with timeouts showing up for sd0.
> after another reboot, everything still looks ok, zpool status is ok, read and
> write access are ok.
>
> the disks themselves should be ok, i had them running a burn-in before 
> installing opensolaris and the WD diagnostics passed them - even the faulted 
> one i replaced passed another test as being perfectly ok.
>
> can anybody shed some light on this? i'm guessing it's related to the sata 
> controller, but i'd appreciate any help or insight.
>
> (at the moment, i'm not really worried about data loss as you might guess
> from the brute force rebooting, all the data on the pool is also stored on an
> old linux machine. i'm reacquainting myself with solaris, so it's more or less
> a playground for now. but i'd like to replace the old linux server sometime -
> mainly because of zfs)
>
> thanks, 
> jay
>
>   
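
Since the timeouts were reported against several different disks, it may help
to tally them per controller port before searching the bug database. The
following is only a rough sketch: it assumes the log lines look exactly like
the watchdog warning quoted above, and the function name
count_watchdog_timeouts is made up for illustration.

```python
import re
from collections import Counter

# Assumed message shape, copied from the warning quoted in this thread:
#   WARNING: ahci0: watchdog port 5 satapkt 0xffffff01c7d8d660 timed out
TIMEOUT_RE = re.compile(
    r"(ahci\d+): watchdog port (\d+) satapkt 0x[0-9a-fA-F]+ timed out"
)


def count_watchdog_timeouts(lines):
    """Count watchdog timeouts per (controller, port) pair.

    `lines` is any iterable of log lines, e.g. an open file object.
    Lines that do not match the assumed message shape are ignored.
    """
    counts = Counter()
    for line in lines:
        match = TIMEOUT_RE.search(line)
        if match:
            controller, port = match.group(1), int(match.group(2))
            counts[(controller, port)] += 1
    return counts
```

Passing an open handle on the system log (on Solaris typically
/var/adm/messages) returns a Counter keyed by (controller, port). If the
timeouts are spread across most ports rather than concentrated on one, that
would be consistent with a controller or driver problem rather than a single
bad disk, though it does not prove it.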

_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
