On 01/25/11 06:52 AM, Ashley Nicholls wrote:
Hello all,
I'm having a problem that I find difficult to diagnose.
I have an IBM x3550 M3 running Nexenta Core Platform 3.0.1 (b134f) with
seven 6-disk RAIDZ2 vdevs (see listing at bottom).
Every day a disk fails with "Too many checksum errors", is marked as
degraded, and is rebuilt onto a hot spare. I've been running 'zpool
detach zpool002 <degraded disk>' to remove the degraded disk from the
pool and return the pool's status to ONLINE. Later that day (or
sometimes the next day) another disk is marked as degraded due to
checksum errors and is rebuilt onto a hot spare again; rinse, repeat.
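For reference, the daily cycle looks roughly like this (the pool name
is real, but c2t5d0 below is a placeholder, not one of the actual
devices):

  # show which disk is degraded and which spare has taken over
  zpool status -x zpool002

  # once the spare has finished resilvering, detach the degraded disk
  # so the spare takes its place permanently
  zpool detach zpool002 c2t5d0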
We've been logging this for the past few days, and there are a few
things worth noting:
1. The disk that fails appears to be the hot spare that we rebuilt
onto the previous time.
2. If I don't detach the degraded disk, the newly rebuilt hot spare
does not seem to fail.
I'm running a scrub now to confirm there are no further checksum
errors, and then I'll detach the 'degraded' drive from the pool and
see whether the new hot spare fails within the next 24 hours. Has
anyone seen this before?
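FWIW, the check I'm doing is roughly:

  # scrub the pool, then inspect the per-device checksum counters
  zpool scrub zpool002
  zpool status -v zpool002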
I used to see these all the time on a Thumper. They magically vanished
when I upgraded the drive firmware.
Check whether your drives' firmware is up to date.
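On Solaris-based systems, something like this will dump the firmware
revision each drive reports (output details vary by driver):

  # one line per device: vendor, model, and firmware revision
  iostat -En | grep 'Revision:'

Compare that against the latest revision your drive vendor publishes.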
--
Ian.