Re: [zfs-discuss] Mysterious corruption with raidz2 vdev

2007-07-30 Thread Jeff Bonwick
I suspect this is a bug in raidz error reporting. With a mirror, each copy either checksums correctly or it doesn't, so we know which drives gave us bad data. With RAID-Z, we have to infer which drives have damage. If the number of drives returning bad data is less than or equal to the number of

Re: [zfs-discuss] Mysterious corruption with raidz2 vdev

2007-07-30 Thread Richard Elling
Kevin wrote: > We'll try running all of the diagnostic tests to rule out any other issues. Does the server have ECC memory? Many x86 systems do not :-( > But my question is, wouldn't I need to see at least 3 checksum errors on the > individual devices in order for there to be a visible error in

Re: [zfs-discuss] Mysterious corruption with raidz2 vdev

2007-07-30 Thread Kevin
We'll try running all of the diagnostic tests to rule out any other issues. But my question is, wouldn't I need to see at least 3 checksum errors on the individual devices in order for there to be a visible error in the top level vdev? There doesn't appear to be enough raw checksum errors on the

Re: [zfs-discuss] Mysterious corruption with raidz2 vdev (1 checksum err on disk, 2 on vdev?)

2007-07-29 Thread Tomas Ă–gren
On 28 July, 2007 - Marc Bevand sent me these 0,7K bytes: > Matthew Ahrens sun.com> writes: > > > > So the errors on the raidz2 vdev indeed indicate that at least 3 disks > > below > > it gave the wrong data for a those 2 blocks; we just couldn't tell which 3+ > > disks they were. > > Somethi

Re: [zfs-discuss] Mysterious corruption with raidz2 vdev (1 checksum err on disk, 2 on vdev?)

2007-07-27 Thread Marc Bevand
Matthew Ahrens sun.com> writes: > > So the errors on the raidz2 vdev indeed indicate that at least 3 disks below > it gave the wrong data for a those 2 blocks; we just couldn't tell which 3+ > disks they were. Something must be seriously wrong with this server. This is the first time I see an

Re: [zfs-discuss] Mysterious corruption with raidz2 vdev (1 checksum err on disk, 2 on vdev?)

2007-07-27 Thread Matthew Ahrens
Kevin wrote: > After a scrub of a pool with 3 raidz2 vdevs (each with 5 disks in them) I see > the following status output. Notice that the raidz2 vdev has 2 checksum > errors, but only one disk inside the raidz2 vdev has a checksum error. How is > this possible? I thought that you would have to

Re: [zfs-discuss] Mysterious corruption with raidz2 vdev (1 checksum err on disk, 2 on vd

2007-07-25 Thread Kevin
Here's some additional output from the zpool and zfs tools: $ zpool list NAMESIZEUSED AVAILCAP HEALTH ALTROOT tank 10.2T 8.58T 1.64T83% ONLINE - $ zfs list NAME USED AVAIL REFER MOUNTPOINT tank

[zfs-discuss] Mysterious corruption with raidz2 vdev (1 checksum err on disk, 2 on vdev?)

2007-07-25 Thread Kevin
After a scrub of a pool with 3 raidz2 vdevs (each with 5 disks in them) I see the following status output. Notice that the raidz2 vdev has 2 checksum errors, but only one disk inside the raidz2 vdev has a checksum error. How is this possible? I thought that you would have to have 3 errors in the