Re: [zfs-discuss] Cannot delete errored file

2008-06-13 Thread Jonathan Loran
Ben Middleton wrote: > Hi, > > Quick update: > > I left memtest running over night - 39 passes, no errors. > > I also attempted to force the BIOS to run the memory at 800MHz & 5-5-5-15 as > suggested - but the machine became very unstable - long boot times; > PCI-Express failure of Yukon networ

Re: [zfs-discuss] Cannot delete errored file

2008-06-13 Thread Ben Middleton
Hi, Quick update: I left memtest running over night - 39 passes, no errors. I also attempted to force the BIOS to run the memory at 800MHz & 5-5-5-15 as suggested - but the machine became very unstable - long boot times; PCI-Express failure of Yukon network card on booting etc. I've switched i

Re: [zfs-discuss] Cannot delete errored file

2008-06-10 Thread Brandon High
On Tue, Jun 10, 2008 at 9:12 AM, Ben Middleton <[EMAIL PROTECTED]> wrote: > I'll still try a long memtest run, followed by a rebuild of the errored pool. > I'll have a read around to see if there's anyway of making the memory more > stable on this mobo. Run it at 800MHz. I have a MSI P35 Platinu

Re: [zfs-discuss] Cannot delete errored file

2008-06-10 Thread Ben Middleton
Hi, It's an ASUS P5K-WS board with 2Gb of Corsair TwinX DDR2 8500 1066MHz Non-ECC memory. The board uses the Intel P35 chipset - it also will not support ECC RAM. TBH, this is probably the last time I'll get an ASUS, as this is the second board I've got through - the first one died for no parti

Re: [zfs-discuss] Cannot delete errored file

2008-06-10 Thread Brandon High
On Tue, Jun 10, 2008 at 8:01 AM, Ben Middleton <[EMAIL PROTECTED]> wrote: > Today's findings are that the cksum errors appear on the new disk on the > other controller too - so I've ruled out controllers & cables. It's probably > as Jeff says - just got to figure out now how to prove the memory i

Re: [zfs-discuss] Cannot delete errored file

2008-06-10 Thread Ben Middleton
Sent response by private message. Today's findings are that the cksum errors appear on the new disk on the other controller too - so I've ruled out controllers & cables. It's probably as Jeff says - just got to figure out now how to prove the memory is duff. Ben This message posted from ope

Re: [zfs-discuss] Cannot delete errored file

2008-06-10 Thread Jeff Bonwick
That's odd -- the only way the 'rm' should fail is if it can't read the znode for that file. The znode is metadata, and is therefore stored in two distinct places using ditto blocks. So even if you had one unlucky copy that was damaged on two of your disks, you should still have another copy elsew

Re: [zfs-discuss] Cannot delete errored file

2008-06-10 Thread Ben Middleton
Hi Marc, Thanks for all of your suggestions. I'll restart memtest when I'm next in the office and leave it running overnight. I can recreate the pool - but I guess the question is am I safe to do this on the existing setup, or am I going to hit the same issue again sometime? Assuming I don't f

Re: [zfs-discuss] Cannot delete errored file

2008-06-09 Thread Marc Bevand
Ben Middleton drn.org> writes: > > Today's update: > - I ran a memtest a few times - no errors. Just making sure you know about it: memtest should run for a _least_ a couple hours, and should complete at least 1 pass. Also, after the scrub completes, any permanent errors you see (so far you on

Re: [zfs-discuss] Cannot delete errored file

2008-06-09 Thread Ben Middleton
Hi, Today's update: - I ran a memtest a few times - no errors. - I reseated, re-routed ad switched all connectors/cables - I'm currently running a scrub, but it's showing vast numbers of cksum errors now across all devices: $ zpool status -v pool: rpool state: DEGRADED status: One or more de

Re: [zfs-discuss] Cannot delete errored file

2008-06-07 Thread Ben Middleton
Thanks Marc - I'll run memtest on Monday, and re-seat memory/cpu//cards etc. If that fails, I'll try moving the devices onto a different SATA controller. Failing that I'll rebuild from scratch. Failing that, I'll get a new motherboard! Ben This message posted from opensolaris.org __

Re: [zfs-discuss] Cannot delete errored file

2008-06-07 Thread Marc Bevand
Weird. I have no idea how you could remove that file (beside destroying the entire filesystem)... One other thing I noticed: NAMESTATE READ WRITE CKSUM rpool ONLINE 0 0 8 raidz1ONLINE 0 0 8 c0t7d0 ONLINE

Re: [zfs-discuss] Cannot delete errored file

2008-06-05 Thread Ben Middleton
Hi Marc, $ : > 09 - Check.mp3 bash: 09 - Check.mp3: I/O error $ cd .. $ rm -rf BAD $ rm: cannot remove `BAD/09 - Check.mp3': I/O error I'll try shuffling the cables - but as you see above it occasionally reports on a different disk - so imagine the cables are OK. Also, the new disk I added has

Re: [zfs-discuss] Cannot delete errored file

2008-06-05 Thread Marc Bevand
Ben Middleton drn.org> writes: > > [...] > But that simply had the effect of transferring the issue to the new drive: When you see this behavior, it most likely means it's not your drive which is failing, but instead it indicates a bad SATA/SAS cable, or port on the disk controller. PS: have yo

Re: [zfs-discuss] Cannot delete errored file

2008-06-05 Thread Ben Middleton
Hello again, I'm not making progress on this. Every time I run a zpool scrub rpool I see: $ zpool status -vx pool: rpool state: DEGRADED status: One or more devices has experienced an error resulting in data corruption. Applications may be affected. action: Restore the file in questi

[zfs-discuss] Cannot delete errored file

2008-06-03 Thread Ben Middleton
Hi, I can't seem to delete a file in my zpool that has permanent errors: zpool status -vx pool: rpool state: ONLINE status: One or more devices has experienced an error resulting in data corruption. Applications may be affected. action: Restore the file in question if possible. Other