Ben Middleton wrote:
> Hi,
>
> Quick update:
>
> I left memtest running overnight - 39 passes, no errors.
>
> I also attempted to force the BIOS to run the memory at 800MHz & 5-5-5-15 as
> suggested - but the machine became very unstable - long boot times;
> PCI-Express failure of Yukon networ
Hi,
Quick update:
I left memtest running overnight - 39 passes, no errors.
I also attempted to force the BIOS to run the memory at 800MHz & 5-5-5-15 as
suggested - but the machine became very unstable - long boot times; PCI-Express
failure of Yukon network card on booting etc. I've switched i
On Tue, Jun 10, 2008 at 9:12 AM, Ben Middleton <[EMAIL PROTECTED]> wrote:
> I'll still try a long memtest run, followed by a rebuild of the errored pool.
> I'll have a read around to see if there's any way of making the memory more
> stable on this mobo.
Run it at 800MHz. I have an MSI P35 Platinu
Hi,
It's an ASUS P5K-WS board with 2GB of Corsair TwinX DDR2 8500 1066MHz Non-ECC
memory. The board uses the Intel P35 chipset - it also will not support ECC
RAM. TBH, this is probably the last time I'll get an ASUS, as this is the
second board I've got through - the first one died for no parti
On Tue, Jun 10, 2008 at 8:01 AM, Ben Middleton <[EMAIL PROTECTED]> wrote:
> Today's findings are that the cksum errors appear on the new disk on the
> other controller too - so I've ruled out controllers & cables. It's probably
> as Jeff says - just got to figure out now how to prove the memory i
Sent response by private message.
Today's findings are that the cksum errors appear on the new disk on the other
controller too - so I've ruled out controllers & cables. It's probably as Jeff
says - just got to figure out now how to prove the memory is duff.
Ben
This message posted from ope
That's odd -- the only way the 'rm' should fail is if it can't
read the znode for that file. The znode is metadata, and is
therefore stored in two distinct places using ditto blocks.
So even if you had one unlucky copy that was damaged on two
of your disks, you should still have another copy elsew
Hi Marc,
Thanks for all of your suggestions.
I'll restart memtest when I'm next in the office and leave it running overnight.
I can recreate the pool - but I guess the question is am I safe to do this on
the existing setup, or am I going to hit the same issue again sometime?
Assuming I don't f
Ben Middleton drn.org> writes:
>
> Today's update:
> - I ran a memtest a few times - no errors.
Just making sure you know about it: memtest should run for at _least_ a couple
hours, and should complete at least 1 pass.
Also, after the scrub completes, any permanent errors you see (so far you on
Hi,
Today's update:
- I ran a memtest a few times - no errors.
- I reseated, re-routed and switched all connectors/cables
- I'm currently running a scrub, but it's showing vast numbers of cksum errors
now across all devices:
$ zpool status -v
pool: rpool
state: DEGRADED
status: One or more de
Thanks Marc - I'll run memtest on Monday, and re-seat memory/cpu/cards etc. If
that fails, I'll try moving the devices onto a different SATA controller.
Failing that I'll rebuild from scratch. Failing that, I'll get a new
motherboard!
Ben
This message posted from opensolaris.org
Weird. I have no idea how you could remove that file (besides destroying the
entire filesystem)...
One other thing I noticed:
NAME        STATE     READ WRITE CKSUM
rpool       ONLINE       0     0     8
  raidz1    ONLINE       0     0     8
    c0t7d0  ONLINE
Hi Marc,
$ : > 09 - Check.mp3
bash: 09 - Check.mp3: I/O error
$ cd ..
$ rm -rf BAD
rm: cannot remove `BAD/09 - Check.mp3': I/O error
I'll try shuffling the cables - but as you see above it occasionally reports on
a different disk - so I imagine the cables are OK. Also, the new disk I added has
Ben Middleton drn.org> writes:
>
> [...]
> But that simply had the effect of transferring the issue to the new drive:
When you see this behavior, it most likely means it's not your drive
which is failing, but instead it indicates a bad SATA/SAS cable, or
port on the disk controller.
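To make that reasoning concrete, here is a minimal sketch in Python that tallies the CKSUM column per disk from the config section of `zpool status` output. The same tally can then be taken before and after a drive swap. The pool name, device names and counts in SAMPLE are hypothetical and not taken from this thread. The helper names `cksum_by_disk` and `errors_follow_slot` are made up for illustration. The parser assumes Solaris-style device names (cXtYdZ) and plain integer counters.

```python
import re

# Hypothetical `zpool status` config section. Pool name, device names and
# counts are invented for illustration only.
SAMPLE = """\
        NAME        STATE     READ WRITE CKSUM
        tank        DEGRADED     0     0    12
          raidz1    DEGRADED     0     0    12
            c0t1d0  ONLINE       0     0     0
            c0t2d0  ONLINE       0     0    12
            c0t3d0  ONLINE       0     0     0
"""

# One config row: name, state, then three integer counters.
ROW = re.compile(r"^\s*(\S+)\s+[A-Z]+\s+(\d+)\s+(\d+)\s+(\d+)\s*$")
# Assumption: leaf devices use Solaris-style names such as c0t2d0.
DISK = re.compile(r"^c\d+(t\d+)?d\d+(s\d+)?$")


def cksum_by_disk(status_text):
    """Return {device_name: cksum_count} for leaf disks only."""
    counts = {}
    for line in status_text.splitlines():
        m = ROW.match(line)
        if m and DISK.match(m.group(1)):
            counts[m.group(1)] = int(m.group(4))
    return counts


def errors_follow_slot(before, after, slot):
    """True if the slot shows checksum errors both before and after the
    drive in it was replaced, which points away from the drive itself."""
    return before.get(slot, 0) > 0 and after.get(slot, 0) > 0


if __name__ == "__main__":
    counts = cksum_by_disk(SAMPLE)
    print(counts)
    # Reusing the same sample for "after" just to exercise the function.
    print(errors_follow_slot(counts, counts, "c0t2d0"))
```

If the errors stay with the slot after the swap, the cable or controller port is the more likely culprit. If they show up across every device, a shared component such as memory deserves a closer look.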
PS: have yo
Hello again,
I'm not making progress on this.
Every time I run a zpool scrub rpool I see:
$ zpool status -vx
pool: rpool
state: DEGRADED
status: One or more devices has experienced an error resulting in data
corruption. Applications may be affected.
action: Restore the file in questi
Hi,
I can't seem to delete a file in my zpool that has permanent errors:
zpool status -vx
pool: rpool
state: ONLINE
status: One or more devices has experienced an error resulting in data
corruption. Applications may be affected.
action: Restore the file in question if possible. Other