d_elbracht wrote:
> we are trying to diagnose errors seen on 6.2, SMP, amd64, cvsup'ed of
> 2007-10-09
> 
> Mainboard is a Tyan Thunder h2000M (S3992-E) with 16 GB RAM and 2 x Opteron
> 2216, da3 is on a 3ware 9550-12
> 
> we are seeing this error:
> g_vfs_done():da3s1a[READ(offset=81064794762854400, length=8192)]error = 5
> on a 12 GB Hyperdrive
> 
> the offset changes sometimes, but it is always 81064794xxxxxxxxx and well
> out the 12GB range.

Yes.

> According to systat -vm, da3 does tps > 500 (yes, that's a lot)

That's not a lot :) That's actually low for a modern solid state drive.

> This leads to an assumption, the error has to do with very high IOs per
> second on a SMP machine.

Either that or file system errors. Does fsck run ok or does it say
anything unusual?

There are several theoretical reasons for such errors that are connected
with the fact you use solid state drives, but all are tricky to diagnose
if you don't have a certain repeatable test you can try. For example:
some SSDs optimize writes to "spread out" the IO on the chips, but some
do it by looking into file system structures to determine where it's
safe to relocate the write - obviously this works only with a known and
supported file system. This is a really wild guess, but maybe the SSD
firmware has error somewhere in this area, trying to interpret UFS as it
was FAT? If you manage to get a repeatable failure test, you can try
formatting the drive as FAT32 and trying it on that.

Or maybe it's just a bad drive...

> The system-disk is a RAID1 on an ICP 5805. All other disks (51) are 20
> gstripe'd partitions.

51 drives and 20 partitions?

Attachment: signature.asc
Description: OpenPGP digital signature

Reply via email to