Thanks for the note Anton.  I let memtest86 run overnight and it found 
no issues.  I've also now moved the cards around and have confirmed that 
slot #3 on the mobo is bad (all my aoc-sat2-mv8 cards, cables, and 
backplanes are OK).

However, I think its more than just slot #3 that has a fault because 
when I have all three cards plugged into mobo slots other than #3, they 
all work fine individually, but when I run the exact same per-card tests 
in parallel, the system crashes.

I'm now going to have the system integrator that built my system send me 
a new mobo  (ugh!)

Thanks again,
Kent


Anton B. Rang wrote:
> Definitely a hardware problem (possibly compounded by a bug).  Some key 
> phrases and routines:
>
>   ATA UDMA data parity error
>
> This one actually looks like a misnomer.  At least, I'd normally expect "data 
> parity error" not to crash the system!  (It should result in a retry or EIO.)
>
>   PCI(-X) Express Fatal Error
>
> This one's more of an issue -- it indicates that the PCI Express bus had an 
> error.
>
>   pcie_pci:pepb_err_msi_intr
>
> This indicates an error on the PCI bus which has been reflected through to 
> the PCI Express bus. There should be more detail, but it's hard to figure it 
> out from what's below. (The report is showing multiple errors, including both 
> parity errors & system errors, which seems unlikely unless there's a hardware 
> design flaw or a software bug.)
>
> Others have suggested the power supply or memory, but in my experience these 
> types of errors are more often due to a faulty system backplane or card (and 
> occasionally a bad bridge chip).
>  
>  
> This message posted from opensolaris.org
> _______________________________________________
> zfs-discuss mailing list
> zfs-discuss@opensolaris.org
> http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
>
>
>   

_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Reply via email to