Hello,

This may not apply to your machine, since my setup differs from yours in two ways:
* OpenSolaris instead of Nexenta
* DL585G1 instead of your DL380G4

Here's my problem: a reproducible crash after a certain amount of time (1h30 in
my case).

Explanation: the HP machine has enterprise features (ECC RAM) and scrubs the
RAM, much as you would scrub ZFS disks. With the 4 dual-core AMD CPUs, the
memory is divided into 4 chunks, and when the scrubber hits a hole, the machine
crashes without so much as a crash dump.

Solution: add the following to /etc/system and reboot:

set snooping=1
set pcplusmp:apic_panic_on_nmi=1
set cpu_ms.AuthenticAMD.15:ao_scrub_policy=1
set cpu_ms.AuthenticAMD.15:ao_scrub_rate_dcache=0
set cpu_ms.AuthenticAMD.15:ao_scrub_rate_l2cache=0
set mc-amd:mc_no_attach=1
set disable_memscrub = 1
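
If you want to double-check the file before rebooting, something like the
following Python sketch might help. It is only an illustration, not an official
tool: the helper names (parse_set_lines, missing_settings) are made up for this
example, and it assumes that every tunable sits on its own "set name=value"
line and that lines starting with "*" or "#" are comments. It only checks the
text of the file; it does not tell you what the running kernel actually uses.

```python
import re

# The tunables listed above, with values kept as strings.
EXPECTED = {
    "snooping": "1",
    "pcplusmp:apic_panic_on_nmi": "1",
    "cpu_ms.AuthenticAMD.15:ao_scrub_policy": "1",
    "cpu_ms.AuthenticAMD.15:ao_scrub_rate_dcache": "0",
    "cpu_ms.AuthenticAMD.15:ao_scrub_rate_l2cache": "0",
    "mc-amd:mc_no_attach": "1",
    "disable_memscrub": "1",
}

# Matches "set name=value" with optional spaces around "=".
SET_RE = re.compile(r"^set\s+(\S+?)\s*=\s*(\S+)\s*$")


def parse_set_lines(text):
    """Return {name: value} for every 'set' line, skipping comments."""
    settings = {}
    for raw in text.splitlines():
        line = raw.strip()
        # Assumption: "*" or "#" at the start of a line marks a comment.
        if not line or line.startswith(("*", "#")):
            continue
        match = SET_RE.match(line)
        if match:
            settings[match.group(1)] = match.group(2)
    return settings


def missing_settings(text, expected=EXPECTED):
    """Return the sorted names that are absent or set to another value."""
    found = parse_set_lines(text)
    return sorted(n for n, v in expected.items() if found.get(n) != v)
```

On the machine itself you would call it as
missing_settings(open("/etc/system").read()); an empty list means all seven
lines are present with the values shown above.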


Best regards,

Oliver
-- 
This message posted from opensolaris.org
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
