On a lark, I decided to create a new pool that excludes every device 
connected to card #3 (i.e. "c5").

It crashes again, but this time with a slightly different dump (see below).
  - Actually, there are two dumps below: the first is from the xVM 
kernel and the second is from the standard (non-xVM) kernel.

Any ideas?

Kent



[NOTE: this one using xVM kernel - see below for dump without xVM kernel]

# zpool destroy tank
# zpool status
no pools available
# zpool create tank raidz2 c3t0d0 c3t4d0 c4t0d0 c4t4d0 raidz2 c3t1d0 c3t5d0 c4t1d0 c4t5d0
# zpool status
  pool: tank
 state: ONLINE
 scrub: none requested
config:

        NAME        STATE     READ WRITE CKSUM
        tank        ONLINE       0     0     0
          raidz2    ONLINE       0     0     0
            c3t0d0  ONLINE       0     0     0
            c3t4d0  ONLINE       0     0     0
            c4t0d0  ONLINE       0     0     0
            c4t4d0  ONLINE       0     0     0
          raidz2    ONLINE       0     0     0
            c3t1d0  ONLINE       0     0     0
            c3t5d0  ONLINE       0     0     0
            c4t1d0  ONLINE       0     0     0
            c4t5d0  ONLINE       0     0     0

errors: No known data errors
# ls /tank
# cp -r /usr /tank/usr
Jan 17 08:48:53 san sata: NOTICE: /[EMAIL PROTECTED],0/pci10de,[EMAIL PROTECTED]/pci1033,[EMAIL PROTECTED]/pci11ab,[EMAIL PROTECTED]:
Jan 17 08:48:53 san  port 5: device reset
Jan 17 08:48:53 san sata: NOTICE: /[EMAIL PROTECTED],0/pci10de,[EMAIL PROTECTED]/pci1033,[EMAIL PROTECTED]/pci11ab,[EMAIL PROTECTED]:
Jan 17 08:48:53 san  port 5: link lost
Jan 17 08:48:53 san sata: NOTICE: /[EMAIL PROTECTED],0/pci10de,[EMAIL PROTECTED]/pci1033,[EMAIL PROTECTED]/pci11ab,[EMAIL PROTECTED]:
Jan 17 08:48:53 san  port 5: link established
Jan 17 08:48:55 san marvell88sx: WARNING: marvell88sx1: port 4: DMA completed after timed out
Jan 17 08:48:55 san last message repeated 14 times
Jan 17 08:48:55 san sata: NOTICE: /[EMAIL PROTECTED],0/pci10de,[EMAIL PROTECTED]/pci1033,[EMAIL PROTECTED]/pci11ab,[EMAIL PROTECTED]:
Jan 17 08:48:55 san  port 4: device reset
Jan 17 08:48:55 san sata: NOTICE: /[EMAIL PROTECTED],0/pci10de,[EMAIL PROTECTED]/pci1033,[EMAIL PROTECTED]/pci11ab,[EMAIL PROTECTED]:
Jan 17 08:48:55 san  port 4: link lost
Jan 17 08:48:55 san sata: NOTICE: /[EMAIL PROTECTED],0/pci10de,[EMAIL PROTECTED]/pci1033,[EMAIL PROTECTED]/pci11ab,[EMAIL PROTECTED]:
Jan 17 08:48:55 san  port 4: link established
Jan 17 08:48:55 san scsi: WARNING: /[EMAIL PROTECTED],0/pci10de,[EMAIL PROTECTED]/pci1033,[EMAIL PROTECTED]/pci11ab,[EMAIL PROTECTED]/[EMAIL PROTECTED],0 (sd15):
Jan 17 08:48:55 san     Error for Command: write                   Error Level: Retryable
Jan 17 08:48:55 san scsi:       Requested Block: 11893                     Error Block: 11893
Jan 17 08:48:55 san scsi:       Vendor: ATA                                Serial Number:
Jan 17 08:48:55 san scsi:       Sense Key: No_Additional_Sense
Jan 17 08:48:55 san scsi:       ASC: 0x0 (no additional sense info), ASCQ: 0x0, FRU: 0x0
Jan 17 08:48:55 san scsi: WARNING: /[EMAIL PROTECTED],0/pci10de,[EMAIL PROTECTED]/pci1033,[EMAIL PROTECTED]/pci11ab,[EMAIL PROTECTED]/[EMAIL PROTECTED],0 (sd15):
Jan 17 08:48:55 san     Error for Command: write                   Error Level: Retryable
Jan 17 08:48:55 san scsi:       Requested Block: 11983                     Error Block: 11983
Jan 17 08:48:55 san scsi:       Vendor: ATA                                Serial Number:
Jan 17 08:48:55 san scsi:       Sense Key: No_Additional_Sense
Jan 17 08:48:55 san scsi:       ASC: 0x0 (no additional sense info), ASCQ: 0x0, FRU: 0x0
Jan 17 08:48:55 san scsi: WARNING: /[EMAIL PROTECTED],0/pci10de,[EMAIL PROTECTED]/pci1033,[EMAIL PROTECTED]/pci11ab,[EMAIL PROTECTED]/[EMAIL PROTECTED],0 (sd15):
Jan 17 08:48:55 san     Error for Command: write                   Error Level: Retryable
Jan 17 08:48:55 san scsi:       Requested Block: 12988                     Error Block: 12988
Jan 17 08:48:55 san scsi:       Vendor: ATA                                Serial Number:
Jan 17 08:48:55 san scsi:       Sense Key: No_Additional_Sense
Jan 17 08:48:55 san scsi:       ASC: 0x0 (no additional sense info), ASCQ: 0x0, FRU: 0x0
Jan 17 08:48:55 san scsi: WARNING: /[EMAIL PROTECTED],0/pci10de,[EMAIL PROTECTED]/pci1033,[EMAIL PROTECTED]/pci11ab,[EMAIL PROTECTED]/[EMAIL PROTECTED],0 (sd15):
Jan 17 08:48:55 WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 4:
        EDMA self disabled

panic[cpu2]/thread=ffffff000f180c80: BAD TRAP: type=e (#pf Page fault) rp=ffffff000f180ab0 addr=0 occurred in module "<unknown>" due to a NULL pointer dereference

sched: #pf Page fault
Bad kernel fault at addr=0x0
pid=0, pc=0x0, sp=0xffffff000f180ba8, eflags=0x10246
cr0: 8005003b<pg,wp,ne,et,ts,mp,pe> cr4: 620<xmme,fxsr,pae>
cr2: 0
        rdi: ffffff02d0040380 rsi:                0 rdx: ffffff000f180c80
        rcx:                2  r8:                0  r9:                0
        rax:                0 rbx:                1 rbp: ffffff000f180be0
        r10:      429c7a8d230 r11: fffffffffb81ec40 r12: ffffff02cd25aae8
        r13: ffffff02d0040380 r14: ffffff02cd25aa80 r15: ffffff02cbab3c80
        fsb:                0 gsb: ffffff02c6bb9b00  ds:               4b
         es:               4b  fs:                0  gs:              1c3
        trp:                e err:               10 rip:                0
         cs:             e030 rfl:            10246 rsp: ffffff000f180ba8
         ss:             e02b

ffffff000f180990 unix:die+c8 ()
ffffff000f180aa0 unix:trap+13b3 ()
ffffff000f180ab0 unix:cmntrap+12f ()
ffffff000f180be0 0 ()
ffffff000f180c30 unix:av_dispatch_softvect+5f ()
ffffff000f180c60 unix:dispatch_softint+38 ()
ffffff000f13c9a0 unix:switch_sp_and_call+13 ()
ffffff000f13c9e0 unix:dosoftint+59 ()
ffffff000f13ca30 unix:do_interrupt+f9 ()
ffffff000f13cae0 unix:xen_callback_handler+370 ()
ffffff000f13caf0 unix:xen_callback+cd ()
ffffff000f13cbf0 unix:HYPERVISOR_sched_op+29 ()
ffffff000f13cc00 unix:HYPERVISOR_block+11 ()
ffffff000f13cc10 unix:mach_cpu_idle+12 ()
ffffff000f13cc40 unix:cpu_idle+cc ()
ffffff000f13cc60 unix:idle+10e ()
ffffff000f13cc70 unix:thread_start+8 ()

syncing file systems... 1 1 done
dumping to /dev/dsk/c2t0d0s1, offset 215547904, content: kernel
NOTICE: /[EMAIL PROTECTED],0/pci15d9,[EMAIL PROTECTED]:
 port 0: device reset

100% done: 143600 pages dumped, compression ratio 3.32, dump succeeded
rebooting...






[NOTE: this one using the standard kernel - not the xVM kernel]


# cp -r /usr /tank/usr
WARNING: marvell88sx1: error on port 1:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 1:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 1:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 1:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 1:
        ATA UDMA data parity error
WARNING: marvell88sx1: error on port 1:
        ATA UDMA data parity error

SUNW-MSG-ID: SUNOS-8000-0G, TYPE: Error, VER: 1, SEVERITY: Major
EVENT-TIME: 0x478f5e05.0x288802d5 (0x4504536150)
PLATFORM: i86pc, CSN: -, HOSTNAME: san
SOURCE: SunOS, REV: 5.11 snv_78
DESC: Errors have been detected that require a reboot to ensure system
integrity.  See http://www.sun.com/msg/SUNOS-8000-0G for more information.
AUTO-RESPONSE: Solaris will attempt to save and diagnose the error telemetry
IMPACT: The system will sync files, save a crash dump if needed, and reboot
REC-ACTION: Save the error summary below in case telemetry cannot be saved


panic[cpu3]/thread=ffffff000f81ac80: pcie_pci-0: PCI(-X) Express Fatal Error

ffffff000f81abc0 pcie_pci:pepb_err_msi_intr+d2 ()
ffffff000f81ac20 unix:av_dispatch_autovect+78 ()
ffffff000f81ac60 unix:dispatch_hardint+2f ()
ffffff000f7e4ac0 unix:switch_sp_and_call+13 ()
ffffff000f7e4b10 unix:do_interrupt+a0 ()
ffffff000f7e4b20 unix:cmnint+ba ()
ffffff000f7e4c10 unix:mach_cpu_idle+b ()
ffffff000f7e4c40 unix:cpu_idle+c8 ()
ffffff000f7e4c60 unix:idle+10e ()
ffffff000f7e4c70 unix:thread_start+8 ()

syncing file systems... done
ereport.io.pciex.rc.fe-msg ena=450450261f00c01 detector=[ version=0 scheme="dev" device-path="/[EMAIL PROTECTED],0/pci10de,[EMAIL PROTECTED]" ] rc-status=800007c source-id=200 source-valid=1

ereport.io.pciex.rc.mue-msg ena=450450261f00c01 detector=[ version=0 scheme="dev" device-path="/[EMAIL PROTECTED],0/pci10de,[EMAIL PROTECTED]" ] rc-status=800007c

ereport.io.pci.sec-rserr ena=450450261f00c01 detector=[ version=0 scheme="dev" device-path="/[EMAIL PROTECTED],0/pci10de,[EMAIL PROTECTED]" ] pci-sec-status=6000 pci-bdg-ctrl=3

ereport.io.pci.sec-ma ena=450450261f00c01 detector=[ version=0 scheme="dev" device-path="/[EMAIL PROTECTED],0/pci10de,[EMAIL PROTECTED]" ] pci-sec-status=6000 pci-bdg-ctrl=3

ereport.io.pciex.bdg.sec-perr ena=450450261f00c01 detector=[ version=0 scheme="dev" device-path="/[EMAIL PROTECTED],0/pci10de,[EMAIL PROTECTED]/pci1033,[EMAIL PROTECTED]" ] sue-status=1800 source-id=200 source-valid=1

ereport.io.pciex.bdg.sec-serr ena=450450261f00c01 detector=[ version=0 scheme="dev" device-path="/[EMAIL PROTECTED],0/pci10de,[EMAIL PROTECTED]/pci1033,[EMAIL PROTECTED]" ] sue-status=1800

ereport.io.pci.sec-rserr ena=450450261f00c01 detector=[ version=0 scheme="dev" device-path="/[EMAIL PROTECTED],0/pci10de,[EMAIL PROTECTED]/pci1033,[EMAIL PROTECTED]" ] pci-sec-status=6420 pci-bdg-ctrl=7

dumping to /dev/dsk/c2t0d0s1, offset 215547904, content: kernel
NOTICE: /[EMAIL PROTECTED],0/pci15d9,[EMAIL PROTECTED]:
 port 0: device reset

100% done: 152687 pages dumped, compression ratio 5.33, dump succeeded
rebooting...
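
In case it helps anyone compare the two runs, here is a rough sketch for 
tallying the marvell88sx "error on port N" warnings per controller and port 
from a saved copy of the console output. The function name 
count_port_errors and the file name console.log are placeholders of mine, 
and the pattern assumes the warning lines look exactly like the ones in 
the dumps; it is not taken from the driver or any Sun tool.

```shell
# Tally "error on port N" warnings per controller/port in a saved log.
# Assumption: the log file was captured by hand (e.g. from a serial console).
count_port_errors() {
  # $1 is the path to the saved console output (hypothetical file)
  grep 'marvell88sx[0-9]*: error on port [0-9]*:' "$1" \
    | sed 's/^.*\(marvell88sx[0-9]*\): error on port \([0-9]*\):.*$/\1 port \2/' \
    | sort \
    | uniq -c
}

# Example usage (console.log is a placeholder name):
#   count_port_errors console.log
```

Each output line is a count followed by the controller and port, which 
should make it easy to see whether the errors follow one port or move 
around between runs.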





_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
