[zfs-discuss] Apparent SAS HBA failure-- now what?

2010-11-06 Thread Dave Pooser
My setup: A SuperMicro 24-drive chassis with Intel dual-processor motherboard, three LSI SAS3081E controllers, and 24 SATA 2TB hard drives, divided into three pools with each pool a single eight-disk RAID-Z2. (Boot is an SSD connected to motherboard SATA.) This morning I got a cheerful email from

Re: [zfs-discuss] Apparent SAS HBA failure-- now what?

2010-11-06 Thread Khushil Dep
Can you send the output of iostat -xCzn as well as fmadm faulty, please? Is this an E2 chassis? Are you using interposers? On 6 Nov 2010 18:28, "Dave Pooser" wrote: My setup: A SuperMicro 24-drive chassis with Intel dual-processor motherboard, three LSI SAS3081E controllers, and 24 SATA 2TB hard dri

Re: [zfs-discuss] Apparent SAS HBA failure-- now what?

2010-11-06 Thread Dave Pooser
On 11/6/10 Nov 6, 1:35 PM, "Khushil Dep" wrote: > Is this an E2 chassis? Are you using interposers? No, it's an SC846A chassis. There are no interposers or expanders; six SFF-8087 "iPass" cables go from ports on the HBA to ports on the backplane. > Can you send output of iostat -xCzn as well a

Re: [zfs-discuss] Apparent SAS HBA failure-- now what?

2010-11-06 Thread Khushil Dep
Sorry, I meant iostat -En; I'm looking for errors. On 6 Nov 2010 18:56, "Dave Pooser" wrote: On 11/6/10 Nov 6, 1:35 PM, "Khushil Dep" wrote: > Is this an E2 chassis? Are you using interposers? No, it's an SC846A chassis. There are no interposers or expanders; six SFF-8087 "iPass" cables go

Re: [zfs-discuss] Apparent SAS HBA failure-- now what?

2010-11-06 Thread Dave Pooser
On 11/6/10 Nov 6, 2:21 PM, "Khushil Dep" wrote: > Sorry I meant iostat -En I'm looking for errors # iostat -En c8d0 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0 Model: IMATION-MAC25-0 Revision: Serial No: 87A0079B1808000 Size: 63.89GB <63887523840 bytes> Media Error: 0 Dev
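
When scanning `iostat -En` output for the errors being asked about here, a small script can pick out the devices whose counters are non-zero. The following is a minimal sketch, assuming the per-device header has the shape shown in the snippet above (device name followed by Soft Errors, Hard Errors and Transport Errors). The function name `devices_with_errors` and the second device in the sample (`c9t0d0` and its counts) are hypothetical and are not taken from the thread.

```python
import re

# Matches the per-device header line of `iostat -En`, e.g.
# "c8d0 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0"
HEADER = re.compile(
    r"(?P<dev>\S+)\s+Soft Errors:\s*(?P<soft>\d+)\s+"
    r"Hard Errors:\s*(?P<hard>\d+)\s+"
    r"Transport Errors:\s*(?P<transport>\d+)"
)


def devices_with_errors(iostat_output):
    """Return (device, soft, hard, transport) for every device
    that reports at least one non-zero error counter."""
    flagged = []
    for match in HEADER.finditer(iostat_output):
        counts = tuple(
            int(match.group(key)) for key in ("soft", "hard", "transport")
        )
        if any(counts):
            flagged.append((match.group("dev"),) + counts)
    return flagged


# Sample input: the c8d0 line follows the snippet above; the c9t0d0
# line is made-up illustration data, not output from the real system.
SAMPLE = """\
c8d0 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Model: IMATION-MAC25-0 Revision: Serial No: 87A0079B1808000
c9t0d0 Soft Errors: 0 Hard Errors: 3 Transport Errors: 12
"""

if __name__ == "__main__":
    for dev, soft, hard, transport in devices_with_errors(SAMPLE):
        print(f"{dev}: soft={soft} hard={hard} transport={transport}")
```

In practice the text would come from running `iostat -En` on the host and passing its captured output to the function; grouping the flagged devices by controller prefix would then hint at whether a single HBA is the common factor.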

Re: [zfs-discuss] Apparent SAS HBA failure-- now what?

2010-11-06 Thread Khushil Dep
Similar to what I've seen before, SATA disks in an 846 chassis with hardware and transport errors. Though on that occasion it was an E2 chassis with interposers. How long has this system been up? Is it production or can you offline and check all firmware on LSI controllers are up to date and match e

Re: [zfs-discuss] Apparent SAS HBA failure-- now what?

2010-11-06 Thread Dave Pooser
On 11/6/10 Nov 6, 2:35 PM, "Khushil Dep" wrote: > Similar to what I've seen before, SATA disks in a 846 chassis with hardware > and transport errors. Though in that occasion it was an E2 chassis with > interposers. How long has this system been up? Is it production or can you > offline and check

Re: [zfs-discuss] Apparent SAS HBA failure-- now what?

2010-11-06 Thread Khushil Dep
The fmdump will let you get the serial of one disk and ID the controller it's on so you can swap it out and check. On 6 Nov 2010 19:45, "Dave Pooser" wrote: On 11/6/10 Nov 6, 2:35 PM, "Khushil Dep" wrote: > Similar to what I've seen... It's been up for about 6 months. I can offline them. > Do

Re: [zfs-discuss] [OpenIndiana-discuss] format dumps the core

2010-11-06 Thread Jürgen Keil
> r...@tos-backup:~# pstack /dev/rdsk/core > core '/dev/rdsk/core' of 1217: format > fee62e4a UDiv (4, 0, 8046c80, 80469a0, 8046a30, 8046a50) + 2a > 08079799 auto_sense (4, 0, 8046c80, 0) + 281 > ... Seems that one function call is missing in the back trace between auto_sense and UDiv, because U

Re: [zfs-discuss] Apparent SAS HBA failure-- now what?

2010-11-06 Thread McBofh
On 7/11/10 04:27 AM, Dave Pooser wrote: My setup: A SuperMicro 24-drive chassis with Intel dual-processor motherboard, three LSI SAS3081E controllers, and 24 SATA 2TB hard drives, divided into three pools with each pool a single eight-disk RAID-Z2. (Boot is an SSD connected to motherboard SATA.)