Package: linux-image-2.6.18-5-686
Version: 2.6.18.dfsg.1-13etch4
Severity: important

We are experiencing completely reproducible problems accessing S-ATA
drives, connected to one or more Promise SX8 S-ATA controllers.  The
controller is recognized by the default Debian kernel, and the attached
drives appear as devices under /dev/sx8/.  When we put a reasonably
high load on more than one drive concurrently (for example, by running
two simultaneous "mkfs" processes), the processes lock up indefinitely
and the load average starts climbing to approximately 2xN, where N is
the number of disks accessed simultaneously.  After such a lock-up, the
drives are completely inaccessible.
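
The reproduction steps above could be scripted roughly as follows.  This
is only a sketch, not the procedure we actually used: the helper names
run_concurrently and one_minute_load are made up for illustration, and
the commands to run (for us, one "mkfs" per drive) have to be supplied by
the caller.

```python
import os
import subprocess
import time


def run_concurrently(commands, timeout_s, poll_s=0.1):
    """Start all commands at once and wait up to timeout_s seconds.

    Returns (finished, stuck): indices of the commands that exited in
    time and of those that were still running at the deadline.
    """
    procs = [
        subprocess.Popen(cmd, stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL)
        for cmd in commands
    ]
    deadline = time.monotonic() + timeout_s
    while time.monotonic() < deadline and any(p.poll() is None for p in procs):
        time.sleep(poll_s)

    # Take one snapshot so every process lands in exactly one list.
    states = [p.poll() for p in procs]
    finished = [i for i, rc in enumerate(states) if rc is not None]
    stuck = [i for i, rc in enumerate(states) if rc is None]

    # Best-effort cleanup.  A process blocked in uninterruptible I/O does
    # not die until the I/O completes, so never wait on it without a timeout.
    for i in stuck:
        procs[i].kill()
        try:
            procs[i].wait(timeout=1)
        except subprocess.TimeoutExpired:
            pass
    return finished, stuck


def one_minute_load():
    """Return the 1-minute load average (Unix only)."""
    return os.getloadavg()[0]
```

Called with two "mkfs" command lines for two different drives and a
timeout comfortably longer than a normal mkfs run, we would expect both
indices to come back as stuck on the affected kernel, with
one_minute_load() rising afterwards.  Note that mkfs destroys the data on
its target device.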

The problem can be reproduced with one or two cards on the same system.  
We have three SX8 controllers altogether, and two identical motherboards, 
and the problem can be reproduced with all combinations of these.  We don't
pass any boot-time kernel options to the SX8 driver, and we have tagged
commands disabled in the firmware configuration.  The SX8 cards are the
only add-on cards in the system.

We found a workaround for ourselves: we noticed that the problem does
not appear if we boot from a Knoppix 5.0.1 CD-ROM.  So we compiled a
custom kernel (2.6.23.1 from kernel.org), using the .config file from
the kernel on the Knoppix 5.0.1 CD.  This worked: the problem cannot
be reproduced with that kernel on the same system.
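
To narrow down which build options matter, the two kernel configuration
files could be compared option by option.  The following is a minimal
sketch, assuming both files use the usual "CONFIG_X=value" and
"# CONFIG_X is not set" line formats.  The function names are made up for
illustration, and an option that is missing from a file is treated as
"n", which is a simplification.

```python
def parse_kconfig(text):
    """Map option name -> value; '# CONFIG_X is not set' becomes 'n'."""
    suffix = " is not set"
    options = {}
    for line in text.splitlines():
        line = line.strip()
        if line.startswith("# CONFIG_") and line.endswith(suffix):
            # Drop the leading "# " and the trailing " is not set".
            options[line[2:-len(suffix)]] = "n"
        elif line.startswith("CONFIG_") and "=" in line:
            name, value = line.split("=", 1)
            options[name] = value
    return options


def diff_kconfig(a, b):
    """Return {name: (value_in_a, value_in_b)} for options that differ.

    Options absent from one side are treated as 'n' (an assumption).
    """
    result = {}
    for name in sorted(set(a) | set(b)):
        va, vb = a.get(name, "n"), b.get(name, "n")
        if va != vb:
            result[name] = (va, vb)
    return result
```

The inputs would be the contents of the stock Debian kernel's
configuration file and of the Knoppix-derived .config.  Since the two
kernels are also different versions (2.6.18 versus 2.6.23.1), a
configuration difference found this way is only a candidate cause, not a
confirmed one.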

We would very much prefer to use the stock Debian kernel, though.

The Promise SX8 cards we are using have been updated to the latest
firmware.  The motherboard is the SuperMicro PDSME, also flashed to the
latest BIOS.  Here are the product information pages:
http://www.promise.com/product/product_detail_eng.asp?product_id=125
http://www.supermicro.com/products/motherboard/PD/E7230/PDSME.cfm

I'll paste the output of lspci and a snippet of `dmesg' at the end of
this report.

For your information, I found this open CentOS bug report: 
  http://bugs.centos.org/view.php?id=2355
This looks exactly like the problem we are experiencing.

P.S. I am not sure whether I can attach the working kernel configuration
file to this message: I'm using the reportbug command, and so far it has
not offered to attach a file.  Should I just paste it in?

The output of lspci:
00:00.0 Host bridge: Intel Corporation E7230 Memory Controller Hub (rev 81)
00:01.0 PCI bridge: Intel Corporation E7230 PCI Express Root Port (rev 81)
00:1c.0 PCI bridge: Intel Corporation 82801G (ICH7 Family) PCI Express Port 1 (rev 01)
00:1c.4 PCI bridge: Intel Corporation 82801GR/GH/GHM (ICH7 Family) PCI Express Port 5 (rev 01)
00:1c.5 PCI bridge: Intel Corporation 82801GR/GH/GHM (ICH7 Family) PCI Express Port 6 (rev 01)
00:1d.0 USB Controller: Intel Corporation 82801G (ICH7 Family) USB UHCI #1 (rev 01)
00:1d.1 USB Controller: Intel Corporation 82801G (ICH7 Family) USB UHCI #2 (rev 01)
00:1d.2 USB Controller: Intel Corporation 82801G (ICH7 Family) USB UHCI #3 (rev 01)
00:1d.3 USB Controller: Intel Corporation 82801G (ICH7 Family) USB UHCI #4 (rev 01)
00:1d.7 USB Controller: Intel Corporation 82801G (ICH7 Family) USB2 EHCI Controller (rev 01)
00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev e1)
00:1f.0 ISA bridge: Intel Corporation 82801GB/GR (ICH7 Family) LPC Interface Bridge (rev 01)
00:1f.1 IDE interface: Intel Corporation 82801G (ICH7 Family) IDE Controller (rev 01)
00:1f.2 IDE interface: Intel Corporation 82801GB/GR/GH (ICH7 Family) Serial ATA Storage Controller IDE (rev 01)
00:1f.3 SMBus: Intel Corporation 82801G (ICH7 Family) SMBus Controller (rev 01)
01:00.0 PCI bridge: Intel Corporation 6700PXH PCI Express-to-PCI Bridge A (rev 09)
01:00.1 PIC: Intel Corporation 6700/6702PXH I/OxAPIC Interrupt Controller A (rev 09)
01:00.2 PCI bridge: Intel Corporation 6700PXH PCI Express-to-PCI Bridge B (rev 09)
01:00.3 PIC: Intel Corporation 6700PXH I/OxAPIC Interrupt Controller B (rev 09)
03:01.0 Mass storage controller: Promise Technology, Inc. SATAII150 SX8 (rev 02)
03:03.0 Mass storage controller: Promise Technology, Inc. SATAII150 SX8 (rev 02)
05:00.0 Ethernet controller: Intel Corporation 82573E Gigabit Ethernet Controller (Copper) (rev 03)
06:00.0 Ethernet controller: Intel Corporation 82573L Gigabit Ethernet Controller
0a:04.0 VGA compatible controller: ATI Technologies Inc Rage XL (rev 27)

Snippet of `dmesg':
sx8 version 1.0
ACPI: PCI Interrupt 0000:03:01.0[A] -> GSI 48 (level, low) -> IRQ 58
sx8(0000:03:01.0): found 5 interesting devices
sx8(0000:03:01.0): port 0 device 781422768 sectors
sx8(0000:03:01.0): port 0 device "ST3400633AS"
sx8(0000:03:01.0): port 1 device 781422768 sectors
sx8(0000:03:01.0): port 1 device "ST3400633AS"
sx8(0000:03:01.0): port 2 device 781422768 sectors
sx8(0000:03:01.0): port 2 device "ST3400633AS"
sx8(0000:03:01.0): port 3 device 781422768 sectors
sx8(0000:03:01.0): port 3 device "ST3400633AS"
sx8(0000:03:01.0): port 4 device 781422768 sectors
sx8(0000:03:01.0): port 4 device "ST3400633AS"
 sx8/0: p1
 sx8/1: p1
 sx8/2: p1
 sx8/3:<6>ACPI: PCI Interrupt 0000:00:1f.3[B] -> GSI 19 (level, low) -> IRQ 233
 p1
 sx8/4: p1
sx8(0000:03:01.0): 5 ports activated
sx80: pci 0000:03:01.0, ports 8, io ed300000, irq 58, major 160
ACPI: PCI Interrupt 0000:03:03.0[A] -> GSI 52 (level, low) -> IRQ 66
sx8(0000:03:03.0): found 0 interesting devices
sx8(0000:03:03.0): 0 ports activated
sx81: pci 0000:03:03.0, ports 8, io ed400000, irq 66, major 161

Please let me know if you require any more information.

-- System Information:
Debian Release: 4.0
  APT prefers stable
  APT policy: (500, 'stable')
Architecture: i386 (i686)
Shell:  /bin/sh linked to /bin/bash
Kernel: Linux 2.6.18-5-686
Locale: LANG=C, LC_CTYPE=C (charmap=ANSI_X3.4-1968)

Versions of packages linux-image-2.6.18-5-686 depends on:
ii  coreutils                     5.97-5.3   The GNU core utilities
ii  debconf [debconf-2.0]         1.5.11     Debian configuration management sy
ii  initramfs-tools [linux-initra 0.85h      tools for generating an initramfs
ii  module-init-tools             3.3-pre4-2 tools for managing Linux kernel mo

Versions of packages linux-image-2.6.18-5-686 recommends:
pn  libc6-i686                    <none>     (no description available)

-- debconf information:
  linux-image-2.6.18-5-686/preinst/failed-to-move-modules-2.6.18-5-686:
  linux-image-2.6.18-5-686/postinst/create-kimage-link-2.6.18-5-686: true
  linux-image-2.6.18-5-686/preinst/initrd-2.6.18-5-686:
  shared/kernel-image/really-run-bootloader: true
  linux-image-2.6.18-5-686/preinst/lilo-has-ramdisk:
  linux-image-2.6.18-5-686/preinst/abort-install-2.6.18-5-686:
  linux-image-2.6.18-5-686/postinst/old-system-map-link-2.6.18-5-686: true
  linux-image-2.6.18-5-686/postinst/kimage-is-a-directory:
  linux-image-2.6.18-5-686/preinst/elilo-initrd-2.6.18-5-686: true
  linux-image-2.6.18-5-686/postinst/bootloader-test-error-2.6.18-5-686:
  linux-image-2.6.18-5-686/preinst/abort-overwrite-2.6.18-5-686:
  linux-image-2.6.18-5-686/prerm/would-invalidate-boot-loader-2.6.18-5-686: true
  linux-image-2.6.18-5-686/postinst/depmod-error-initrd-2.6.18-5-686: false
  linux-image-2.6.18-5-686/prerm/removing-running-kernel-2.6.18-5-686: true
  linux-image-2.6.18-5-686/postinst/bootloader-error-2.6.18-5-686:
  linux-image-2.6.18-5-686/postinst/depmod-error-2.6.18-5-686: false
  linux-image-2.6.18-5-686/preinst/already-running-this-2.6.18-5-686:
  linux-image-2.6.18-5-686/preinst/lilo-initrd-2.6.18-5-686: true
  linux-image-2.6.18-5-686/preinst/overwriting-modules-2.6.18-5-686: true
  linux-image-2.6.18-5-686/preinst/bootloader-initrd-2.6.18-5-686: true
  linux-image-2.6.18-5-686/postinst/old-dir-initrd-link-2.6.18-5-686: true
  linux-image-2.6.18-5-686/postinst/old-initrd-link-2.6.18-5-686: true


