Public bug reported:

Binary package hint: linux-source-2.6.22

Fresh install of the latest Ubuntu 7.10 Server i386 (Gutsy), with all
fixes applied.

uname -a:
Linux house 2.6.22-14-server #1 SMP Sun Oct 14 23:34:23 GMT 2007 i686 GNU/Linux

System is a Jetway J7F4, with VIA raid on board, and a Silicon Image-
based PCI Sata-raid card. System has 3x500GB sata drives installed, 2 on
the motherboard, and one on the PCI card. The system was installed from
a USB CD-drive, as it does not have a permanently attached CD-ROM drive.
The drives were partitioned as 1GB, 512MB and "the rest" - about 499GB.
Using software raid these were then formed into a 3 partition RAID1 set
for /boot, a 3 partition RAID1 set for swap, and a 3 partition RAID5 set
for / respectively. Install appeared to go normally, and on first reboot
the raid arrays were rebuilt. Some errors from the ata driver (?) were
reported on the console, but apart from significant slow downs in the
rebuild rate (drops from nearly 50MB/s to less than 8MB/s) there
appeared to be no problems. System was then lightly used for a couple of
days (some minor initial configuration work) and again I noticed a very
occasional error message on the console.

Stupidly, I didn't take note of the exact errors, but on examining my
kern.log, I can see that they would have been related to errors such as
the following (extracted from that file), which always relate to ata1,
which is the sata drive plugged into the PCI raid card (sda) :

Oct 31 12:08:51 house kernel: [  318.940000] ata1.00: exception Emask 0x0 SAct 
0x0 SErr 0x0 action 0x2 frozen
Oct 31 12:08:51 house kernel: [  318.940000] ata1.00: cmd 
c8/00:00:b8:27:52/00:00:00:00:00/ea tag 0 cdb 0x0 data 131072 in
Oct 31 12:08:51 house kernel: [  318.940000]          res 
40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Oct 31 12:08:51 house kernel: [  319.270000] ata1: soft resetting port
Oct 31 12:08:51 house kernel: [  319.430000] ata1: SATA link up 1.5 Gbps 
(SStatus 113 SControl 310)
Oct 31 12:08:51 house kernel: [  319.490000] ata1.00: configured for UDMA/100
Oct 31 12:08:51 house kernel: [  319.490000] ata1: EH complete
Oct 31 12:08:51 house kernel: [  319.510000] sd 0:0:0:0: [sda] 976773168 
512-byte hardware sectors (500108 MB)
Oct 31 12:08:51 house kernel: [  319.520000] sd 0:0:0:0: [sda] Write Protect is 
off
Oct 31 12:08:51 house kernel: [  319.520000] sd 0:0:0:0: [sda] Mode Sense: 00 
3a 00 00
Oct 31 12:08:51 house kernel: [  319.550000] sd 0:0:0:0: [sda] Write cache: 
enabled, read cache: enabled, doesn't support DPO or FUA

Then, last night I tried to copy some 12-15GB of data to the system, and
I noticed *many* errors being echoed to the console, again all related
to ata1.  At some point in the process it appears that the ata driver
was unable to reset the port, even using a hard reset, and the drive was
"disabled", which caused the software raid system to remove that drives
partitions from the raid sets. Fortunately the system continued to run
on the other drives, but I couldn't get the ata1 drive up again.  I
needed to reboot the box to regain access to the drive.  I left the
system rebuilding the raid sets in single-user mode this morning ... no
errors were apparent on the log or console at that time, but I will add
anything I find when I get home this evening.

This problem looks similar to several other bugs in the system, though there 
are differences between this and them, as follows:
https://bugs.launchpad.net/ubuntu/+bug/64587 (and duplicates) ... their 
discussion seems indicate that a CD or DVD drive is involved
https://bugs.launchpad.net/ubuntu/+bug/84603 (and duplicates) ... again, 
discussion seems indicate that a CD or DVD drive is involved
https://bugs.launchpad.net/ubuntu/+bug/75295 (and duplicates) ... again, 
discussion seems indicate that a CD or DVD drive is involved
https://bugs.launchpad.net/ubuntu/+bug/103277 ... again discussion seems 
indicate that a CD or DVD drive is involved
https://bugs.launchpad.net/ubuntu/+bug/121612 ... problems with sata_sil 
reported, but not a good match to the symptoms I see
http://bugzilla.kernel.org/show_bug.cgi?id=8316 ... seems closely related to 
84603, and again seems focused on a CD or DVD drive being involved.

Will attach kern.log (from time of install to fail of raid system last
night), the output from lspci -vv and hdparm -I next.

** Affects: linux-source-2.6.22 (Ubuntu)
     Importance: Undecided
         Status: New

-- 
Port slow to respond on SiI3512 with sata_sil
https://bugs.launchpad.net/bugs/159521
You received this bug notification because you are a member of Ubuntu
Bugs, which is the bug contact for Ubuntu.

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to