More detailed SMART info: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0 3 Spin_Up_Time 0x0027 241 201 021 Pre-fail Always - 6941 4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 39 5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0 7 Seek_Error_Rate 0x002e 100 253 000 Old_age Always - 0 9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 58 10 Spin_Retry_Count 0x0032 100 253 000 Old_age Always - 0 11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 39 192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 31 193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 24 194 Temperature_Celsius 0x0022 119 103 000 Old_age Always - 33 196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0 197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x0008 100 253 000 Old_age Offline - 0
SMART Error Log Version: 1 No Errors Logged --------------- ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 100 253 051 Pre-fail Always - 0 3 Spin_Up_Time 0x0027 243 200 021 Pre-fail Always - 6816 4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 39 5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0 7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0 9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 58 10 Spin_Retry_Count 0x0032 100 253 000 Old_age Always - 0 11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 39 192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 31 193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 25 194 Temperature_Celsius 0x0022 120 105 000 Old_age Always - 32 196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0 197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x0008 100 253 000 Old_age Offline - 0 SMART Error Log Version: 1 No Errors Logged ** Description changed: - Writing lots of data to a 3 TB ext4-formatted logical volume on a 2x6 TB - mdadm raid 1 mirror always results in a GPF within 1 minute of writing - the data. Disks seem to be OK smart-wise. + Writing data to a 3 TB ext4-formatted logical volume on a 2x6 TB mdadm + raid 1 mirror always results in a GPF within 1 minute of writing the + data. Disks seem to be OK smart-wise. When the GPF occurs, the md1 raid volume gets degraded, the entire system gets irresponsive and after the next boot (hard reset), the raid syncing starts from scratch (which takes 10 hours with the two 6 TB - drives to complete). The GPF occurs when the raid is still syncing and - also after it is in sync. + drives). The GPF always happens in mpage_process_page_bufs: Nov 16 09:26:03 nas kernel: [ 230.610495] ------------[ cut here ]------------ Nov 16 09:26:03 nas kernel: [ 230.610510] WARNING: CPU: 1 PID: 1927 at /build/buildd/linux-3.13.0/kernel/exit.c:711 do_exit+0x55/0xa50() Nov 16 09:26:03 nas kernel: [ 230.610531] Modules linked in: ip6table_filter ip6_tables iptable_filter ip_tables ebtable_nat ebtables x_tables snd_hda_codec_hdmi arc4 bridge stp llc x86_pkg_temp_thermal intel_powerc$ Nov 16 09:26:03 nas kernel: [ 230.610810] CPU: 1 PID: 1927 Comm: vsftpd Tainted: G D 3.13.0-39-generic #66-Ubuntu Nov 16 09:26:03 nas kernel: [ 230.610830] Hardware name: Gigabyte Technology Co., Ltd. H97N-WIFI/H97N-WIFI, BIOS F4 06/28/2014 Nov 16 09:26:03 nas kernel: [ 230.610849] 0000000000000009 ffff8800cb61d9e0 ffffffff8171ece7 0000000000000000 Nov 16 09:26:03 nas kernel: [ 230.610877] ffff8800cb61da18 ffffffff8106773d 000000000000000b ffff8800cb61db38 Nov 16 09:26:03 nas kernel: [ 230.610898] 0000000000000246 0000000000000000 ffff8800cb633000 ffff8800cb61da28 Nov 16 09:26:03 nas kernel: [ 230.610920] Call Trace: Nov 16 09:26:03 nas kernel: [ 230.610928] [<ffffffff8171ece7>] dump_stack+0x45/0x56 Nov 16 09:26:03 nas kernel: [ 230.610941] [<ffffffff8106773d>] warn_slowpath_common+0x7d/0xa0 Nov 16 09:26:03 nas kernel: [ 230.610956] [<ffffffff8106781a>] warn_slowpath_null+0x1a/0x20 Nov 16 09:26:03 nas kernel: [ 230.610970] [<ffffffff81069a45>] do_exit+0x55/0xa50 Nov 16 09:26:03 nas kernel: [ 230.610984] [<ffffffff817282b9>] oops_end+0xa9/0x150 Nov 16 09:26:03 nas kernel: [ 230.610997] [<ffffffff810171cb>] die+0x4b/0x70 Nov 16 09:26:03 nas kernel: [ 230.611008] [<ffffffff81727c4e>] do_general_protection+0x11e/0x1b0 Nov 16 09:26:03 nas kernel: [ 230.611024] [<ffffffff81727568>] general_protection+0x28/0x30 Nov 16 09:26:03 nas kernel: [ 230.611039] [<ffffffff8123e0b8>] ? mpage_process_page_bufs+0x38/0x130 Nov 16 09:26:03 nas kernel: [ 230.611055] [<ffffffff8123f0e9>] mpage_prepare_extent_to_map+0x219/0x2c0 Nov 16 09:26:03 nas kernel: [ 230.611071] [<ffffffff81242ef2>] ext4_writepages+0x3f2/0xd20 Nov 16 09:26:03 nas kernel: [ 230.611086] [<ffffffff8115a9be>] do_writepages+0x1e/0x40 Nov 16 09:26:03 nas kernel: [ 230.611101] [<ffffffff8114fac9>] __filemap_fdatawrite_range+0x59/0x60 Nov 16 09:26:03 nas kernel: [ 230.611117] [<ffffffff8114fb7c>] filemap_flush+0x1c/0x20 Nov 16 09:26:03 nas kernel: [ 230.611130] [<ffffffff81240928>] ext4_alloc_da_blocks+0x38/0x70 Nov 16 09:26:03 nas kernel: [ 230.611156] [<ffffffff81239e89>] ext4_release_file+0x79/0xc0 Nov 16 09:26:03 nas kernel: [ 230.611171] [<ffffffff811bed84>] __fput+0xe4/0x260 Nov 16 09:26:03 nas kernel: [ 230.611184] [<ffffffff811bef4e>] ____fput+0xe/0x10 Nov 16 09:26:03 nas kernel: [ 230.611197] [<ffffffff81088227>] task_work_run+0xa7/0xe0 Nov 16 09:26:03 nas kernel: [ 230.611210] [<ffffffff81013df7>] do_notify_resume+0x97/0xb0 Nov 16 09:26:03 nas kernel: [ 230.611225] [<ffffffff8172faaa>] int_signal+0x12/0x17 Nov 16 09:26:03 nas kernel: [ 230.612259] ---[ end trace c2c7ff1f119c2b75 ]--- ---------------------------------------------------------------------------------------- apt-cache policy e2fslibs:amd64 e2fslibs: Installed: 1.42.9-3ubuntu1 Candidate: 1.42.9-3ubuntu1 Version table: *** 1.42.9-3ubuntu1 0 500 http://ch.archive.ubuntu.com/ubuntu/ trusty/main amd64 Packages 100 /var/lib/dpkg/status ---------------------------------------------------------------------------------------- smartctl -H /dev/sdc smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.13.0-39-generic] (local build) Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org smartctl -H /dev/sdc smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.13.0-39-generic] (local build) Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED smartctl -H /dev/sdd smartctl 6.2 2013-07-26 r3841 [x86_64-linux-3.13.0-39-generic] (local build) Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED ---------------------------------------------------------------------------------------- /dev/md1: Version : 1.2 Creation Time : Sat Nov 15 20:27:36 2014 Raid Level : raid1 Array Size : 5860389696 (5588.90 GiB 6001.04 GB) Used Dev Size : 5860389696 (5588.90 GiB 6001.04 GB) Raid Devices : 2 Total Devices : 2 Persistence : Superblock is persistent Intent Bitmap : Internal Update Time : Sun Nov 16 09:15:47 2014 State : active Active Devices : 2 Working Devices : 2 Failed Devices : 0 Spare Devices : 0 Name : nas:1 (local to host nas) UUID : 035d6d6e:d213adb0:56b6b6d8:d0cf202c Events : 155 Number Major Minor RaidDevice State 0 8 33 0 active sync /dev/sdc1 1 8 49 1 active sync /dev/sdd1 ---------------------------------------------------------------------------------------- --- Physical volume --- PV Name /dev/md1 VG Name vg_hdd1 PV Size 5.46 TiB / not usable 4.81 MiB Allocatable yes PE Size 4.00 MiB Total PE 1430758 Free PE 644326 Allocated PE 786432 PV UUID M0yvqu-vnTR-lNVg-nxRL-CSzK-hVCD-DDWcet ---------------------------------------------------------------------------------------- --- Volume group --- VG Name vg_hdd1 System ID Format lvm2 Metadata Areas 1 Metadata Sequence No 2 VG Access read/write VG Status resizable MAX LV 0 Cur LV 1 Open LV 0 Max PV 0 Cur PV 1 Act PV 1 VG Size 5.46 TiB PE Size 4.00 MiB Total PE 1430758 Alloc PE / Size 786432 / 3.00 TiB Free PE / Size 644326 / 2.46 TiB VG UUID MVW4e0-oeYT-RKPp-j2E9-7RT2-WBjt-Zof2Nn ---------------------------------------------------------------------------------------- --- Logical volume --- LV Path /dev/vg_hdd1/lv_share LV Name lv_share VG Name vg_hdd1 LV UUID VhywGd-c3kY-IXRd-SomS-ypAe-a4rg-5oQKKE LV Write Access read/write LV Creation host, time nas, 2014-11-16 09:17:51 +0100 LV Status available # open 0 LV Size 3.00 TiB Current LE 786432 Segments 1 Allocation inherit Read ahead sectors auto - currently set to 256 Block device 252:4 ---------------------------------------------------------------------------------------- lshw -short H/W path Device Class Description ==================================================== system H97N-WIFI (To be filled by O.E.M.) /0 bus H97N-WIFI /0/0 memory 64KiB BIOS /0/3d processor Intel(R) Core(TM) i3-4160 CPU @ 3.60GHz /0/3d/3e memory 128KiB L1 cache /0/3d/3f memory 512KiB L2 cache /0/3d/40 memory 3MiB L3 cache /0/42 memory 8GiB System Memory /0/42/0 memory 4GiB DIMM DDR3 Synchronous 1600 MHz (0.6 ns) /0/42/1 memory DIMM [empty] /0/42/2 memory 4GiB DIMM DDR3 Synchronous 1600 MHz (0.6 ns) /0/42/3 memory DIMM [empty] /0/100 bridge 4th Gen Core Processor DRAM Controller /0/100/2 display Intel Corporation /0/100/3 multimedia Xeon E3-1200 v3/4th Gen Core Processor HD Audio Controller /0/100/14 bus Intel Corporation /0/100/16 communication Intel Corporation /0/100/19 eth1 network Ethernet Connection I217-V /0/100/1a bus Intel Corporation /0/100/1b multimedia Intel Corporation /0/100/1c bridge Intel Corporation /0/100/1c.3 bridge Intel Corporation /0/100/1c.3/0 eth0 network AR8161 Gigabit Ethernet /0/100/1c.4 bridge Intel Corporation /0/100/1c.4/0 wlan0 network Wireless 7260 /0/100/1d bus Intel Corporation /0/100/1f bridge Intel Corporation /0/100/1f.2 storage Intel Corporation /0/100/1f.3 bus Intel Corporation /0/1 scsi1 storage /0/1/0.0.0 /dev/sda disk 120GB INTEL SSDSC2CT12 /0/1/0.0.0/1 /dev/sda1 volume 510MiB Windows FAT volume /0/1/0.0.0/2 /dev/sda2 volume 111GiB RAID partition /0/2 scsi3 storage /0/2/0.0.0 /dev/sdb disk 120GB INTEL SSDSC2CT12 /0/2/0.0.0/1 /dev/sdb1 volume 510MiB Windows FAT volume /0/2/0.0.0/2 /dev/sdb2 volume 111GiB RAID partition /0/3 scsi4 storage /0/3/0.0.0 /dev/sdc disk 6001GB WDC WD60EFRX-68M /0/3/0.0.0/1 /dev/sdc1 volume 5589GiB RAID partition /0/4 scsi5 storage /0/4/0.0.0 /dev/sdd disk 6001GB WDC WD60EFRX-68M /0/4/0.0.0/1 /dev/sdd1 volume 5589GiB RAID partition /1 power To Be Filled By O.E.M. ---------------------------------------------------------------------------------------- Linux nas 3.13.0-39-generic #66-Ubuntu SMP Tue Oct 28 13:30:27 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux Description: Ubuntu 14.04.1 LTS Release: 14.04 -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1393151 Title: Writing to ext4 LVM FS causes GPF in mpage_process_page_bufs To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/e2fsprogs/+bug/1393151/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs