Hello, Has anyone ever seen vdev's getting removed and added back to the pool very quickly ? That seems to be what's happening here.
This has started to happen on dozens of machines at different locations since a few days ago. They are running OpenSolaris b111 and a few b126. Could this be bit rot and/or silent corruption getting detected and fixed ? Jan 27 01:18:01 hostname fmd: [ID 441519 daemon.notice] SUNW-MSG-ID: FMD-8000-4M, TYPE: Repair, VER: 1, SEVERITY: Minor Jan 27 01:18:01 hostname EVENT-TIME: Thu Dec 24 08:50:34 BRST 2009 Jan 27 01:18:01 hostname PLATFORM: X7DB8, CSN: 0123456789, HOSTNAME: hostname Jan 27 01:18:01 hostname SOURCE: fmd, REV: 1.2 Jan 27 01:18:01 hostname EVENT-ID: 0cb73c5a-d444-ede6-e49f-fce4aad8a1cd Jan 27 01:18:01 hostname DESC: All faults associated with an event id have been addressed. Jan 27 01:18:01 hostname Refer to http://sun.com/msg/FMD-8000-4M for more information. Jan 27 01:18:01 hostname AUTO-RESPONSE: Some system components offlined because of the original fault may have been brought back online. Jan 27 01:18:01 hostname IMPACT: Performance degradation of the system due to the original fault may have been recovered. Jan 27 01:18:01 hostname REC-ACTION: Use fmdump -v -u <EVENT-ID> to identify the repaired components. Jan 27 01:18:01 hostname fmd: [ID 441519 daemon.notice] SUNW-MSG-ID: FMD-8000-6U, TYPE: Resolved, VER: 1, SEVERITY: Minor Jan 27 01:18:01 hostname EVENT-TIME: Thu Dec 24 08:50:34 BRST 2009 Jan 27 01:18:01 hostname PLATFORM: X7DB8, CSN: 0123456789, HOSTNAME: hostname Jan 27 01:18:01 hostname SOURCE: fmd, REV: 1.2 Jan 27 01:18:01 hostname EVENT-ID: 0cb73c5a-d444-ede6-e49f-fce4aad8a1cd Jan 27 01:18:01 hostname DESC: All faults associated with an event id have been addressed. Jan 27 01:18:01 hostname Refer to http://sun.com/msg/FMD-8000-6U for more information. Jan 27 01:18:01 hostname AUTO-RESPONSE: All system components offlined because of the original fault have been brought back online. Jan 27 01:18:01 hostname IMPACT: Performance degradation of the system due to the original fault has been recovered. Jan 27 01:18:01 hostname REC-ACTION: Use fmdump -v -u <EVENT-ID> to identify the repaired components. # fmdump -e -t 23Jan2010 TIME CLASS # # fmdump TIME UUID SUNW-MSG-ID Jan 27 01:18:01.2372 0cb73c5a-d444-ede6-e49f-fce4aad8a1cd FMD-8000-4M Repaired Jan 27 01:18:01.2391 0cb73c5a-d444-ede6-e49f-fce4aad8a1cd FMD-8000-6U Resolved # fmdump -V TIME UUID SUNW-MSG-ID Jan 27 01:18:01.2372 0cb73c5a-d444-ede6-e49f-fce4aad8a1cd FMD-8000-4M Repaired TIME CLASS ENA Dec 24 08:50:34.4470 ereport.fs.zfs.vdev.corrupt_data 0x533bf0e964a01801 Dec 23 16:08:42.0738 ereport.fs.zfs.probe_failure 0xe87b448c8ba00c01 Dec 23 16:08:42.0739 ereport.fs.zfs.io 0xe87b446b04f00001 Dec 23 16:08:42.0739 ereport.fs.zfs.io 0xe87b44664b300401 Dec 23 16:08:42.0738 ereport.fs.zfs.io 0xe87b445710a01001 Dec 23 16:08:42.0739 ereport.fs.zfs.io 0xe87b4461a4d00c01 nvlist version: 0 version = 0x0 class = list.repaired uuid = 0cb73c5a-d444-ede6-e49f-fce4aad8a1cd code = FMD-8000-4M diag-time = 1261651834 766268 de = (embedded nvlist) nvlist version: 0 version = 0x0 scheme = fmd authority = (embedded nvlist) nvlist version: 0 version = 0x0 product-id = X7DB8 chassis-id = 0123456789 server-id = hostname (end authority) mod-name = fmd mod-version = 1.2 (end de) fault-list-sz = 0x1 fault-list = (array of embedded nvlists) (start fault-list[0]) nvlist version: 0 version = 0x0 class = fault.fs.zfs.device certainty = 0x64 asru = (embedded nvlist) nvlist version: 0 version = 0x0 scheme = zfs pool = 0x9f4842f183c4c7cc vdev = 0xd207014426714df9 (end asru) resource = (embedded nvlist) nvlist version: 0 version = 0x0 scheme = zfs pool = 0x9f4842f183c4c7cc vdev = 0xd207014426714df9 (end resource) (end fault-list[0]) fault-status = 0x6 __ttl = 0x1 __tod = 0x4b5fb069 0xe23eb38 TIME UUID SUNW-MSG-ID Jan 27 01:18:01.2391 0cb73c5a-d444-ede6-e49f-fce4aad8a1cd FMD-8000-6U Resolved TIME CLASS ENA Dec 24 08:50:34.4470 ereport.fs.zfs.vdev.corrupt_data 0x533bf0e964a01801 Dec 23 16:08:42.0738 ereport.fs.zfs.probe_failure 0xe87b448c8ba00c01 Dec 23 16:08:42.0739 ereport.fs.zfs.io 0xe87b446b04f00001 Dec 23 16:08:42.0739 ereport.fs.zfs.io 0xe87b44664b300401 Dec 23 16:08:42.0738 ereport.fs.zfs.io 0xe87b445710a01001 Dec 23 16:08:42.0739 ereport.fs.zfs.io 0xe87b4461a4d00c01 nvlist version: 0 version = 0x0 class = list.resolved uuid = 0cb73c5a-d444-ede6-e49f-fce4aad8a1cd code = FMD-8000-6U diag-time = 1261651834 766268 de = (embedded nvlist) nvlist version: 0 version = 0x0 scheme = fmd authority = (embedded nvlist) nvlist version: 0 version = 0x0 product-id = X7DB8 chassis-id = 0123456789 server-id = hostname (end authority) mod-name = fmd mod-version = 1.2 (end de) fault-list-sz = 0x1 fault-list = (array of embedded nvlists) (start fault-list[0]) nvlist version: 0 version = 0x0 class = fault.fs.zfs.device certainty = 0x64 asru = (embedded nvlist) nvlist version: 0 version = 0x0 scheme = zfs pool = 0x9f4842f183c4c7cc vdev = 0xd207014426714df9 (end asru) resource = (embedded nvlist) nvlist version: 0 version = 0x0 scheme = zfs pool = 0x9f4842f183c4c7cc vdev = 0xd207014426714df9 (end resource) (end fault-list[0]) fault-status = 0x6 __ttl = 0x1 __tod = 0x4b5fb069 0xe411fc8 Thanks, -- Giovanni _______________________________________________ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss