Hi there!

The failure of one disk destroyed all my data.  Physically removing the
broken disk caused zpool to list another disk twice, which led to the
final data corruption.  Everything was fine until the broken disk c7t0d0
was removed.  Is there any way to remove the second c5t0d0 entry from the
pool, which should be the missing c7t0d0?  How can I bring this pool back
up in degraded mode with the two working disks that are left?

   Tomppa

Timeline:

*** Jan  2007 raidz1 zpool v was built using 3 Lacie 500G USB disks 
c5t0d0, c7t0d0 and c8t0d0

*** Jul  6 11:12 c7t0d0 fails
> Jul  6 11:12:29 iki scsi: [ID 107833 kern.warning] WARNING: /[EMAIL 
> PROTECTED],700000/[EMAIL PROTECTED],2/[EMAIL PROTECTED]/[EMAIL PROTECTED],0 
> (sd3):
> Jul  6 11:12:29 iki     SCSI transport failed: reason 'timeout': retrying 
> command
> Jul  6 11:12:33 iki scsi: [ID 107833 kern.warning] WARNING: /[EMAIL 
> PROTECTED],700000/[EMAIL PROTECTED],2/[EMAIL PROTECTED]/[EMAIL PROTECTED],0 
> (sd3):
> Jul  6 11:12:33 iki     SCSI transport failed: reason 'tran_err': retrying 
> command
> Jul  6 11:13:33 iki     Error for Command: read(10)                Error 
> Level: Retryable
> Jul  6 11:13:33 iki scsi: [ID 107833 kern.notice]       Requested Block: 
> 90452642                  Error Block: 90452642
> Jul  6 11:13:33 iki scsi: [ID 107833 kern.notice]       Vendor: SAMSUNG       
>                      Serial Number:             
> Jul  6 11:13:33 iki scsi: [ID 107833 kern.notice]       Sense Key: No 
> Additional Sense
> Jul  6 11:13:33 iki scsi: [ID 107833 kern.notice]       ASC: 0x0 (no 
> additional sense info), ASCQ: 0x0, FRU: 0x0
> Jul  6 11:13:33 iki scsi: [ID 107833 kern.warning] WARNING: /[EMAIL 
> PROTECTED],700000/[EMAIL PROTECTED],2/[EMAIL PROTECTED]/[EMAIL PROTECTED],0 
> (sd3):
> Jul  6 11:13:33 iki     Error for Command: read(10)                Error 
> Level: Retryable
> Jul  6 11:13:33 iki scsi: [ID 107833 kern.notice]       Requested Block: 
> 90452642                  Error Block: 90452642
> Jul  6 11:13:33 iki scsi: [ID 107833 kern.notice]       Vendor: SAMSUNG       
>                      Serial Number:             
> Jul  6 11:13:33 iki scsi: [ID 107833 kern.notice]       Sense Key: No 
> Additional Sense
> Jul  6 11:13:33 iki scsi: [ID 107833 kern.notice]       ASC: 0x0 (no 
> additional sense info), ASCQ: 0x0, FRU: 0x0
> Jul  6 11:13:33 iki scsi: [ID 107833 kern.warning] WARNING: /[EMAIL 
> PROTECTED],700000/[EMAIL PROTECTED],2/[EMAIL PROTECTED]/[EMAIL PROTECTED],0 
> (sd3):

*** Jul  8 03:35:26
resilver completed with 0 errors on Sun Jul  8 03:35:26 2007


*** Jul  8 12:00
> % zpool status
>  pool: v
> state: ONLINE
> status: One or more devices has experienced an unrecoverable error.  An
>        attempt was made to correct the error.  Applications are
> unaffected.
> action: Determine if the device needs to be replaced, and clear the
> errors
>        using 'zpool clear' or replace the device with 'zpool replace'.
>   see: http://www.sun.com/msg/ZFS-8000-9P
> scrub: resilver completed with 0 errors on Sun Jul  8 03:35:26 2007
> config:
>
>        NAME        STATE     READ WRITE CKSUM
>        v           ONLINE       0     0   369
>          raidz1    ONLINE       0     0   369
>            c5t0d0  ONLINE       0     0     1
>            c7t0d0  ONLINE      29 2.54K   507
>            c8t0d0  ONLINE       0     0     0
>
> errors: No known data errors
> % format -e
> Searching for disks...done
>
>
> AVAILABLE DISK SELECTIONS:
>       0. c2t0d0 <SUN146G cyl 14087 alt 2 hd 24 sec 848>
>          /[EMAIL PROTECTED],600000/SUNW,[EMAIL PROTECTED]/[EMAIL 
> PROTECTED],0/[EMAIL PROTECTED],0
>       1. c2t1d0 <SUN146G cyl 14087 alt 2 hd 24 sec 848>
>          /[EMAIL PROTECTED],600000/SUNW,[EMAIL PROTECTED]/[EMAIL 
> PROTECTED],0/[EMAIL PROTECTED],0
>       2. c5t0d0 <SAMSUNG-HD501LJ-CR10-465.76GB>
>          /[EMAIL PROTECTED],700000/[EMAIL PROTECTED],2/[EMAIL 
> PROTECTED]/[EMAIL PROTECTED],0
>       3. c7t0d0 <drive not available>
>          /[EMAIL PROTECTED],700000/[EMAIL PROTECTED],2/[EMAIL 
> PROTECTED]/[EMAIL PROTECTED],0
>       4. c8t0d0 <SAMSUNG-HD501LJ-CR10-465.76GB>
>          /[EMAIL PROTECTED],700000/[EMAIL PROTECTED],2/[EMAIL 
> PROTECTED]/[EMAIL PROTECTED],0
> Specify disk (enter its number): ^D
>
> % iostat -En
> c5t0d0           Soft Errors: 900 Hard Errors: 0 Transport Errors: 0
> Vendor: SAMSUNG  Product: HD501LJ          Revision: CR10 Serial No:
> Size: 500.11GB <500107862016 bytes>
> Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
> Illegal Request: 900 Predictive Failure Analysis: 0
> sd3              Soft Errors: 4258 Hard Errors: 2 Transport Errors: 2698
> Vendor: SAMSUNG  Product: HD501LJ          Revision: CR10 Serial No:
> Size: 500.11GB <500107862016 bytes>
> Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
> Illegal Request: 787 Predictive Failure Analysis: 0
> c8t0d0           Soft Errors: 843 Hard Errors: 0 Transport Errors: 0
> Vendor: SAMSUNG  Product: HD501LJ          Revision: CR10 Serial No:
> Size: 500.11GB <500107862016 bytes>
> Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
> Illegal Request: 843 Predictive Failure Analysis: 0


*** Jul  8 14:12 c7t0d0 physically removed
> % zpool status
>   pool: v
>  state: ONLINE
> status: One or more devices has experienced an error resulting in data
>         corruption.  Applications may be affected.
> action: Restore the file in question if possible.  Otherwise restore the
>         entire pool from backup.
>    see: http://www.sun.com/msg/ZFS-8000-8A
>  scrub: resilver completed with 0 errors on Sun Jul  8 03:35:26 2007
> config:
>
>         NAME        STATE     READ WRITE CKSUM
>         v           ONLINE       0     0 9.96K
>           raidz1    ONLINE       0     0 9.96K
>             c5t0d0  ONLINE       0     0    40
>             c5t0d0  ONLINE      29 2.54K 1.35K
>             c8t0d0  ONLINE       0     0     0
>
> errors: 0 data errors, use '-v' for a list
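
The duplicated c5t0d0 entry above is easy to miss by eye.  Below is a
minimal sketch of how saved 'zpool status' output could be checked for it.
duplicate_devices is a hypothetical helper written for illustration, not
part of ZFS; it assumes the config table starts at the NAME header, ends
at the "errors:" line, and that disks use cXtYdZ-style names as in the
output above.

```python
import re
from collections import Counter


def duplicate_devices(status_text):
    """Return device names listed more than once in a zpool status config table."""
    names = []
    in_config = False
    for line in status_text.splitlines():
        # Strip mail-style quoting ("> ") before looking at the content.
        line = re.sub(r"^\s*>\s?", "", line)
        fields = line.split()
        if not fields:
            continue
        if fields[0] == "NAME":
            # Header row of the config table: device rows follow.
            in_config = True
            continue
        if fields[0] == "errors:":
            # The "errors:" summary line ends the config table.
            in_config = False
        # Only count leaf disks (cXtYdZ, optionally with a slice suffix),
        # not the pool name or the raidz1 vdev row.
        if in_config and re.fullmatch(r"c\d+t\d+d\d+(s\d+)?", fields[0]):
            names.append(fields[0])
    counts = Counter(names)
    return sorted(name for name, count in counts.items() if count > 1)
```

Fed the status output captured after the disk was pulled, this would
report c5t0d0; on the earlier output from Jul 8 12:00 it would report
nothing.  It only detects the symptom and does not repair the pool.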

*** Jul 10 09:30 zfs unmount v ; zpool export v ; zpool import v
which caused this panic:
> Jul 10 09:30:19 iki savecore: [ID 570001 auth.error] reboot after panic:
> assertion failed: dmu_read(os, smo->smo_object, offset, size, entry_map) == 0
> (0x6 == 0x0), file: ../../common/fs/zfs/space_map.c, line: 307

*** Jul 10 14:37
> % zpool import
>   pool: v
>     id: 16534952157184541936
>  state: DEGRADED
> status: One or more devices contains corrupted data.
> action: The pool can be imported despite missing or damaged devices.  The
>         fault tolerance of the pool may be compromised if imported.
>    see: http://www.sun.com/msg/ZFS-8000-4J
> config:
> 
>         v           DEGRADED
>           raidz1    DEGRADED
>             c5t0d0  FAULTED   corrupted data
>             c5t0d0  ONLINE
>             c8t0d0  ONLINE