We experienced on our Archive Lustre (ZFS based, 4 OST servers with 6 OSTs 
pools each) the very same issues as described here:
https://jira.whamcloud.com/browse/LU-13392
Certain directories cannot be accessed, and the OSTs shows thousands of errors 
"Can't find FID Sequence". Unfortunately I cannot even start the recommended 
file system checking on the OST devices  - example:
[root@elfsa2o1 ~]# lctl lfsck_start -o -M lfsarc02-OST0002
Fail to start LFSCK: Operation not permitted
[root@elfsa2o1 ~]# lctl lfsck_start -M lfsarc02-OST0002
Fail to start LFSCK: Operation not supported
On a similar system that was first installed as 2.10.4, then upgraded to 
2.10.8, and now is also running on 2.12.4, at least the second command starts:
# lctl lfsck_start -M lfsarc01-OST0002
The commands are issued on the system with the actual ZFS pools running.

Questions:
Is there any way to force the file system checks?
Has anyone found a workaround for the FID sequence errors?
Can I downgrade from 2.12.4 to 2.10.8 without destroying the FS?
Has the error described in https://jira.whamcloud.com/browse/LU-13392 been 
fixed in 
2.12.5<https://jira.whamcloud.com/browse/LU-13392%20been%20fixed%20in%202.12.5>?

Thanks
Michael


------------------------------------------------------------------------
Michael Hebenstreit                 Senior Cluster Architect
Intel Corporation, MS: RR1-105/H14  TSACG
1600 Rio Rancho Blvd SE             Tel.:   +1 505-794-3144
Rio Rancho, NM 87124
UNITED STATES                       E-mail: 
[email protected]<mailto:[email protected]>


_______________________________________________
lustre-discuss mailing list
[email protected]
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org

Reply via email to