Re: [ceph-users] CEPH failuers after 5 journals down

2016-12-08 Thread Krzysztof Nowicki
Hi, We have managed to work around this problem. It seems that one of the objects lost due to the SSD failure was part of the hitset for the cache tier. The OSD was dying after an attempt to open and read the object's data file. In an effort to progress I have found out which file is missing by

[ceph-users] CEPH failuers after 5 journals down

2016-12-08 Thread Wojciech KobryƄ
Hi Ceph Users ! I've got here a CEPH cluster: 6 nodes, 12 OSDs on HDD and SSD disks. All journal OSDs on SSDs. 25 various HDDs in total. We had several HDD failures in past, but every time - it was HDD failure and it was never journal related. After replacing HDD, and recovery procedures all was