ithout_osd_lock
The Ceph version is Octopus 15.2.17.
OSD storage backend: BlueStore
OS: CentOS 7, 64-bit.
Any idea?
Thanks
&
Best regards,
Felix Lee ~
--
Felix H.T Lee Academia Sinica Grid & Cloud.
Tel: +886-2-27898308
Office: Room P111, Institute of Phy
Dear experts,
Sorry, I forgot to mention that the initial symptom is that those OSDs
report "wait_auth_rotating timed out" and "unable to obtain
rotating service keys; retrying".
I then increased rotating_keys_bootstrap_timeout, but it didn't really
help.
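
To check whether a timeout change makes any difference, it can help to count how often the two messages above appear in an OSD log before and after the change. The following is only a minimal sketch: the helper name count_auth_errors and the log path in the usage note are assumptions for illustration, and it simply matches the message text quoted above.

```python
from collections import Counter

# Message fragments quoted in this thread; matched as plain substrings.
PATTERNS = (
    "wait_auth_rotating timed out",
    "unable to obtain rotating service keys",
)


def count_auth_errors(lines):
    """Count how many lines contain each rotating-key error message."""
    counts = Counter()
    for line in lines:
        for pattern in PATTERNS:
            if pattern in line:
                counts[pattern] += 1
    return counts


def count_auth_errors_in_file(path):
    """Same as count_auth_errors, reading the log from a file path."""
    with open(path, errors="replace") as handle:
        return count_auth_errors(handle)
```

For example, count_auth_errors_in_file("/var/log/ceph/ceph-osd.0.log") would return a Counter keyed by message, assuming the OSD writes its log to that (hypothetical) path.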
Best
oin time
and maybe improve it? We always need to give users an estimated
recovery time.
Thanks
&
Best regards,
Felix Lee ~
--
Felix H.T Lee Academia Sinica Grid & Cloud.
Tel: +886-2-27898308
Office: Room P111, Institute of Physics, 128 Academia
o 20 for a
while as ceph-mds.ceph16.log-20220516.gz
Thanks
&
Best regards,
Felix Lee ~
On 5/16/22 14:45, Jos Collin wrote:
It's hard to suggest anything without the logs. Enable verbose logging
with debug_mds=20.
What's the Ceph version? Do you have logs showing why the MDS crashed?
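
One way to raise the MDS debug level is through the centralized config store with the ceph CLI. The sketch below only builds the command lines and does not run them; the helper name mds_debug_commands is made up for illustration, and it assumes a release where "ceph config set" and "ceph config rm" are available. Level 20 is very verbose, so the second command removes the override once the logs have been collected.

```python
import shlex


def mds_debug_commands(level=20):
    """Return (enable, revert) argv lists for the ceph CLI.

    Assumption: the cluster supports the centralized config commands
    'ceph config set' and 'ceph config rm'.
    """
    if not 0 <= level <= 20:
        raise ValueError("debug level must be between 0 and 20")
    enable = ["ceph", "config", "set", "mds", "debug_mds", str(level)]
    revert = ["ceph", "config", "rm", "mds", "debug_mds"]
    return enable, revert


if __name__ == "__main__":
    # Print the commands so they can be reviewed before running them.
    for argv in mds_debug_commands():
        print(shlex.join(argv))
```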
On 16/05/22 11:
ere is any way for us to
estimate the rejoin time? So that we can decide whether to wait or take
proactive action if necessary.
Best regards,
Felix Lee ~
On 5/17/22 16:15, Jos Collin wrote:
I suggest upgrading the cluster to the latest release [1], as
Nautilus has reached EOL.
gives us good motivation to speed up the Ceph upgrade.
Again, thank you all for the great input
&
Best regards,
Felix Lee ~
On 5/17/22 19:41, Dan van der Ster wrote:
Hi Felix,
"rejoin" took a while in the past because the MDS needs to reload all
inodes for all the open directorie