[ceph-users] OSD booting gets stuck after log_to_monitors step

2022-11-30 Thread Felix Lee
ithout_osd_lock The ceph version is Octopus: 15.2.17. OSD storage backend: bluestore OS: CentOS7 64bit. Any idea? Thanks & Best regards, Felix Lee ~ -- Felix H.T Lee Academia Sinica Grid & Cloud. Tel: +886-2-27898308 Office: Room P111, Institute of Phy

[ceph-users] Re: OSD booting gets stuck after log_to_monitors step

2022-11-30 Thread Felix Lee
Dear experts, Sorry, I missed to mention that the initial symptom is that those OSDs will suffer: "wait_auth_rotating timed out" and "unable to obtain rotating service keys; retrying" I then increased rotating_keys_bootstrap_timeout, but it doesn't really help. Best

[ceph-users] Reasonable MDS rejoin time?

2022-05-15 Thread Felix Lee
oin time and maybe improve it? because we always need to tell user the time estimation of its recovery. Thanks & Best regards, Felix Lee ~ -- Felix H.T Lee Academia Sinica Grid & Cloud. Tel: +886-2-27898308 Office: Room P111, Institute of Physics, 128 Academia

[ceph-users] Re: Reasonable MDS rejoin time?

2022-05-16 Thread Felix Lee
o 20 for a while as ceph-mds.ceph16.log-20220516.gz Thanks & Best regards, Felix Lee ~ On 5/16/22 14:45, Jos Collin wrote: It's hard to suggest without the logs. Do verbose logging debug_mds=20. What's the ceph version? Do you have the logs why the MDS crashed? On 16/05/22 11:

[ceph-users] Re: Reasonable MDS rejoin time?

2022-05-17 Thread Felix Lee
ere is any way for us to estimate the rejoin time? So that we can decide whether to wait or take proactive action if necessary. Best regards, Felix Lee ~ On 5/17/22 16:15, Jos Collin wrote: I suggest you to upgrade the cluster to the latest release [1], as nautilus reached EOL.

[ceph-users] Re: Reasonable MDS rejoin time?

2022-05-17 Thread Felix Lee
gives us good motivation to speed up the Ceph upgrade. Again, thanks you all for the great inputs & Best regards, Felix Lee ~ On 5/17/22 19:41, Dan van der Ster wrote: Hi Felix, "rejoin" took awhile in the past because the MDS needs to reload all inodes for all the open directorie