[ceph-users] What are mon.-safe containers?

2020-10-08 Thread Sebastian Luna Valero
Hi, When I run `ceph orch ps` I see a couple of containers running on our MON nodes whose names end with the `-safe` suffix, and I was wondering what they are? I couldn't find information about it in https://docs.ceph.com This cluster is running Ceph 15.2.5, recently upgraded from 15.2.4 Many t

[ceph-users] Re: 15.2.8 mgr keep crashing every few days

2021-02-11 Thread Sebastian Luna Valero
Hi, The following thread on this emailing list might be relevant: https://lists.ceph.io/hyperkitty/list/ceph-users@ceph.io/thread/IAJRTIMFALJTZD3KYBHT4G7GEL6EHRR5/#IAJRTIMFALJTZD3KYBHT4G7GEL6EHRR5 Best regards, Sebastian El jue, 11 feb 2021 a las 10:32, levin ng () escribió: > Hi all, > > I’d

[ceph-users] Re: 15.2.8 mgr keep crashing every few days

2021-03-01 Thread Sebastian Luna Valero
2021 a las 21:40, David Orman () escribió: > This is fixed with 15.2.9 and the patch which was merged into that > release to fix the threading issue, coupled with an update cheroot > release now in the docker image. We're running 15.2.9 with no issue, > now! > > On Thu, Feb 1

[ceph-users] ceph orch status hangs forever

2021-05-19 Thread Sebastian Luna Valero
Hi, After an unschedule power outage our Ceph (Octopus) cluster reports a healthy state with: "ceph status". However, when we run "ceph orch status" the command hangs forever. Are there other commands that we can run for a more thorough health check of the cluster? After looking at: https://docs

[ceph-users] Re: ceph orch status hangs forever

2021-05-19 Thread Sebastian Luna Valero
d if the containers > are up and running (assuming it’s cephadm based)? Do the logs also > report the cluster as healthy? > > Zitat von Sebastian Luna Valero : > > > Hi, > > > > After an unschedule power outage our Ceph (Octopus) cluster reports a > > heal

[ceph-users] Re: ceph orch status hangs forever

2021-05-20 Thread Sebastian Luna Valero
Hi Eugen, Here it is: # ceph mgr module ls | jq -r '.enabled_modules[]' cephadm dashboard diskprediction_local iostat prometheus restful Should "crash" and "orchestrator" be part on the list? Why would have they disappeared in the first place? Best regards, Sebastian On Thu, 20 May 2021 at 15:

[ceph-users] Re: ceph orch status hangs forever

2021-05-21 Thread Sebastian Luna Valero
Hi Eugen, Thank you very much for your help! In the end we found that running "sudo systemctl restart ceph.target" on each Ceph node, one by one, monitoring the health of the Cluster with "ceph status" on a separate terminal has been the solution. After restarting everything all commands are now

[ceph-users] Re: Ceph Science meeting

2020-07-17 Thread Sebastian Luna Valero
Hi, FYI: https://lists.ceph.io/hyperkitty/list/ceph-users@ceph.io/thread/YCZKSW5S54JUU3J3TOW5P5V66MQLCBLM/ Best regards, Sebastian El vie., 17 jul. 2020 a las 17:56, Khan, Babar () escribió: > > Hi Ceph users, > > > Can someone please tell me when is Ceph Science user group meeting? > > Best r

[ceph-users] Re: Octopus OSDs dropping out of cluster: _check_auth_rotating possible clock skew, rotating keys expired way too early

2020-09-28 Thread Sebastian Luna Valero
Hi, Today I found the same error messages on the logs: -1 monclient: _check_auth_rotating possible clock skew, rotating keys expired way too early However, I found out after realising that Ceph was running without active manager: cluster: health: HEALTH_WARN no active mgr This