[ceph-users] ceph-osd suddenly dies and no longer can be started

2015-08-18 Thread Евгений Д.
Hello. I have a small Ceph cluster running 9 OSDs, using XFS on separate disks and dedicated partitions on the system disk for journals. After creation it worked fine for a while. Then one of the OSDs suddenly stopped and would not start again. I had to recreate it. Recovery started. After a few days of recovery OSD

Re: [ceph-users] ceph-osd suddenly dies and no longer can be started

2015-08-19 Thread Евгений Д.
. But this "malloc -> ENOMEM/OOM killer -> corrupted journal -> trying to recover -> ENOMEM/OOM killer ..." looks like a bug.
2015-08-19 0:13 GMT+03:00 Евгений Д.:
> Hello.
>
> I have a small Ceph cluster running 9 OSDs, using XFS on separate disks
> and dedi

[ceph-users] OSD won't go up after node reboot

2015-08-29 Thread Евгений Д.
I'm running a 3-node cluster with Ceph (it's a Deis cluster, so the Ceph daemons are containerized). There are 3 OSDs and 3 mons. After rebooting all nodes one by one, all monitors are up, but only two of the three OSDs are up. The 'down' OSD is actually running but is never marked up/in. All three mons are reachable

Re: [ceph-users] OSD won't go up after node reboot

2015-08-31 Thread Евгений Д.
ally looks like something was broken on reboot, probably during container start, so it's not really related to Ceph. I'll go with OSD recreation. Thank you.
2015-08-31 11:50 GMT+03:00 Gregory Farnum:
> On Sat, Aug 29, 2015 at 3:32 PM, Евгений Д. wrote:
> > I'm running 3

Re: [ceph-users] OSD won't go up after node reboot

2015-09-01 Thread Евгений Д.
ssible that something else was mounted there?
> Or is it possible nothing was mounted there?
> That would explain such behaviour...
>
> Jan
>
> On 31 Aug 2015, at 17:07, Евгений Д. wrote:
>
> No, it really was in the cluster. Before reboot cluster had HEALTH_OK.
> Though now I