Did you run the cluster with only a single monitor?

Paul

-- 
Paul Emmerich

Looking for help with your Ceph cluster? Contact us at https://croit.io

croit GmbH
Freseniusstr. 31h
81247 München
www.croit.io
Tel: +49 89 1896585 90


On Thu, Jun 27, 2019 at 4:32 PM Anton Aleksandrov <an...@aleksandrov.eu>
wrote:

> Hello community,
>
> we have developed a cluster on latest mimic release. We are on quite old
> hardware, but using Centos7. Monitor, manager and all the same host.
> Cluster has been running for some week without actual workload. There might
> have been some sort of power failure (not proved), but at some point
> monitor node died and won't start anymore. Below is a log from
> /var/log/messages. What can be done here? Can this be recovered somehow or
> did we loose everything? All the OSDs seems to be running fine, just that
> the cluster is not working.
>
> The log is not full, but I think that those line are quite critical..
>
> Jun 27 17:14:06 mds1 ceph-mon: -311> 2019-06-27 17:14:06.169 7f086aa22700
> -1 *rocksdb: submit_common error: Corruption: block checksum mismatch*:
> expected 3317957558, got 2609532897  in
> /var/lib/ceph/mon/ceph-mds1/store.db/022334.sst offset 12775887 size 21652
> code = 2 Rocksdb transaction:
> Jun 27 17:14:06 mds1 ceph-mon: Put( Prefix = p key =
> 'xos'0x006c6173't_committed' Value size = 8)
> Jun 27 17:14:06 mds1 ceph-mon: Put( Prefix = m key =
> 'nitor_store'0x006c6173't_metadata' Value size = 612)
> Jun 27 17:14:06 mds1 ceph-mon: Put( Prefix = l key =
> 'gm'0x0066756c'l_155850' Value size = 31307)
> Jun 27 17:14:06 mds1 ceph-mon: Put( Prefix = l key =
> 'gm'0x0066756c'l_latest' Value size = 8)
> Jun 27 17:14:06 mds1 ceph-mon: Put( Prefix = l key = 'gm'0x00313535'851'
> Value size = 672)
> Jun 27 17:14:06 mds1 ceph-mon: Put( Prefix = l key =
> 'gm'0x006c6173't_committed' Value size = 8)
> Jun 27 17:14:06 mds1 ceph-mon: -311> 2019-06-27 17:14:06.172 7f086aa22700
> -1 /home/jenkins-build/build/workspace/ceph-build/ARCH/x86_64/AVAILABLE
> _ARCH/x86_64/AVAILABLE_DIST/centos7/DIST/centos7/MACHINE_SIZE/huge/release/13.2.6/rpm/el7/BUILD/ceph-13.2.6/src/mon/MonitorDBStore.h:
> In function
>  'int MonitorDBStore::apply_transaction(MonitorDBStore::TransactionRef)'
> thread 7f086aa22700 time 2019-06-27 17:14:06.171474
> Jun 27 17:14:06 mds1 ceph-mon:
> /home/jenkins-build/build/workspace/ceph-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos7/DIST/cento
> s7/MACHINE_SIZE/huge/release/13.2.6/rpm/el7/BUILD/ceph-13.2.6/src/mon/MonitorDBStore.h:
> 311: FAILED assert(0 ==* "failed to write to db"*)
> Jun 27 17:14:06 mds1 ceph-mon: ceph version 13.2.6
> (7b695f835b03642f85998b2ae7b6dd093d9fbce4) mimic (stable)
>
> _______________________________________________
> ceph-users mailing list
> ceph-users@lists.ceph.com
> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
>
_______________________________________________
ceph-users mailing list
ceph-users@lists.ceph.com
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

Reply via email to