It's the Smart Storage battery, which was disabled due to high ambient temperature. All OSD processes/daemons kept running as usual, but those OSDs were not responding to other OSDs because of high CPU utilization. We did not observe a clock skew issue.

On Tue, Apr 16, 2019 at 12:49 PM Marco Gaiarin <g...@sv.lnf.it> wrote:

> Hello! M Ranga Swami Reddy
>   On that day it was said...
>
> > Hello - Recently we had an issue with a storage node's battery failure, which
> > caused ceph client IO to drop to '0' bytes. This means the ceph cluster couldn't perform
> > IO operations until the node was taken out. This is not expected from
> > Ceph: when some HW fails, the respective OSDs should be marked out/down and IO
> > should continue as is.
> > Please let me know if anyone has seen similar behavior, and whether this issue
> > is resolved?
>
> Does 'battery' mean 'CMOS battery'?
>
> OSDs and MONs need accurate clock sync between them. So, if a node
> reboots with a clock skew of more than (as far as I remember) 5 seconds, the OSD
> does not start.
>
> Provide a stable NTP server for all your OSDs and MONs, and restart the
> OSDs after the clocks are in sync.
>
> --
> dott. Marco Gaiarin                          GNUPG Key ID: 240A3D66
> Associazione ``La Nostra Famiglia''          http://www.lanostrafamiglia.it/
> Polo FVG - Via della Bontà, 7 - 33078 - San Vito al Tagliamento (PN)
> marco.gaiarin(at)lanostrafamiglia.it   t +39-0434-842711   f +39-0434-842797
>
> Donate your 5 PER MILLE to LA NOSTRA FAMIGLIA!
> http://www.lanostrafamiglia.it/index.php/it/sostienici/5x1000
> (tax code 00307430132, category ONLUS or RICERCA SANITARIA)
> _______________________________________________
> ceph-users mailing list
> ceph-users@lists.ceph.com
> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com