Hi all,
I have two ceph clusters in RGW multisite environment, with ~1500 bucketes
( 500M objects, 70TB ).
Some of the buckets are very dynamic (objects are constantly changing).
I have problems with large omap objects in bucket indexes, related with
"dynamic buckets".
For example:
[root@rgw ~]#
We had a outage of our Jewel 10.2.11 CephFS last night. Our primary MDS hit
an assert in ceph try_remove_dentries_for_stray(), but the replay MDS never
came up. The logs for MDS02 show:
---like clockwork these first two lines appear every second---
2019-08-02 16:27:24.664508 7f6f47f5c700 1 mds.0.