Hi all!

Reaching out again about this issue since I haven't had much luck. We've
been seeing some strange behavior with our object storage cluster. While
bucket stats (radosgw-admin bucket stats) normally return in a matter of
seconds, we frequently observe it taking almost ten minutes, which is not
convenient since we use those bucket stats for billing/accounting.
Restarting the radosgw process on the RGWs fixes this issue until it crops
up again in maybe a few days.

Someone mentioned that they think this might have to do with bucket
deletions, or more specifically, lifecycle policies to abort incomplete
multipart uploads. He mentioned there was an item in the bug tracker for
this, but I have not been able to find said bug in the tracker. I have no
clue if this is the case or not, but I figured I'd throw it out there to
see if anyone else has run into this problem. I have seen many of these
messages in my RGW logs:

2019-12-02 13:12:52.882 7faa7018f700 0 abort_bucket_multiparts WARNING :
aborted 8553000 incomplete multipart uploads

So maybe there is some truth to the aborted multipart uploads causing
problems?

My cluster has over 200 OSDs, 10 RGWs, about 2200 buckets. Running Nautilus
14.2.5.

If anyone has run into this or has any information I'd appreciate it.

Merry Christmas & Happy Holidays,
- Dave
_______________________________________________
ceph-users mailing list -- ceph-users@ceph.io
To unsubscribe send an email to ceph-users-le...@ceph.io

Reply via email to