[ceph-users] Re: RGW sync gets stuck every day

2024-09-11 Thread Matthew Darwin
I'm on quincy. I had lots of problems with RGW getting stuck.  Once I dedicated 1 single RGW on each side to do replication, my problems went away.  Having a cluster of RGW behind a load balancer seemed to be confusing things. I still have multiple RGW for user-facing load, but a single RGW

[ceph-users] Re: RGW sync gets stuck every day

2024-09-11 Thread Olaf Seibert
So we still have this rgw synchronization that gets stuck every day and about the same time. We have alerting on it, so our on-call people are getting annoyed. Summarizing: we see on the receiving end of an rgw sync something like this: # radosgw-admin sync status --rgw-realm backup

[ceph-users] Re: RGW sync gets stuck every day

2024-08-08 Thread Olaf Seibert
On 08.08.24 08:31, Eugen Block wrote: >> Redeploying stuff seems like a much too big hammer to get things going again. Surely there must be something more reasonable? > > wouldn't a restart suffice? Probably, but when we were handling this the first time around, a redeploy was the first thing

[ceph-users] Re: RGW sync gets stuck every day

2024-08-07 Thread Eugen Block
Hi, Redeploying stuff seems like a much too big hammer to get things going again. Surely there must be something more reasonable? wouldn't a restart suffice? Do you see anything in the 'radosgw-admin sync error list'? Maybe an error prevents the sync from continuing? Zitat von Olaf Seibe