Re: [ceph-users] RGW multisite replication failures

2016-09-28 Thread Ben Morrice
Hello Orit, Thanks for your help so far. The fix for the bug you referenced was not included in 10.2.3. I cherry-picked the commits mentioned in http://tracker.ceph.com/issues/16742 into the 10.2.3 release and deployed this radosgw on the affected servers. Unfortunately it's still failing, now the sync and s

Re: [ceph-users] RGW multisite replication failures

2016-09-27 Thread Ben Morrice
Hello Orit, Yes, this bug looks to correlate. Was this included in 10.2.3? I guess not, as I have since updated to 10.2.3 but am still getting the same errors. This bug talks about not retrying after a failure; however, do you know why the sync fails in the first place? It seems that basically any object ov

Re: [ceph-users] RGW multisite replication failures

2016-09-23 Thread Orit Wasserman
Hi Ben, It seems to be http://tracker.ceph.com/issues/16742. It is being backported to jewel (http://tracker.ceph.com/issues/16794); you can try applying it and see if it helps. Regards, Orit On Fri, Sep 23, 2016 at 9:21 AM, Ben Morrice wrote: > Hello all, > > I have two separate ceph (10.2.2) cl

[ceph-users] RGW multisite replication failures

2016-09-23 Thread Ben Morrice
Hello all, I have two separate ceph (10.2.2) clusters and have configured multisite replication between the two. I can see that some buckets get synced; however, others do not. Both clusters are RHEL7, and I have upgraded libcurl from 7.29 to 7.50 (to avoid http://tracker.ceph.com/issues/15915). Below