Hello Eugen,

On 20/12/2021 22:02, Eugen Block wrote:
you wrote that this cluster was initially installed with Octopus, so no upgrade ceph wise? Are all RGW daemons on the exact same ceph (minor) versions? I remember one of our customers reporting inconsistent objects on a regular basis although no hardware issues were detectable. They replicate between two sites, too. A couple of months ago both sites were updated to the same exact ceph minor version (also Octopus), they haven't faced inconsistencies since then. I don't have details about the ceph version(s) though, only that both sites were initially installed with Octopus. Maybe it's worth checking your versions?


Yes, everything has the same version:

{
[...]
   "overall": {
       "ceph version 15.2.15 (2dfb18841cfecc2f7eb7eb2afd65986ca4d95985) octopus (stable)": 34
   }
}

I just observed another 3 scrub errors. Strangely they never see to have occurred on the same pgs again. I shall be running another deep scrub on those OSD again to narrow this down.



But I am somewhat suspecting this to be a potential issue with the OMAP validation part of the scrubbing. For RADOSGW there are large OMAP structures with lots of movement. And the issues only are with the metadata pools.




Regards


Christian
_______________________________________________
ceph-users mailing list -- ceph-users@ceph.io
To unsubscribe send an email to ceph-users-le...@ceph.io

Reply via email to