I would also assume that a failing disk might cause this. Thanks for the update.

Quoting Dietmar Rieder <dietmar.rie...@i-med.ac.at>:

Hi,

Yes, I can confirm this. Moreover, we realized that one of the OSDs in that PG (199) had a disk that was about to fail, and the kernel reported "Sense Key : Medium Error".
So maybe this is somehow related.

Thanks
   Dietmar
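
For anyone who wants to check for the same symptom, here is a minimal sketch that counts kernel log lines containing the "Sense Key : Medium Error" string quoted above. The function name count_medium_errors is made up for illustration, and the exact kernel message layout may differ between drivers and kernel versions, so treat the pattern as an assumption.

```shell
# Hypothetical helper (not part of Ceph or any distro tooling):
# counts kernel log lines on stdin that report a SCSI medium error.
# The pattern is taken from the message quoted in this thread and
# tolerates varying whitespace around the colon.
count_medium_errors() {
    # grep -c prints the number of matching lines; "|| true" keeps the
    # exit status at 0 when there are no matches.
    grep -ciE 'Sense Key *: *Medium Error' || true
}

# Usage (reading the kernel ring buffer may require root):
#   dmesg | count_medium_errors
```

A non-zero count on the host of one of the acting OSDs would be a reason to look at that disk more closely.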

On 7/10/25 09:12, Eugen Block wrote:
Hi,

every thread I found so far mentioned that this resolved itself after some time. Maybe you can confirm?

Quoting Dietmar Rieder <dietmar.rie...@i-med.ac.at>:

Hi,

our ceph cluster reported an inconsistent pg, so we set it to repair:

# ceph pg repair 4.b10

# ceph health detail
HEALTH_ERR 1 scrub errors; Possible data damage: 1 pg inconsistent
[ERR] OSD_SCRUB_ERRORS: 1 scrub errors
[ERR] PG_DAMAGED: Possible data damage: 1 pg inconsistent
    pg 4.b10 is active+clean+scrubbing+deep+inconsistent+repair, acting [109,10,214,148,60,58,199,129,326,165,35]


# ceph pg stat
6401 pgs: 1 active+clean+scrubbing+deep+inconsistent+repair, 147 active+clean+scrubbing, 146 active+clean+scrubbing+deep, 6107 active+clean; 1.1 PiB data, 1.5 PiB used, 2.4 PiB / 3.9 PiB avail; 19 MiB/s rd, 11 MiB/s wr, 14 op/s

When checking the OSD logs on the host for osd.109, we see the following messages repeated every few seconds (I am not aware of having seen them before when we used pg repair on other occasions):

[...]
2025-07-04T09:08:05.076+0000 7f3a063a7700  0 log_channel(cluster) log [INF] : osd.109 pg 4.b10s0 Deep scrub errors, upgrading scrub to deep-scrub
2025-07-04T09:08:05.076+0000 7f39f1b7e700  0 log_channel(cluster) log [DBG] : 4.b10 repair starts
2025-07-04T09:08:06.123+0000 7f3a063a7700  0 log_channel(cluster) log [INF] : osd.109 pg 4.b10s0 Deep scrub errors, upgrading scrub to deep-scrub
2025-07-04T09:08:06.123+0000 7f39f1b7e700  0 log_channel(cluster) log [DBG] : 4.b10 repair starts
2025-07-04T09:08:10.173+0000 7f3a063a7700  0 log_channel(cluster) log [INF] : osd.109 pg 4.b10s0 Deep scrub errors, upgrading scrub to deep-scrub
2025-07-04T09:08:10.196+0000 7f39f1b7e700  0 log_channel(cluster) log [DBG] : 4.b10 repair starts
2025-07-04T09:08:12.162+0000 7f3a063a7700  0 log_channel(cluster) log [INF] : osd.109 pg 4.b10s0 Deep scrub errors, upgrading scrub to deep-scrub
2025-07-04T09:08:12.162+0000 7f39f1b7e700  0 log_channel(cluster) log [DBG] : 4.b10 repair starts
[...]
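
To get a rough idea of how often the repair restarts, the "repair starts" lines in an excerpt like the one above can be counted per PG. This is only a sketch: count_repair_starts is a made-up helper name, and it assumes the log format shown here, where the PG id is the third field from the end of the line.

```shell
# Hypothetical helper (not a Ceph command): reads OSD log text on stdin
# and prints "<pgid> <count>" for every PG with "repair starts" lines.
# Assumes lines end in "<pgid> repair starts", as in the excerpt above.
count_repair_starts() {
    awk '/ repair starts[[:space:]]*$/ { n[$(NF-2)]++ }
         END { for (pg in n) print pg, n[pg] }'
}

# Usage (the log path is an assumption and depends on the deployment):
#   count_repair_starts < /var/log/ceph/ceph-osd.109.log
```

If the count keeps growing by one every few seconds, the repair is being restarted rather than making progress.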


Is this something to worry about?

Best
   Dietmar


_______________________________________________
ceph-users mailing list -- ceph-users@ceph.io
To unsubscribe send an email to ceph-users-le...@ceph.io


