A colleague of mine suggested to create a coredump when the MDS has become stale and then inspect it with gdb. But if you think it’s more promising to increase the buffer, or maybe it’s quicker to test, then do that first.

Zitat von Frank Schilder <fr...@dtu.dk>:

which is 3758096384. I'm not even sure what the unit is, probably bytes?

Sorry, it is bytes. Our items are about 100b on average, that's how we observe approximately 37462448 executions of purge_stale_snap_data until the queue is filled up.

Best regards,
=================
Frank Schilder
AIT Risø Campus
Bygning 109, rum S14

________________________________________
From: Frank Schilder <fr...@dtu.dk>
Sent: Monday, January 20, 2025 1:51 PM
To: Eugen Block
Cc: ceph-users@ceph.io
Subject: [ceph-users] Re: MDS hung in purge_stale_snap_data after populating cache

which is 3758096384. I'm not even sure what the unit is, probably bytes?

As far as I understand the unit is "list items". They can have variable length. On our system about 400G are allocated while filling up the bufferlist.

Best regards,
=================
Frank Schilder
AIT Risø Campus
Bygning 109, rum S14


_______________________________________________
ceph-users mailing list -- ceph-users@ceph.io
To unsubscribe send an email to ceph-users-le...@ceph.io

Reply via email to