On 17-12-14 05:31 PM, David Turner wrote:
I've tracked this in a much more manual way.  I would grab a random subset [..]

This was all on a Hammer cluster.  The change that moved the snap trimming queues into the main OSD thread made our use case unviable on Jewel until later Jewel changes that landed after I left.  It's exciting that this will actually be a reportable value from the cluster.

Sorry that this story doesn't really answer your question, except to say that people aware of this problem likely have a workaround for it.  However, I'm certain that far more clusters are impacted by this than are aware of it, and being able to see it quickly would be a real help when troubleshooting.  Backporting would be nice.  I run a few Jewel clusters that host some VMs, and it would be nice to see how well they handle snap trimming, but they do far less snapshotting, so it's less critical for them.

Thanks for your response; it pretty much confirms what I thought:
- users aware of the issue have their own hacks (a rough sketch of one possible approach is below) that don't need to be efficient or convenient;
- users unaware of the issue are, well, unaware, and at risk of serious service disruption once disk space is all used up.
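For context, a minimal sketch of what such a manual check might look like (purely illustrative, not David's actual procedure): it assumes the `ceph` CLI is in PATH, that `ceph pg <pgid> query` exposes a snap_trimq field, and that the JSON layout matches your release; adjust the field lookups as needed.

#!/usr/bin/env python
# Purely illustrative: sample a few random PGs and print their snap trim queues.
# Assumes the `ceph` CLI is available and that `ceph pg <pgid> query` exposes a
# "snap_trimq" field; the location and format of that field vary by release.

import json
import random
import subprocess

SAMPLE_SIZE = 20

def ceph_json(*args):
    """Run a ceph command with JSON output and parse the result."""
    out = subprocess.check_output(["ceph"] + list(args) + ["--format", "json"])
    return json.loads(out)

# Collect all PG ids; the dump layout differs between releases, so be defensive.
dump = ceph_json("pg", "dump", "pgs_brief")
pg_stats = dump if isinstance(dump, list) else dump.get("pg_stats", [])
pgids = [p["pgid"] for p in pg_stats]

for pgid in random.sample(pgids, min(SAMPLE_SIZE, len(pgids))):
    q = ceph_json("pg", pgid, "query")
    # snap_trimq is usually an interval-set string such as "[]" or "[5~3]".
    trimq = q.get("snap_trimq", q.get("info", {}).get("snap_trimq", "n/a"))
    print("%s snap_trimq=%s" % (pgid, trimq))

Sampling only a subset of PGs keeps the query load negligible while still giving a rough picture of how far behind trimming is, which is presumably why a random subset is enough for this kind of hack.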

Hopefully it'll be convincing enough for devs. ;)

--
Piotr Dałek
piotr.da...@corp.ovh.com
https://www.ovh.com/us/
_______________________________________________
ceph-users mailing list
ceph-users@lists.ceph.com
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
