Hi Mohamad,
>> I'm currently chewing on an issue regarding "slow requests are blocked". I'd
>> like to identify the OSD that is causing those events
>> once the cluster is back to HEALTH_OK (as I have no monitoring yet that
>> would get this info in realtime).
>>
>> Collecting this information could help identify aging disks if you were able
>> to accumulate and analyze which OSD had blocking
>> requests in the past and how often those events occur.
>>
>> My research so far let's me think that this information is only available as
>> long as the requests are actually blocked. Is this
>> correct?
>
> I think this is what you're looking for:
>
> $> ceph daemon osd.X dump_historic_slow_ops
>
> which gives you recent slow operations, as opposed to
>
> $> ceph daemon osd.X dump_blocked_ops
>
> which returns current blocked operations. You can also add a filter to
> those commands.
Thanks for these commands. I'll have a look into those. If I understand these
correctly it means that I need to run these at each
server for each OSD instead of at a central location, is that correct?
Regards,
Uwe
_______________________________________________
ceph-users mailing list
[email protected]
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com