[ceph-users] Re: monitoring drives

2022-10-18 Thread Kai Stian Olstad
On 17.10.2022 12:52, Ernesto Puerta wrote: - Ceph already exposes SMART-based health-checks, metrics and alerts from the devicehealth/diskprediction modules . I find this kind of high-level monitoring more di

[ceph-users] Re: monitoring drives

2022-10-17 Thread Ernesto Puerta
__ > From: Wyll Ingersoll > Sent: Friday, October 14, 2022 10:48 AM > To: Konstantin Shalygin; John Petrini > Cc: Marc; Paul Mezzanini; ceph-users > Subject: [ceph-users] Re: monitoring drives > > Check twice before you click! This email originated from

[ceph-users] Re: monitoring drives

2022-10-14 Thread Fox, Kevin M
users Subject: [ceph-users] Re: monitoring drives Check twice before you click! This email originated from outside PNNL. This looks very useful. Has anyone created a grafana dashboard that will display the collected data ? From: Konstantin Shalygin Sent: Friday, Oc

[ceph-users] Re: monitoring drives

2022-10-14 Thread Wyll Ingersoll
This looks very useful. Has anyone created a grafana dashboard that will display the collected data ? From: Konstantin Shalygin Sent: Friday, October 14, 2022 12:12 PM To: John Petrini Cc: Marc ; Paul Mezzanini ; ceph-users Subject: [ceph-users] Re

[ceph-users] Re: monitoring drives

2022-10-14 Thread Konstantin Shalygin
Hi, You can get this metrics, even wear level, from official smartctl_exporter [1] [1] https://github.com/prometheus-community/smartctl_exporter k Sent from my iPhone > On 14 Oct 2022, at 17:12, John Petrini wrote: > > We run a mix of Samsung and Intel SSD's, our solution was to write a > sc

[ceph-users] Re: monitoring drives

2022-10-14 Thread John Petrini
We run a mix of Samsung and Intel SSD's, our solution was to write a script that parses the output of the Samsung SSD Toolkit and Intel ISDCT CLI tools respectively. In our case, we expose those metrics using node_exporter's textfile collector for ingestion by prometheus. It's mostly the same smart

[ceph-users] Re: monitoring drives

2022-10-14 Thread Marc
> smartctl can very much read sas drives so I would look into that chain > first. I have smartd running and it does recognize the sas drives, however I have collectd is grabbing smart data and I am getting nothing from it. This is all the stuff I am getting from a sata drive # SELECT * FROM "sm

[ceph-users] Re: monitoring drives

2022-10-14 Thread Paul Mezzanini
smartctl can very much read sas drives so I would look into that chain first. Are they behind a raid controller that is masking the smart commands? As for monitoring, we run the smartd service to keep an eye on drives. More often than not I notice weird things with ceph long before smart thr