I have a large number of containers. I learned that the following
configuration can alert when a single container goes down. How can I
configure it to monitor all containers and include the container name in the
alert when one of them goes down?
- name: containers
  rules:
  - alert: jenkins_down
    expr: absent(container_memory_usage_bytes{name="jenkins"})
    for: 30s
    labels:
      severity: critical
    annotations:
      summary: "Jenkins down"
      description: "Jenkins container is down for more than 30 seconds."
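
For comparison, here is a rough sketch of what an all-containers variant might look like. It is not a confirmed solution. Since absent() returns a series without the per-container labels, this sketch instead compares the current series against the same metric 5 minutes earlier using the offset modifier and the unless operator, so the name label is preserved and can be used in the annotations. The alert name container_down, the 5m window, and the name!="" filter are assumptions for illustration; the alert would only fire for roughly the length of the offset window after a container disappears.

```yaml
- name: containers
  rules:
  # Hypothetical rule name; not part of the original configuration.
  - alert: container_down
    # Series that existed 5 minutes ago but have no current series with the
    # same "name" label. The 5m window is an assumed value; tune as needed.
    # name!="" is assumed to exclude series that carry no container name.
    expr: |
      container_memory_usage_bytes{name!=""} offset 5m
        unless on (name)
      container_memory_usage_bytes{name!=""}
    for: 30s
    labels:
      severity: critical
    annotations:
      # $labels.name is filled in from the "name" label of the missing series.
      summary: "Container {{ $labels.name }} down"
      description: "Container {{ $labels.name }} has been missing for more than 30 seconds."
```

Matching on (name) alone means a container that was recreated under the same name with a new ID should not trigger the alert; whether that is the desired behavior depends on the setup.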