[prometheus-users] How to effectively monitor abnormal occurrences of CPU and memory usage in containers using PrometheusQL expressions?

2024-05-29 Thread 'yuan xiao' via Prometheus Users
Scenario 1: Monitoring sudden spikes in container memory usage, such as going from 300M in one minute to 11G in the next minute. How can we achieve effective monitoring and alerting in such cases? Scenario 2: Monitoring continuous memory growth of containers, such as a sustained increase in cont

[prometheus-users] The impact of reducing the Prometheus scrape interval

2024-05-29 Thread 'yuan xiao' via Prometheus Users
What are the impacts of modifying the scraping interval of Prometheus from the default 1 minute to 30 seconds? For example, how should monitoring metrics and storage be calculated? Will they become twice the original? Provide the approximate calculation formulas and methods for Prometheus storag