Since you're using Prometheus I would recommend setting up a PrometheusReporter as described in the metrics documentation and scrape each JM/TM individually. Scraping through the REST API is more expensive and you loose out on a lot of features.
The REST API calls are primarily aimed at the WebUI.

Regardless, as of right now I would doubt that this is a Flink issue, and would recommend heading to the prometheus mailing lists.

On 22/03/2019 17:55, Vishal Santoshi wrote:
A simple query, Is the route to /metrics execute an access to an in memory registry of stats collected OR does it contend with access from JM or do expensive access or computation. I see occasionally our Prometheus scrape fail with the error pasted below. We have had the scrapper do much more elaborate scrape on other systems we maintain so was curious. The server did not have any logs related to the exception and the scraper is . ServiceMonitor from k8s and of course these TMs are hosted no k8s as well

Get http://10.246.254.84:9610/metrics <http://10.246.254.84:9610/metrics>: EOF


Reply via email to