Hi Flink devs,

Lately, we want to develop some tools to:
1. Show which operator is back pressured, without manual operation
2. Provide suggestions to mitigate back pressure after checking data skew,
external service RPC, etc.
3. Show back pressure history

Could anyone share their experience with such tooling?
Also, I notice that back pressure monitoring and detection are mentioned in
multiple places. Could someone explain how these relate to each other? Some
of them may be outdated. Thanks!

1. The official docs describe monitoring back pressure through the web UI.
https://ci.apache.org/projects/flink/flink-docs-release-1.12/ops/monitoring/back_pressure.html
2. https://flink.apache.org/2019/07/23/flink-network-stack-2.html says the
outPoolUsage and inPoolUsage metrics can be used to determine back
pressure.
3. The latest Flink version introduces a metric called "isBackPressured", but
I couldn't find documentation on how to use it.
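
To make item 1 of the tool list more concrete, here is a rough sketch of the
kind of check such a tool could run on outPoolUsage samples. It is only an
illustration under stated assumptions: the 0.9 threshold, the function names,
the linear operator chain, and the sample values are all hypothetical, and
the samples are assumed to have been collected already (for example through
a metrics reporter). The heuristic treats an operator with high outPoolUsage
as back pressured and takes the first operator downstream of it that is not
back pressured as the likely bottleneck.

```python
from typing import Dict, List, Optional

# Hypothetical cutoff: what counts as "high" usage is an assumption,
# not a Flink default.
HIGH_USAGE = 0.9


def backpressured_operators(
    out_pool_usage: Dict[str, float], threshold: float = HIGH_USAGE
) -> List[str]:
    """Return the operators whose outPoolUsage sample is at or above the threshold."""
    return [name for name, usage in out_pool_usage.items() if usage >= threshold]


def likely_bottleneck(
    chain: List[str], out_pool_usage: Dict[str, float], threshold: float = HIGH_USAGE
) -> Optional[str]:
    """Find a bottleneck candidate in a linear chain listed from source to sink.

    Returns the first operator that sits directly downstream of a back
    pressured operator but is not back pressured itself, or None.
    """
    for upstream, downstream in zip(chain, chain[1:]):
        upstream_blocked = out_pool_usage[upstream] >= threshold
        downstream_blocked = out_pool_usage[downstream] >= threshold
        if upstream_blocked and not downstream_blocked:
            return downstream
    return None


# Made-up sample values for a three-operator job.
chain = ["source", "map", "sink"]
samples = {"source": 1.0, "map": 0.95, "sink": 0.0}

blocked = backpressured_operators(samples)
bottleneck = likely_bottleneck(chain, samples)
print(blocked, bottleneck)
```

Storing the per-operator result of each run with a timestamp would also
cover item 3 (back pressure history).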

Best
Lu
