Hi Flink devs,

Lately, we have been looking to develop some tools to:
1. Show back-pressured operators without manual operation.
2. Provide suggestions to mitigate back pressure after checking data skew, external service RPC, etc.
3. Show back pressure history.

Could anyone share their experience with such tooling?

Also, I notice that back pressure monitoring and detection is mentioned in multiple places. Could someone help explain how these connect to each other? Maybe some of them are outdated? Thanks!
1. The official docs introduce monitoring back pressure through the web UI: https://ci.apache.org/projects/flink/flink-docs-release-1.12/ops/monitoring/back_pressure.html
2. https://flink.apache.org/2019/07/23/flink-network-stack-2.html says the outPoolUsage and inPoolUsage metrics can be used to determine back pressure.
3. The latest Flink version introduces a metric called "isBackPressured", but I didn't find any related documentation on its usage.

Best,
Lu