Piotr Nowojski created FLINK-25688: -------------------------------------- Summary: Resolve performance degradation with high parallelism when using buffer debloating Key: FLINK-25688 URL: https://issues.apache.org/jira/browse/FLINK-25688 Project: Flink Issue Type: Improvement Components: Runtime / Network Affects Versions: 1.14.3, 1.15.0 Reporter: Piotr Nowojski
As documented in FLINK-25646, currently buffer debloating in Flink, at least in the default configuration, has quite noticeable performance degradation at larger scale. For example throughput can drop by a factor of 4, or even checkpointing times can be increased. Currently it's not clear why is this happening. It looks like increasing the number of buffers per channel from the default ~2 to above 3 (for example via bumping number of floating buffers to value equal or higher then parallelism), seems to be solving this problem, at least on one cluster where buffer debloating has been tested at large scale. Further investigation is required. CC [~akalashnikov] -- This message was sent by Atlassian Jira (v8.20.1#820001)