Hello, We are deploying a flink application cluster in kubernetes, 2 pods one for the JM and the other for the TM.
The problem is when we launch load tests we see that task manager memory usage increases, after the tests are finished and flink stop processing data the memory usage never comes down where it was before, eventually when we launch tests again and again the memory of TM continues to grow until it reaches the memory resource limit specified in the container templates and it get killed because of OOM. Has anyone faced the same issue and what is the best way to investigate this error in order to know the root cause of why the memory usage of the TM never comes down when flink finishes processing. FLink version is 1.16.0. Thanks,