For those who would look for an answer, the fix is available in 1.18:
https://issues.apache.org/jira/browse/FLINK-31498
Proposed solution is not to request for TaskManagers if there are some
slots already pending.
On Thu, Jul 4, 2024 at 2:00 PM Alex Nitavsky wrote:
> Hello community,
>
&g
Hello Prabhu,
On your place I would check:
1. That there is no "state leak" in your job, because it seems that state
only accumulates for the job and is never cleaned, e.g. probably some timer
which cleans the state for some key is not configured correctly.
2. Probably you accumulate the state i