Hi Abhi,
> We see that even when all the Taskslots of that particular operator are
stuck in an INITIALISING state
Can you include the stack trace of these threads so that we can understand
what the operators are stuck on INITIALISING?
Regards
Keith
On Thu, May 9, 2024 at 6:58 AM Abhi Sagar Khat
Hi Biao,
Thank you for your response. We have tried looking into Thread dumps of
Task Managers before but that's not helping our case.
We see that even when all the Taskslots of that particular operator are
stuck in an INITIALISING state, many of them have already started
processing new data.
Is th
Hi Abhi,
If your case can be reproduced steadily, have your ever tried to get
the thread dump of the TM which the problematic operator resides in?
Maybe we can get more clues with the thread dump to see where the
operator is getting stuck.
Best,
Biao Geng
Abhi Sagar Khatri via user 于2024年4月30日周
Some more context: Our job graph has 5 different Tasks/operators/flink
functions of which we are seeing this issue every time in a particular
operator
We’re using Unaligned checkpoints. With aligned checkpoint we don’t see
this issue but the checkpoint duration in that case is very high and causes