Re: Job gets stuck when using kafka transactions and eventually crashes

2023-01-25 Thread Martijn Visser
If data flows normally, watermarks will progress to downstream operators. If you have no records flowing in a partition of a stream and you don't have an idleness configured, that partition will hold back the progress of watermarks. Watermarks are essential for the checkpointing mechanism, which ul

Re: Job gets stuck when using kafka transactions and eventually crashes

2023-01-25 Thread Vishal Surana
Can you elaborate a bit more? While idleness is not what we’re seeing now, it could perhaps be an issue later on. What about a certain partition going idle will result in state buildup? Thanks, Vishal On 25 Jan 2023 at 9:14 PM +0530, Martijn Visser , wrote: > Hi Vishal, > > Could idleness be an

Re: Job gets stuck when using kafka transactions and eventually crashes

2023-01-25 Thread Martijn Visser
Hi Vishal, Could idleness be an issue? I could see that if idleness occurs and the Kafka Source not going in an idle state, that more internal state (to commit Kafka transactions) can build up over time that ultimately causes an out of memory problem. See https://nightlies.apache.org/flink/flink-d

Re: Job gets stuck when using kafka transactions and eventually crashes

2023-01-23 Thread Vishal Surana
Could it be that link is unable to commit offsets to Kafka? I know that blinks checkpoint mechanism isn’t tied to its ability to commit offset but at the same time, we’ve seen that the job can take hours to commit offsets while checkpoints go through successfully during that period. But with Kaf