Re: Mysql CDC: support for parallelism

2024-10-29 Thread Michael Marino
Hi Hang, Thanks for the response. Unfortunately, this doesn't work for me, it still blocks and does not propagate the watermark downstream. At the moment, the only solution for me is to do set table.exec.source.idle-timeout. Thanks, Mike On Mon, Oct 28, 2024 at 9:29 AM Hang Ruan wrote: > Hi,

Re: Mysql CDC: support for parallelism

2024-10-28 Thread Hang Ruan
Hi, Michael. MySQL CDC source has the parallelism 1 when reading binlog events to keep their order. And other subtasks will stop reading data. For your question, you could set the option 'scan.incremental.close-idle-reader.enabled'='true'[1] in your cdc table to let the source close the idle subta

Re: Mysql CDC: support for parallelism

2024-10-24 Thread Michael Marino
Let me quickly follow up on this: - I missed noting that I *was* setting the server-id value to a range. - I just realized that if I do a hard restart and start without a snapshot, then this works, i.e. the multiple sub-tasks receive events and the watermarking/processing progresses. This is, howe

Mysql CDC: support for parallelism

2024-10-24 Thread Michael Marino
Hey all, We are working to scale one of our Flink Jobs (using Table API mostly, some DataStream) where we are using a MySQL CDC table as a source for enrichment. What I've noticed is that, when I increase the parallelism of the job (e.g. to 2), the CDC table source has 2 tasks, but only one of th