Re: JdbcIO

2022-04-22 Thread Eric Berryman
Does an unbounded JdbcIO exist, or would I need to wrap the existing one in a splittable DoFn? Or maybe there is an easier way to do it? Thank you again, Eric On Wed, Apr 20, 2022, 21:59 Ahmet Altay wrote: > /cc @Pablo Estrada @John Casey > > > On Wed, Apr 20, 2022 at 6:29 PM Eric Berryman

Re: JdbcIO

2022-04-22 Thread Austin Bennett
Without getting into the specifics of your use case, it sounds like you might want to check out DebeziumIO for CDC (Change Data Capture). I think DebeziumIO can generally handle much more complex use cases than the one you seem to be describing. Some pointers/talks from last year's

[Question] Beam DropFields PTransform automatically unnesting remaining fields

2022-04-22 Thread Brian Daugavietis (LCL)
Hi, I have a question about the behavior of Beam's built-in DropFields PTransform. If you drop only some fields in a nested row, the other fields in the same nested row are automatically unnested. For example, if a Row originally has columns "A.b", "A.c" and "B.c" and if I use DropField

Re: JdbcIO

2022-04-22 Thread Alexey Romanenko
I don’t think it exists. Do you really need an unbounded pipeline, meaning that the data arrives continuously, or would it be enough to re-run a batch pipeline at a fixed interval or when triggered externally by some signal? -- Alexey > On 22 Apr 2022, at 13:40, Eric Be