Hi Micah, Brian Olsen proposed a way to publish or archive the slack chat to Discourse Forum <https://discourse.org/>. It's not done yet AFAIK.
Yufei On Tue, Nov 21, 2023 at 10:47 AM Micah Kornfield <emkornfi...@gmail.com> wrote: > Slightly side topic: Are slack channels archived anywhere for offline > consumption (apologies if I missed it on the community page)? > > Thanks, > Micah > > On Tue, Nov 21, 2023 at 6:07 AM Renjie Liu <liurenjie2...@gmail.com> > wrote: > >> Thanks for sharing. >> >> On Tue, Nov 21, 2023 at 21:52 Walaa Eldin Moustafa <wa.moust...@gmail.com> >> wrote: >> >>> We met on Wednesday and created the channel #cdc-read on Iceberg Slack. >>> A summary of the meeting discussion points is there. >>> >>> Thanks, >>> Walaa. >>> >>> On Tue, Nov 21, 2023 at 8:06 AM Renjie Liu <liurenjie2...@gmail.com> >>> wrote: >>> >>>> Hi: >>>> >>>> Is there any update on this topic? >>>> >>>> On Tue, Nov 14, 2023 at 07:25 Yufei Gu <flyrain...@gmail.com> wrote: >>>> >>>>> Hi folks, >>>>> >>>>> We will discuss it this Wednesday(11/15) at 9 am PST. Feel free to >>>>> join if you are interested. >>>>> >>>>> Sync-up for Iceberg CDC View on MOR >>>>> Wednesday, November 15 · 9:00 – 10:00am >>>>> Time zone: America/Los_Angeles >>>>> Google Meet joining info >>>>> Video call link: https://meet.google.com/zef-grqu-cqy >>>>> >>>>> Yufei >>>>> >>>>> >>>>> On Mon, Nov 6, 2023 at 4:39 AM Péter Váry <peter.vary.apa...@gmail.com> >>>>> wrote: >>>>> >>>>>> Hi Team, >>>>>> >>>>>> I was thinking about the possible implementations of a streaming read >>>>>> of MOR tables from Flink. >>>>>> I was checking the Spark code, and found that the feature is also >>>>>> missing from Spark. As Yufei mentioned, the building blocks are there, >>>>>> but >>>>>> the feature is not implemented yet. >>>>>> It would be good to implement the DeletedRowsScanTask and related >>>>>> features, so this feature would be available for both Spark and Flink >>>>>> engines. >>>>>> >>>>>> Thanks, >>>>>> Peter >>>>>> >>>>>> Yufei Gu <flyrain...@gmail.com> ezt írta (időpont: 2023. nov. 3., P, >>>>>> 18:47): >>>>>> >>>>>>> Hi Pucheng, >>>>>>> >>>>>>> In short, we can reuse front-end infrastructure, including the >>>>>>> changelog view procedure and iterators. We need some work from the >>>>>>> reader >>>>>>> side, it is not a trivial one, but some essential building blocks, like >>>>>>> the >>>>>>> `_deleted` metadata column, are there already. >>>>>>> >>>>>>> To get row-level deletes, we will leverage the `_deleted` metadata >>>>>>> column for both pos deletes and eq deletes. Especially, instead of >>>>>>> emitting >>>>>>> equality deletes directly as cdc deleted rows, we resolve the eq >>>>>>> deletes to >>>>>>> actual deleted rows and emit them as CDC delete rows. For example, an eq >>>>>>> delete may delete two data rows. We will emit the 2 actual deleted >>>>>>> rows.We >>>>>>> change the design so that we emit all deleted(pos and eq) rows together >>>>>>> in >>>>>>> the same format. >>>>>>> >>>>>>> The downside is that it is expensive for certain use cases. For >>>>>>> example, it has to scan all data files to resolve global eq deletes. We >>>>>>> can >>>>>>> try to solve this by providing an option to emit eq deletes rows >>>>>>> directly >>>>>>> in the future. Please refer to >>>>>>> https://github.com/apache/iceberg/issues/3941#issuecomment-1081273709 >>>>>>> for more details. >>>>>>> >>>>>>> >>>>>>> Yufei >>>>>>> >>>>>>> >>>>>>> On Thu, Nov 2, 2023 at 9:17 PM Pucheng Yang >>>>>>> <py...@pinterest.com.invalid> wrote: >>>>>>> >>>>>>>> Feature request ticket: >>>>>>>> https://github.com/apache/iceberg/issues/8975 >>>>>>>> >>>>>>>> On Thu, Nov 2, 2023 at 9:16 PM Pucheng Yang <py...@pinterest.com> >>>>>>>> wrote: >>>>>>>> >>>>>>>>> Hi community, >>>>>>>>> >>>>>>>>> I wonder if anyone is interested in having a MOR CDC view feature? >>>>>>>>> My organization is interested in using Flink upsert (MOR) into the >>>>>>>>> Iceberg >>>>>>>>> table, but currently the MOR CDC view is not supported. >>>>>>>>> >>>>>>>>> If we were to support it, do you know how much work it will be? >>>>>>>>> How difficult will that be? Any pointers will be greatly appreciated. >>>>>>>>> >>>>>>>>> Thanks! >>>>>>>>> >>>>>>>>> Best, >>>>>>>>> Pucheng >>>>>>>>> >>>>>>>>