Hi everyone, It seems we have no more comments. So I would like to start a vote tomorrow if there are no further things to discuss.
Please let me know if you have any concerns, thanks! Best, Peter On Thu, Oct 5, 2023, 12:53 Gabor Somogyi <gabor.g.somo...@gmail.com> wrote: > Thanks for the efforts Peter! > > I've just analyzed it through and I think it's useful feature. > > +1 from my side. > > G > > > On Thu, Oct 5, 2023 at 12:35 PM Péter Váry <peter.vary.apa...@gmail.com> > wrote: > > > For the record, after the rename, the new FLIP link is: > > > > > https://cwiki.apache.org/confluence/display/FLINK/FLIP-371%3A+Provide+initialization+context+for+Committer+creation+in+TwoPhaseCommittingSink > > > > Thanks, > > Peter > > > > Péter Váry <peter.vary.apa...@gmail.com> ezt írta (időpont: 2023. okt. > 5., > > Cs, 11:02): > > > > > Thanks Gordon for the comments! > > > > > > 1. I have changed the FLIP name to the one proposed by you. > > > 2. In the Iceberg sink we need access only to the Flink metrics. We > do > > > not specifically need the job ID in the Committer after the SinkV2 > > > migration (more about that later). This is the reason why I have > > stated in > > > the FLIP, that *"We should also provide similar context information > as > > > we do in the Writer’s case, which should be discussed further on the > > > mailing list."*. What I did is: I have just copied most of the > > > org.apache.flink.api.connector.sink2.InitContext [1] properties to > the > > > CommitterInitContext and removed the ones which seemed excessive to > > me. > > > The API in the FLIP is only a draft, and I am open to any > suggestions. > > > > > > In the new Iceberg Sink we need unique ids for the Committables, but we > > > generate them in the SinkV2Aggregator [2] which is placed into the > > > PreCommitTopology. The aggregator has access to the job ID, operator ID > > and > > > checkpoint ID. So no new info is needed on the Committer side there. > > > > > > Thanks, > > > Peter > > > > > > - [1] > > > > > > https://github.com/apache/flink/blob/cd95b560d0c11a64b42bf6b98107314d32a4de86/flink-core/src/main/java/org/apache/flink/api/connector/sink2/Sink.java#L95-L131 > > > - [2] > > > > > > https://github.com/apache/iceberg/pull/8653/files#diff-bfd6a564485ec60b2d53f7aa800451548d4713c5f797e76bcff95d2c8ae05ed1R77-R81 > > > > > > Tzu-Li (Gordon) Tai <tzuli...@apache.org> ezt írta (időpont: 2023. > okt. > > > 5., Cs, 8:16): > > > > > >> Thanks Peter for starting the FLIP. > > >> > > >> Overall, this seems pretty straightforward and overdue, +1. > > >> > > >> Two quick question / comments: > > >> > > >> 1. Can you rename the FLIP to something less generic? Perhaps > > "Provide > > >> initialization context for Committer creation in > > >> TwoPhaseCommittingSink"? > > >> 2. Can you describe why the job ID is needed to be exposed? > Although > > >> it's out of scope for this FLIP, I'm wondering if there's a need to > > do > > >> the > > >> same for the sink writer InitContext. > > >> > > >> Thanks, > > >> Gordon > > >> > > >> On Wed, Oct 4, 2023 at 11:20 AM Martijn Visser < > > martijnvis...@apache.org> > > >> wrote: > > >> > > >> > Hi all, > > >> > > > >> > Peter, Marton, Gordon and I had an offline sync on SinkV2 and I'm > > >> > happy with this first FLIP on the topic. +1 > > >> > > > >> > Best regards, > > >> > > > >> > Martijn > > >> > > > >> > On Wed, Oct 4, 2023 at 5:48 PM Márton Balassi < > > balassi.mar...@gmail.com > > >> > > > >> > wrote: > > >> > > > > >> > > Thanks, Peter. I agree that this is needed for Iceberg and > > beneficial > > >> for > > >> > > other connectors too. > > >> > > > > >> > > +1 > > >> > > > > >> > > On Wed, Oct 4, 2023 at 3:56 PM Péter Váry < > > >> peter.vary.apa...@gmail.com> > > >> > > wrote: > > >> > > > > >> > > > Hi Team, > > >> > > > > > >> > > > In my previous email[1] I have described our challenges > migrating > > >> the > > >> > > > existing Iceberg SinkFunction based implementation, to the new > > >> SinkV2 > > >> > based > > >> > > > implementation. > > >> > > > > > >> > > > As a result of the discussion around that topic, I have created > > the > > >> > first > > >> > > > [2] of the FLIP-s addressing the missing features there. > > >> > > > > > >> > > > The main goal of the FLIP-371 is to extend the currently > existing > > >> > Committer > > >> > > > API by providing an initial context on Committer creation. This > > >> context > > >> > > > will contain - among other, less interesting data - > > >> > > > the SinkCommitterMetricGroup which could be used to store the > > >> generic > > >> > > > commit related metrics, and also provide a way for the Committer > > to > > >> > publish > > >> > > > its own metrics. > > >> > > > > > >> > > > The feature has already been requested through FLINK-25857 [3]. > > >> > > > > > >> > > > Please use this thread to discuss the FLIP related questions, > > >> > proposals, > > >> > > > feedback. > > >> > > > > > >> > > > Thanks, > > >> > > > Peter > > >> > > > > > >> > > > - [1] > > >> https://lists.apache.org/thread/h3kg7jcgjstpvwlhnofq093vk93ylgsn > > >> > > > - [2] > > >> > > > > > >> > > > > > >> > > > >> > > > https://cwiki.apache.org/confluence/display/FLINK/FLIP-371%3A+SinkV2+Committer+imporvements > > >> > > > - [3] https://issues.apache.org/jira/browse/FLINK-25857 > > >> > > > > > >> > > > >> > > > > > >