I have started with backporting the source API changes. Some minor conflicts to solve, will need a bit more to finish this.
On Fri, Oct 30, 2020 at 7:25 AM Tzu-Li (Gordon) Tai <tzuli...@apache.org> wrote: > @Stephan Ewen <se...@apache.org> > Are there already plans or ongoing efforts for backporting the list of > FLIP-27 changes that you posted? > > On Thu, Oct 29, 2020 at 7:08 PM Xintong Song <tonysong...@gmail.com> > wrote: > >> Hi folks, >> >> Just to provide some updates concerning the status on the >> test instabilities. >> >> Currently, we have 30 unresolved tickets labeled with `Affects Version` >> 1.11.x. >> >> https://issues.apache.org/jira/browse/FLINK-19775?filter=12348580&jql=project%20%3D%20FLINK%20AND%20resolution%20%3D%20Unresolved%20AND%20affectedVersion%20in%20(1.11.0%2C%201.11.1%2C%201.11.2%2C%201.11.3)%20AND%20labels%20%3D%20test-stability%20ORDER%20BY%20created%20DESC >> >> Among the 30 tickets, 11 of them are: >> - Have occured in the recent 3 months >> - Not confirmed to be pure testability issues >> - Not confirmed to be rare condition cases >> >> It would be nice if someone familiar with these components can take a look >> into these issues. >> >> - https://issues.apache.org/jira/browse/FLINK-17159 (ES6) >> - https://issues.apache.org/jira/browse/FLINK-17912 (Kafka) >> - https://issues.apache.org/jira/browse/FLINK-17949 (Kafka) >> ⁃ https://issues.apache.org/jira/browse/FLINK-18444 (Kafka) >> - https://issues.apache.org/jira/browse/FLINK-18634 (Kafka) >> - https://issues.apache.org/jira/browse/FLINK-18648 (Kafka) >> - https://issues.apache.org/jira/browse/FLINK-18807 (Kafka) >> - https://issues.apache.org/jira/browse/FLINK-19369 (BlobClientTest) >> - https://issues.apache.org/jira/browse/FLINK-19436 (TPCDS) >> - https://issues.apache.org/jira/browse/FLINK-19690 (Format/Parquet) >> - https://issues.apache.org/jira/browse/FLINK-19775 >> (SystemProcessingTimeServiceTest) >> >> Thank you~ >> >> Xintong Song >> >> >> >> On Thu, Oct 29, 2020 at 10:21 AM Jingsong Li <jingsongl...@gmail.com> >> wrote: >> >> > +1 to backport the FLIP-27 adjustments to 1.11.x. >> > >> > If possible, that would be great. Many people are looking forward to the >> > FLIP-27 interface, but they don't want to take the risk to upgrade to >> 1.12 >> > (And wait 1.12). After all, 1.11 is a relatively stable version. >> > >> > Best, >> > Jingsong >> > >> > On Thu, Oct 29, 2020 at 1:24 AM Stephan Ewen <se...@apache.org> wrote: >> > >> > > Thanks for starting this. >> > > >> > > +1 form my side to backport the FLIP-27 adjustments to 1.11.x. >> > > >> > > There were quite a few changes, and I think we need to cherry-pick >> them >> > all >> > > to not get some inconsistent mix of changes and many merge conflicts. >> > > I made a list below of what we need to add to "release-1.11". >> > > >> > > *#### Core Source API Changes to backport (in REVERSE order)* >> > > >> > > (Use: "git log >> > > flink-core/src/main/java/org/apache/flink/api/connector/source") >> > > >> > > commit 162c072e9265a7b6dd9d6f5459eb7974091c4c4e >> > > [FLINK-19492][core] Consolidate Source Events between Source API and >> > Split >> > > Reader API >> > > >> > > commit 428fbbad85379f71a9fbf1a1ab3e3dcfa9006436 >> > > [FLINK-19205][core] Add access to configuration and hostname in the >> > > SourceReaderContext >> > > >> > > commit 723e1790744ddc78d19c7c978442af1383f38d33 >> > > [hotfix][core] Add to Source Enumerator convenience methods to assign >> > > single split >> > > >> > > *#### Connector Base Changes to Backport (in REVERSE order)* >> > > >> > > (Use: "git log flink-connectors/flink-connector-base") >> > > >> > > commit 401f56fe9d6b0271260edf9787cdcbfe4d03874d >> > > [FLINK-19427][FLINK-19489][tests] Fix test conditions for >> > > 'SplitFetcherTest.testNotifiesWhenGoingIdleConcurrent()' >> > > >> > > commit 68c5c2ff779d82a1ff81ffaf60d8a1b283797db1 >> > > [FLINK-19448][connector base] Explicitly check for un-expected >> condition >> > > that would leave an inconsistent state >> > > >> > > commit 162c072e9265a7b6dd9d6f5459eb7974091c4c4e >> > > [FLINK-19492][core] Consolidate Source Events between Source API and >> > Split >> > > Reader API >> > > >> > > commit c1ca7a4c7c21ec8868c14cdddd43c559625b794c >> > > [refactor][tests] Move some source test utils from >> flink-connector-base >> > to >> > > flink-core >> > > >> > > commit ee5c4c211c35c70d28252363bbc8400453609977 >> > > [FLINK-19251][connectors] Avoid confusing queue handling in >> > > "SplitReader.handleSplitsChanges()" >> > > >> > > commit 5abef56b2bf85bcac786f6b16b6899b6cced7176 >> > > [FLINK-19250][connectors] Fix error propagation in connector base >> > > (SplitFetcherManager). >> > > >> > > commit 8fcca837c55a9216595ee4c03038b52747098dbb >> > > [hotfix][connectors] Improve JavaDocs for SingleThreadFetcherManager >> > > >> > > commit 4700bb5dde3303cbe98882f6beb7379425717b01 >> > > [FLINK-19225][connectors] Various small improvements to >> SourceReaderBase >> > > (part 2) >> > > >> > > commit 12261c6b7ed6478a9b9f6a69cb58246b83cab9b7 >> > > [FLINK-17393][connectors] (follow-up) Wakeup the SplitFetchers more >> > > elegantly. >> > > >> > > commit c60aaff0249bfd6b5871b7f82e03efc487a54d6b >> > > [hotfix][tests] Extend test coverage for FutureCompletingBlockingQueue >> > > >> > > commit cef8a587d7fd2fe64cc644da5ed095d82e46f631 >> > > [FLINK-19245][connectors] Set default capacity for >> > > FutureCompletingBlockingQueue. >> > > >> > > commit 4ea95782b4c6a2538153d4d16ad3f4839c7de0fb >> > > [FLINK-19223][connectors] Simplify Availability Future Model in Base >> > > Connector >> > > >> > > commit 511857049ba30c8ff0ee56da551fa4a479dc583e >> > > [FLINK-18128][connectors] Ensure idle split fetchers lead to >> availability >> > > notifications. >> > > >> > > commit a8206467af0830dcb89623ea068b5ca3b3450c92 >> > > [refactor][core] Eagerly initialize the FetchTask to support proper >> unit >> > > testing >> > > >> > > commit 3b2f54bcb437f98e6137c904045cc51072b5c06b >> > > [hotfix][tests] Move constants in SplitFetcherTest relevant to only >> one >> > > test into test method >> > > >> > > commit d7625760a75a508bf05bcddc380bb4d62ee1743e >> > > [FLINK-19225][connectors] Various small improvements to >> SourceReaderBase >> > > >> > > commit a5b0d3297748c1be47ad579a88f24df2255a8df1 >> > > [FLINK-17393][connectors] Wakeup the SplitFetchers more elegantly. >> > > >> > > commit f42a3ebc3e81a034b7221a803c153636fef34903 >> > > [FLINK-18680][connectors] Make connector base RecordsWithSplitIds more >> > > lightweight. >> > > >> > > commit e3d273de822b085183d09b275a445879ff94b350 >> > > [FLINK-19162][connectors] Add 'recycle()' to the RecordsWithSplitIds >> to >> > > support reuse of heavy objects. >> > > >> > > commit 8ebc464c2520453a70001cd712abc8dee6ee89e0 >> > > [hotfix][testing] Add a set of parameterizable testing mocks for the >> > Split >> > > Reader API >> > > >> > > commit 930a07438be1185388d7150640f294dfe2a5d378 >> > > [hotfix][connectors] Add RequestSplitEvent to 'flink-connector-base' >> > > >> > > commit e945ce8a933bc378844782f784ca473c767ca159 >> > > [refactor][connectors] Backport of the connector-base exception >> handling >> > > from the Kafka Connector Pull Request >> > > >> > > >> > > On Wed, Oct 28, 2020 at 5:49 PM Steven Wu <stevenz...@gmail.com> >> wrote: >> > > >> > > > I would love to see this FLIP-27 source interface improvement [1] >> made >> > to >> > > > 1.11.3. >> > > > >> > > > [1] https://issues.apache.org/jira/browse/FLINK-19698 >> > > > >> > > > On Wed, Oct 28, 2020 at 12:32 AM Tzu-Li (Gordon) Tai < >> > > tzuli...@apache.org> >> > > > wrote: >> > > > >> > > > > Thanks for the replies so far! >> > > > > >> > > > > Just to provide a brief update on the status of blockers for >> 1.11.3 >> > so >> > > > far: >> > > > > >> > > > > >> > > > > *PR opened, pending reviewer*- [FLINK-19717] >> > SourceReaderBase.pollNext >> > > > may >> > > > > return END_OF_INPUT if SplitReader.fetch throws ( >> > > > > https://github.com/apache/flink/pull/13776) >> > > > > >> > > > > *PR opened, reviewed + close to being merged* >> > > > > - [FLINK-19741] Timer service should skip restoring from raw keyed >> > > stream >> > > > > if it isn't the writer ( >> https://github.com/apache/flink/pull/13761) >> > > > > - [FLINK-19748] Raw keyed stream key group iterator should be >> > skipping >> > > > > unwritten key groups (https://github.com/apache/flink/pull/13772) >> > > > > >> > > > > *Merged* >> > > > > - [FLINK-19154] Application mode deletes HA data in case of >> suspended >> > > > > ZooKeeper connection >> > > > > - [FLINK-19569] Upgrade ICU4J to 67.1+ >> > > > > >> > > > > Right now as it seems, progress is mainly blocked on a reviewer >> for >> > > > > FLINK-19717. >> > > > > Meanwhile, Xintong is keeping an eye on test instabilities [1] to >> see >> > > if >> > > > > there are any fixes that should be applied to `release-1.11`. >> > > > > >> > > > > This is also a reminder, that if there are other blockers that we >> > need >> > > to >> > > > > be aware of, or a need to re-establish estimated time for getting >> > fixes >> > > > in >> > > > > and delay the RC for 1.11.3, please do let us know! >> > > > > >> > > > > Cheers, >> > > > > Gordon >> > > > > >> > > > > [1] >> > > > > >> > > > > >> > > > >> > > >> > >> https://issues.apache.org/jira/issues/?jql=project%20%3D%20FLINK%20AND%20status%20in%20(Open%2C%20%22In%20Progress%22%2C%20Reopened)%20AND%20affectedVersion%20in%20(1.11.0%2C%201.11.1%2C%201.11.2)%20AND%20labels%20%3D%20test-stability >> > > > > >> > > > > On Mon, Oct 26, 2020 at 9:43 PM Kostas Kloudas < >> kklou...@apache.org> >> > > > > wrote: >> > > > > >> > > > > > +1 for releasing Flink 1.11.3 as it contains a number of >> important >> > > > > > fixes and thanks Gordon and Xintong for volunteering. >> > > > > > >> > > > > > Cheers, >> > > > > > Kostas >> > > > > > >> > > > > > On Mon, Oct 26, 2020 at 4:37 AM Yu Li <car...@gmail.com> wrote: >> > > > > > > >> > > > > > > +1 for releasing Flink 1.11.3, and thanks Gordon and Xintong >> for >> > > > > > > volunteering as our release managers. >> > > > > > > >> > > > > > > Best Regards, >> > > > > > > Yu >> > > > > > > >> > > > > > > >> > > > > > > On Mon, 26 Oct 2020 at 09:45, Xintong Song < >> > tonysong...@gmail.com> >> > > > > > wrote: >> > > > > > > >> > > > > > > > Thanks Gordan for starting this discussion. >> > > > > > > > My pleasure to help with the release process. >> > > > > > > > >> > > > > > > > Thank you~ >> > > > > > > > >> > > > > > > > Xintong Song >> > > > > > > > >> > > > > > > > >> > > > > > > > >> > > > > > > > On Fri, Oct 23, 2020 at 11:29 PM Till Rohrmann < >> > > > trohrm...@apache.org >> > > > > > >> > > > > > > > wrote: >> > > > > > > > >> > > > > > > > > Thanks for starting this discussion Gordon. There are over >> > 100 >> > > > > issues >> > > > > > > > > which are fixed for 1.11.3. Hence +1 for a soonish 1.11.3 >> > > > release. >> > > > > > Thanks >> > > > > > > > > for volunteering as our release managers Gordon and >> Xintong! >> > > > > > > > > >> > > > > > > > > Cheers, >> > > > > > > > > Till >> > > > > > > > > >> > > > > > > > > On Fri, Oct 23, 2020 at 5:02 PM Tzu-Li (Gordon) Tai < >> > > > > > tzuli...@apache.org >> > > > > > > > > >> > > > > > > > > wrote: >> > > > > > > > > >> > > > > > > > >> Hi, >> > > > > > > > >> >> > > > > > > > >> Xintong and I would like to start a discussion for >> releasing >> > > > Flink >> > > > > > > > 1.11.3 >> > > > > > > > >> soon. >> > > > > > > > >> >> > > > > > > > >> It seems like we already have a few pressing issues that >> > needs >> > > > to >> > > > > be >> > > > > > > > >> included in a new hotfix release: >> > > > > > > > >> >> > > > > > > > >> - Heap-based timers’ restore behaviour is causing a >> > > critical >> > > > > > recovery >> > > > > > > > >> issue for StateFun [1] [2] [3]. >> > > > > > > > >> - There are several robustness issues for the FLIP-27 >> new >> > > > > source >> > > > > > API, >> > > > > > > > >> such as [4]. We already have some users using the >> FLIP-27 >> > > API >> > > > > > with >> > > > > > > > >> 1.11.x, >> > > > > > > > >> so it would be important to get those fixes in for >> 1.11.x >> > > as >> > > > > > well. >> > > > > > > > >> >> > > > > > > > >> Apart from the issues that are already marked as blocker >> for >> > > > > 1.11.3 >> > > > > > in >> > > > > > > > our >> > > > > > > > >> JIRA [5], please let us know in this thread if there is >> > > already >> > > > > > ongoing >> > > > > > > > >> work for other important fixes that we should try to >> > include. >> > > > > > > > >> >> > > > > > > > >> Xintong and I would like to volunteer for managing this >> > > release, >> > > > > and >> > > > > > > > will >> > > > > > > > >> try to communicate the priority of pending blockers over >> the >> > > > next >> > > > > > few >> > > > > > > > >> days. >> > > > > > > > >> Since the aforementioned issues are quite critical, we’d >> > like >> > > to >> > > > > aim >> > > > > > > > >> for a *feature >> > > > > > > > >> freeze by the end of next week (Oct. 30th)* and start the >> > > > release >> > > > > > voting >> > > > > > > > >> process the week after. >> > > > > > > > >> If that is too short of a notice and you might need more >> > time, >> > > > > > please >> > > > > > > > let >> > > > > > > > >> us know! >> > > > > > > > >> >> > > > > > > > >> Cheers, >> > > > > > > > >> Gordon >> > > > > > > > >> >> > > > > > > > >> [1] https://issues.apache.org/jira/browse/FLINK-19692 >> > > > > > > > >> [2] https://issues.apache.org/jira/browse/FLINK-19741 >> > > > > > > > >> [3] https://issues.apache.org/jira/browse/FLINK-19748 >> > > > > > > > >> [4] https://issues.apache.org/jira/browse/FLINK-19717 >> > > > > > > > >> [5] >> > > > > > > > >> >> > > > > > > > >> >> > > > > > > > >> > > > > > >> > > > > >> > > > >> > > >> > >> https://issues.apache.org/jira/issues/?jql=project%20%3D%20FLINK%20AND%20priority%20%3D%20Blocker%20AND%20fixVersion%20%3D%201.11.3 >> > > > > > > > >> >> > > > > > > > > >> > > > > > > > >> > > > > > >> > > > > >> > > > >> > > >> > >> > >> > -- >> > Best, Jingsong Lee >> > >> >