Hi Jincheng & Hequn Thanks for driving the releasing of 1.8.3.
I am now working on FLINK-14735. The fix avoids duplicated input checking when scheduling ALL-to-ALL connected downstream consumers with ALL input constraints. The duplicated checking can cause severe performance issues for large scale jobs. So I hope the fix could be released with 1.8.3. The fix is already merged into master, and is now in the process of backporting to 1.8. Thanks, Zhu Zhu Ufuk Celebi <u...@apache.org> 于2019年11月15日周五 下午11:54写道: > Thanks Chesnay. > > I'm also +1 to release 1.8.3 asap without the changes for the Jackson > version bump and leave those for a future release. Realistically, the > flink-shaded release will take until mid next week or end of next week. But > please correct me if you think that it should not take that long or it's OK > to block the 1.8.3 release on the flink-shaded release. > > – Ufuk > > > On Fri, Nov 15, 2019 at 2:27 PM Chesnay Schepler <ches...@apache.org> > wrote: > > > I've kicked off a discussion about the next flink-shaded release, and > > have opened PRs for adding the opt-in profile to 1.8/1.9. > > > > On 15/11/2019 13:54, Hequn Cheng wrote: > > > That's great, thank you very much! Ideally, we can kick off the release > > > vote for the first RC of 1.8.3 within next week. :) > > > > > > On Fri, Nov 15, 2019 at 8:47 PM Chesnay Schepler <ches...@apache.org> > > wrote: > > > > > >> I'm not aware of any more planned changes to flink-shaded; so we could > > >> start the release right away. > > >> > > >> On 15/11/2019 13:44, Hequn Cheng wrote: > > >>> Hi, > > >>> > > >>> @Chesnay Thanks a lot for the explanation. +1 to the opt-in approach > > for > > >>> 1.8/1.9. > > >>> @Ufuk Thank you for the nice summary. > > >>> > > >>> Looks good so far except that we need to postpone 1.8.3 a bit to > first > > >> do a > > >>> flink-shaded release. > > >>> BTW, @chesnay when would we plan to release the flink-shaded with > > >> upgraded > > >>> Jackson? > > >>> > > >>> Best, Hequn > > >>> > > >>> On Fri, Nov 15, 2019 at 7:43 PM Chesnay Schepler <ches...@apache.org > > > > >> wrote: > > >>>> One small modification: the flink-shaded upgrade does not have to be > > >>>> part of the profile; since it is only intended for internal use > anyway > > >>>> (and thus has limited exposure) we can be pretty sure this doesn't > > break > > >>>> anything. > > >>>> > > >>>> On 15/11/2019 12:23, Chesnay Schepler wrote: > > >>>>> Ufuk's summary is correct. > > >>>>> > > >>>>> There's a slight caveat in that we'd also have to bump the > > >>>>> shade-plugin to 3.1.1 since it otherwise fails on jackson, > > >>>>> but I have no concerns about this change. > > >>>>> > > >>>>> On 15/11/2019 12:19, Ufuk Celebi wrote: > > >>>>>> The opt-in approach seems reasonable to me. +1 to include the > > >>>>>> profiles in > > >>>>>> 1.8 and 1.9 without changing the default versions (including the > > >> default > > >>>>>> version of flink-shaded). > > >>>>>> > > >>>>>> As far as I can tell, the next steps would be: > > >>>>>> > > >>>>>> 1) Release flink-shaded with upgraded Jackson > > >>>>>> 2a) Bump the flink-shaded version by default in master > > >>>>>> 2b) Create opt-in profiles for 1.8 and 1.9 (the opt-in profiles > > >>>>>> should also > > >>>>>> cover the upgrade to the most recent flink-shaded version) > > >>>>>> > > >>>>>> @Chesnay: is this a correct summary? > > >>>>>> > > >>>>>> Note this would block the 1.8.3 release on step 1. As an upside, > we > > >>>>>> might > > >>>>>> get some additional feedback until the 1.10 release with these > > >>>>>> profiles in > > >>>>>> case users make use of them with 1.8/1.9. > > >>>>>> > > >>>>>> – Ufuk > > >>>>>> > > >>>>>> On Fri, Nov 15, 2019 at 12:08 PM Chesnay Schepler < > > ches...@apache.org > > >>>>>> wrote: > > >>>>>>> The opt-in approach would only be used for 1.8.3 / 1.9.2; on > master > > >>>>>>> (and > > >>>>>>> thus starting from 1.10.0) it's not opt-in. > > >>>>>>> > > >>>>>>> I have only proposed it as an opt-in because a) we usually do not > > >> bump > > >>>>>>> dependencies in bugfix releases and b) it's a short-term change > > that > > >> we > > >>>>>>> aren't allowing to mature properly. > > >>>>>>> In contrast, the 1.10 release is significantly further away, > hence > > no > > >>>>>>> opt-in. > > >>>>>>> > > >>>>>>> Hence, I'm not concerned about such kind of ugprades being more > > >> common > > >>>>>>> in the future. > > >>>>>>> > > >>>>>>> We can certainly support every jackson version that fixes these > > >>>>>>> vulnerabilities; individual modules can always use a different > > >> version > > >>>>>>> (that hopefully includes the fixes). > > >>>>>>> Ideally of course we'd only be using 1 version, but that may or > may > > >> not > > >>>>>>> be feasible. > > >>>>>>> > > >>>>>>> On 15/11/2019 04:07, Hequn Cheng wrote: > > >>>>>>>> Hi Chesnay, > > >>>>>>>> > > >>>>>>>> Great to hear that jackson-2.10.1 works well on master. Really a > > >> good > > >>>>>> job! > > >>>>>>>> - Whether backport this change to 1.8/1.9 > > >>>>>>>> I had taken a quick look at the security vulnerabilities, some > of > > >> them > > >>>>>>>> seem can lead to high-security problems, thus from my point of > > view, > > >>>>>>>> I'm in favor of adding the fix into 1.9/1.8. However, I would > like > > >> to > > >>>>>>>> trust your judgment as you are more professional at this > problem. > > >>>>>>>> > > >>>>>>>> - How to port this change to 1.8/1.9 > > >>>>>>>> I think providing an opt-in upgrade is a good idea. Another > > question > > >>>>>>>> here is whether do we plan to support multi jackson versions > that > > >> have > > >>>>>>>> eliminated the security vulnerabilities. If we only plan to > > support > > >>>>>>>> 2.10.1, I would like to make it a non-opt-in upgrade. As an > > option, > > >>>>>>>> users can downgrade the flink version if meet problems using the > > new > > >>>>>>>> version. Of course, we will try our best to make the new release > > out > > >>>>>>>> of question. > > >>>>>>>> Another concern of making it an opt-in upgrade is, it will make > > our > > >>>>>>>> build unlikely convergence as more and more build options will > be > > >>>>>>>> added when we upgrade a commonly used lib like this one. > > >>>>>>>> > > >>>>>>>> What do you think? > > >>>>>>>> > > >>>>>>>> Best, Hequn > > >>>>>>>> > > >>>>>>>> On Thu, Nov 14, 2019 at 6:00 PM Chesnay Schepler < > > >> ches...@apache.org > > >>>>>>>> <mailto:ches...@apache.org>> wrote: > > >>>>>>>> > > >>>>>>>> So here's the state of things: > > >>>>>>>> > > >>>>>>>> > > >>>>>>>> The master of flink-shaded now uses jackson 2.10.1, which > > >>>>>>>> eliminates a whole category of security vulnerabilities. > > >>>>>>>> The flink master works perfectly fine with that version; > > 1.9 > > >> will > > >>>>>>>> likely do so too and 1.8 would require a minor > adjustment. > > >>>>>>>> > > >>>>>>>> Hence, there may be value in first doing a flink-shaded > > >>>>>>>> release so > > >>>>>>>> we can eliminate these vulnerabilities in 1.8.3 and > 1.9.2 . > > >>>>>>>> > > >>>>>>>> > > >>>>>>>> As for other jackson dependencies (coming from calcite, > > kafka, > > >>>>>>>> kinesis), I ran the unit and end-to-end tests of master > > >> yesterday > > >>>>>>>> will /all /jackson dependencies set to 2.10.1, and they > > >> passed. I > > >>>>>>>> will open a PR soon-ish for making this change on master. > > >>>>>>>> > > >>>>>>>> The question now is whether we want to backport this > > change to > > >>>>>>>> 1.8/1.9 . > > >>>>>>>> Some code paths /may /not be covered by our tests, and > > >> transitive > > >>>>>>>> jackson users /might /run into issues. > > >>>>>>>> Alternatively, we could set this up as an opt-in upgrade, > > by > > >>>>>>>> adding a separate profile that bumps the versions. This > > would > > >>>>>>>> present users/providers who are concerned about the > > >>>>>>>> vulnerabilities an easy workaround, at the risk of /some > > >> /things > > >>>>>>>> /maybe /not working. > > >>>>>>>> > > >>>>>>>> On 14/11/2019 03:16, Hequn Cheng wrote: > > >>>>>>>>> Hi Chesnay, Jincheng > > >>>>>>>>> > > >>>>>>>>> Sure, I think it's good to have these fixes. > > >>>>>>>>> Thanks a lot for providing the information about the > > security > > >>>>>>>>> vulnerabilities! @Chesnay > > >>>>>>>>> > > >>>>>>>>> Best, Hequn > > >>>>>>>>> > > >>>>>>>>> On Thu, Nov 14, 2019 at 10:07 AM jincheng sun< > > >>>>>> sunjincheng...@gmail.com> <mailto:sunjincheng...@gmail.com> > > >>>>>>>>> wrote: > > >>>>>>>>> > > >>>>>>>>>> +1 for try to eliminate the security vulnerabilities. > > Great > > >>>>>> thanks for > > >>>>>>>>>> doing this important work, Chesnay! > > >>>>>>>>>> What do you think Hequn ? > > >>>>>>>>>> > > >>>>>>>>>> Best, > > >>>>>>>>>> Jincheng > > >>>>>>>>>> > > >>>>>>>>>> Chesnay Schepler<ches...@apache.org> > > >>>>>>>>>> <mailto:ches...@apache.org> > > >>>>>> 于2019年11月13日周三 下午5:17写道: > > >>>>>>>>>>> It would be great if you could give me a day or 2 to > > check > > >> how > > >>>>>> easy it > > >>>>>>>>>>> would be to bump the various jackson dependencies to > > >>>>>>>>>>> eliminate a > > >>>>>> few > > >>>>>>>>>>> security vulnerabilities. > > >>>>>>>>>>> > > >>>>>>>>>>> On 09/11/2019 05:10, jincheng sun wrote: > > >>>>>>>>>>>> Hi Flink devs, > > >>>>>>>>>>>> > > >>>>>>>>>>>> It has been more than 2 months since the 1.8.2 > > released. > > >> So, > > >>>>>> What do > > >>>>>>>>>> you > > >>>>>>>>>>>> think about releasing Flink 1.8.3 soon? > > >>>>>>>>>>>> > > >>>>>>>>>>>> We already have many important bug fixes in the > > >> release-1.8 > > >>>>>> branch (29 > > >>>>>>>>>>>> resolved issues). > > >>>>>>>>>>>> > > >>>>>>>>>>>> Most notable fixes are: > > >>>>>>>>>>>> > > >>>>>>>>>>>> - FLINK-14010 Dispatcher & JobManagers don't give up > > >>>>>>>>>>>> leadership > > >>>>>> when AM > > >>>>>>>>>>> is > > >>>>>>>>>>>> shut down > > >>>>>>>>>>>> - FLINK-14315 NPE with > JobMaster.disconnectTaskManager > > >>>>>>>>>>>> - FLINK-12848 Method equals() in RowTypeInfo should > > >> consider > > >>>>>>>>>> fieldsNames > > >>>>>>>>>>>> - FLINK-12342 Yarn Resource Manager Acquires Too Many > > >>>>>>>>>>>> Containers > > >>>>>>>>>>>> - FLINK-14589 Redundant slot requests with the same > > >>>>>> AllocationID leads > > >>>>>>>>>> to > > >>>>>>>>>>>> inconsistent slot table > > >>>>>>>>>>>> > > >>>>>>>>>>>> Furthermore, the following critical issues is in > > progress, > > >>>>>> maybe we can > > >>>>>>>>>>>> wait for it if it is not too much effort. > > >>>>>>>>>>>> > > >>>>>>>>>>>> - FLINK-13184 Starting a TaskExecutor blocks the > > >>>>>> YarnResourceManager's > > >>>>>>>>>>> main > > >>>>>>>>>>>> thread > > >>>>>>>>>>>> > > >>>>>>>>>>>> Please let me know what you think? > > >>>>>>>>>>>> > > >>>>>>>>>>>> Best, > > >>>>>>>>>>>> Jincheng > > >>>>>>>>>>>> > > >> > > > > >