@Robert We can workaround the snapshot limit issue fairly easily; this limit is imposed per version, so if we modify the version to include the commit hash this limit does not apply. This should also make it easier to work with from the Flink side because a commit hash is easier to copy&paste than figuring out which timestamp you need.

@Till Yes, the release process involves too many manual steps, and I don't want to spend time on it unless I'm sure we don't need another one. Many of these cannot be automated AFAIK, like the book-keeping at reporter.apache.org, preparing and handling the voting threads, preparing the flink-web PR, managing the snapshot repository, and so on.

As for removing the SNAPSHOT version, we can enable an enforcer rule to forbid snapshot dependencies. It's a good safeguard in any case. We can furthermore automate the removal, and do it as part of the release branch creation.

On 4/13/2021 10:47 AM, Till Rohrmann wrote:
Thanks for creating this proposal Chesnay. I do understand the problem you want to fix.

What I am wondering is why we don't release flink-shaded more often. Does the release process cause too much overhead? If this is the case, then we could look into what is causing the overhead and whether we can improve the situation. Concerning the noise, I personally don't see it as a problem.

My main concern is that it can easily slip our minds to change the flink-shaded SNAPSHOT version to a non SNAPSHOT version and that it introduces another manual step. If we forget to change the version, then the Flink release does not build against a stable set of dependencies. Moreover, I also second Robert's concern that a single commit to flink-shaded can then break downstream projects (Flink in this case) if we rely on the SNAPSHOT builds. Having to scan poms for some references sounds like an indicator that this might not be the most straight forward approach.

Cheers,
Till

On Tue, Apr 13, 2021 at 9:26 AM Robert Metzger <rmetz...@apache.org <mailto:rmetz...@apache.org>> wrote:

    Thanks a lot for your responses.

    I didn't know that you can explicitly refer to the timestamped
    snapshots of
    the artifacts. The limitation to the last 2 snapshots means that a
    push to
    flink-shaded can break our main CI? This sounds very fragile to
    me, given
    that the setup itself is probably a bit uncommon and difficult to
    understand.

    Maybe we should add an automated check to flink-shaded that warns
    if a PR
    would break Flink's CI? (by checking out flink and scanning the
    poms for
    references to a timestamp-to-be-deleted)
    Or we ask Infra to keep more than two snapshots for flink-shaded?



    On Mon, Apr 12, 2021 at 4:41 PM Chesnay Schepler
    <ches...@apache.org <mailto:ches...@apache.org>> wrote:

    > a) yes.
    > b) maven by default adds a timestamp to snapshot artifacts that
    we can
    > use. The apache repository retains the last 2 snapshots, so we
    do need
    > to keep things in sync a fair bit, but there are rarely commits
    made in
    > flink-shaded that I don't think this will be a problem.
    > c) a-SNAPSHOT-uniquesuffix => a.0
    >
    > On 4/12/2021 3:07 PM, Robert Metzger wrote:
    > > Thanks a lot for your proposal, I'm generally open to the idea
    > >
    > > I have a few questions:
    > > a) Does this mean that we are changing flink-shaded to deploy
    snapshot
    > > artifacts to Apache's snapshot maven repository, and change
    Flink's
    > parent
    > > pom to point to this snapshot repo?
    > > b) How do you plan to generate the unique SNAPSHOT version on
    CI? Will we
    > > increment the version on every push to flink-shaded:master ?
    > > c) How do the unique SNAPSHOT versions relate to the final release
    > versions?
    > >
    > >
    > >
    > >
    > > On Mon, Apr 12, 2021 at 1:48 PM Konstantin Knauf
    <kna...@apache.org <mailto:kna...@apache.org>>
    > wrote:
    > >
    > >> Sounds good. +1
    > >>
    > >> On Mon, Apr 12, 2021 at 1:23 PM Chesnay Schepler
    <ches...@apache.org <mailto:ches...@apache.org>>
    > >> wrote:
    > >>
    > >>> Hello all,
    > >>>
    > >>> I would like to propose a change in how the Flink master
    interacts with
    > >>> Flink-shaded.
    > >>>
    > >>> TL;DR: Release snapshot artifacts for flink-shaded, and have
    the Flink
    > >>> master rely on specific snapshot versions for earlier
    dependency bumps.
    > >>>
    > >>>
    > >>> Aa a project we have come to the general conclusion that
    dependencies
    > >>> should be bumped as early in the release cycle as possible.
    This both
    > >>> prevents cases where some undefined amount of work is still
    waiting for
    > >>> as when we want to release the next version (working against
    the goal
    > of
    > >>> always being in a releasable state), and it gives us more
    time to
    > >>> evaluate the stability and performance of system. Finally it
    gives us
    > >>> ample time to look for alternatives if an issue is found.
    > >>>
    > >>> Currently, this conclusion is at odds with how we handle
    flink-shaded.
    > >>> Flink has always relied on flink-shaded artifacts that went
    through a
    > >>> proper release cycle. However, since we want to create as
    few releases
    > >>> as possible due to the overhead/noise/etc., flink-shaded
    releases are
    > >>> typically relegated to the end of the release cycle.
    > >>> This is particularly troublesome since flink-shaded
    dependencies are
    > >>> used in the core of Flink, and hence usage of them cannot be
    avoided.
    > >>>
    > >>> As a compromise between these 2 goals I propose the following:
    > >>> - we deploy SNAPSHOT artifacts for flink-shaded for every
    change made
    > >>> - every deployed artifact has a unique version, that is
    automatically
    > >>> set via maven (=> no overhead on our side)
    > >>> - once such an artifact is released we update the Flink
    dependency to
    > >>> point to this _specific_ flink-shaded snapshot artifact
    > >>>       - to be clear, this is a manual step, which implies
    that things
    > >>> cannot break all of a sudden because something was pushed to
    > flink-shaded
    > >>> - once the Flink release cycle ends, we publish a proper
    flink-shaded
    > >>> release, and change the Flink dependency in the release branch
    > >> accordingly
    > >>> This should give us the best of both worlds: We have as few
    releases as
    > >>> necessary (at most 1 per Flink release cycle), but can
    update the
    > >>> dependencies in Flink as soon as possible.
    > >>> Furthermore, this can also be considered a test run for how
    multiple
    > >>> repos with the same release cycle could be developed in sync
    with each
    > >>> other.
    > >>>
    > >>> Let me know what you think.
    > >>>
    > >>> Regards,
    > >>>
    > >>> Chesnay
    > >>>
    > >>>
    > >> --
    > >>
    > >> Konstantin Knauf
    > >>
    > >> https://twitter.com/snntrable <https://twitter.com/snntrable>
    > >>
    > >> https://github.com/knaufk <https://github.com/knaufk>
    > >>
    >
    >


Reply via email to