+1(non-binding)
Best, Mattison On Feb 9, 2023, 04:44 +0800, Matteo Merli <matteo.me...@gmail.com>, wrote: > https://github.com/apache/pulsar/issues/15966 > > ---------------------------------------------------------------------------------------- > > > ## Motivation > > In PIP-47 ( > https://github.com/apache/pulsar/wiki/PIP-47:-Time-Based-Release-Plan), we > have adopted a time-based release plan. This was the first attempt at > establishing a new principle on how releases should b > > The main two benefits of this approach have been: > > 1. Clarity for users and developers on when to expect a release > 2. Breaking a hard relationship between feature and release: a particular > feature will be included in the release if it is completed in time. > Otherwise, it will be bubbled up to the next release. > > The motivation for the current proposal is to extend the existing process > to address the issues that we have seen and that were left out of the scope > of PIP-47. > > ## Summary of existing issues in the process > > ### Short maintenance cycles for releases > > Since we're doing a 3 months release cycle, we are ending with 4 releases > done per year, even though it's more close to 3 releases. > > There is a high cost to maintain a lot of old releases, backport bug fixes, > and security patches. In general, we actively support the last 3 minor > releases while continuing to develop the next release. E.g., 2.8, 2.9, and > 2.10, while 2.11 is under development. > > The result is that a user adopting a particular release is forced to > upgrade in a < 1-year timeframe to keep up to date and use a supported > release. This timeframe is too short for many users as it imposes a lot of > forced upgrades, for which they are not prepared in terms of available time > and required effort. > > ### Live Upgrade/Downgrade compatibility path > > In Pulsar, we guarantee that users have a way to do live upgrades and > downgrades with zero downtime. > > This is very powerful because it gives them the freedom to upgrade to a new > release with the assurance of being able to roll back to the previous > release in case any functional or performance regressions are encountered. > > Today, this compatibility is guaranteed across minor versions. Eg: I can do > `2.7 -> 2.8 -> 2.7` as a live upgrade. > > What is not guaranteed is to "skip" releases. E.g.: `2.7 -> 2.9` might work > or not, but it's not guaranteed. In that case an intermediated upgrade > would be required: `2.7 -> 2.8 -> 2.9`. > > The reasons for which the "skip" upgrade might not work are multiple: > 1. Incompatible upgrade of some dependency (e.g., ZooKeeper) that might > not be compatible with an older version. > 2. Adoption of a new metadata format or data format on disk. > Every time we introduce a new incompatible format change (outside of a > regular Protobuf field addition), we do it in a 2 steps way: > - In a new release, we introduce the new feature/format, disabled by > default. The new release can read both old and new formats, though it keeps > writing the old format by default. > - In a subsequent release, we change the default to the new format > > Note that this consideration is separate from the compatibility between > clients and brokers, where we ***never*** break compatibility. The oldest > available Pulsar client can still talk with the newest Pulsar broker, and > vice versa, a new client, will be perfectly fine with an older broker > (except the new features won't be working). > > ### Releases getting delayed > > Another problem we have been experiencing is that release cycles have been > stretching considerably. Part of this has been because we have been > reaching the end of the release window, preparing a candidate, and then > taking a long time to flush out all issues found at the last minute in the > new release. > > We need to ensure that we have a date set in stone to deliver the release > to users. > > ## Proposal > > The proposal to address the above issues is composed of 2 parts. > > ### 1. Establish Long Term Support releases > > We need to provide a way for users to quickly understand the expected > lifecycle timeline of a given release and for that timeline to be long > enough not to be a constant update mandate. > > At the same time, we need to ensure that we maintainers are not spending > all the time just maintaining a huge list of old releases. > > For that, we can use the established concept of "Long Term Releases" or > LTS. > > We will perform LTS releases at a fixed cadence every 18 months, and we > will keep doing regular feature releases every 3 months as we're currently > doing. > > The LTS releases will be identified by being a `.0` version. For example: > * `3.0` -> LTS > * `3.1` -> regular release > * `3.2` -> regular release > * `4.0` -> LTS > > The major version bump will not carry any special meaning in terms of "big > features" included in the release or breaking API changes. Instead, it > would simply signal the type of the release. > > #### Compatibility between releases > > It will be guaranteed to be able to do a live upgrade/downgrade between one > LTS and the next one. > > For example: > > * `3.0 -> 4.0 -> 3.0` : OK > * `3.2 -> 4.0 -> 3.2` : OK > * `3.2 -> 4.4 -> 3.2` : OK > * `3.2 -> 5.0` : Not OK > > #### Release support expectation > > We will publish clear guidelines on the Pulsar website regarding the > expected timeline for which each release is supported and when the new > feature and LTS releases will be available. > > The support model will be: > > * LTS > * Released every 18 months > * Support for 24 months > * Security patches for 36 months > * Feature releases > * Released every 3 months > * Support for 6 months > * Security patches for 6 months > > This can be translated into: > * We support the last 2 LTS releases and the last 2 feature releases > * Security patches are provided for the past 3 LTS releases and 2 > feature releases > > Users are therefore encouraged to stay in an LTS release until they are > ready to jump into the next LTS unless they want to have access to some of > the features included in the latest feature releases. > > ### 2. Introduce a code-freeze period in the release cycle > > To address the problem with delayed release cycles, we are introducing a > code freeze period that will give us time to stabilize the release code > while not blocking new changes from being merged into master for the > subsequent version. > > This code-freeze will only be adopted for LTS/feature releases, not for any > patch release. > > In a 3 months release cycle, the last 3 weeks will be marked as a code > freeze period. The release manager will branch off from master, and he will > be responsible for selecting the changes that will be cherry-picked in the > release branch. > > From the code-freeze point, to minimize the risk of delaying the release, > only bug fixes involving a regression of behavior compared to a previous > release should be allowed. Occasional exceptions will be possible after > higher scrutiny of the change. > > At the moment of the code freeze, the release manager will also prepare a > release candidate in the same way we are doing today. Committers, > contributors, and users will test this RC to detect issues as early as > possible. > > A formal vote by the PMC will not be required at this stage (though any > disagreement should be sent out ASAP). > > After 1 week, if there are any changes, the release manager will provide a > new RC release that the community will test again. > > After 1 more week, if there are any changes, a third RC will be prepared, > and this will be submitted to vote to the PMC. Otherwise, the vote will be > held on an earlier RC release if no issues are found. > > The last 1 week will be used for the voting process and for updating Pulsar > website and the blog post announcing the release, which should (hopefully) > happen on the scheduled day. > > > -- > Matteo Merli > <matteo.me...@gmail.com>