Hey Kowshik, thanks for the revised KIP. Got a couple of questions:
1. "When is it safe for the brokers to begin handling EOS traffic" could be converted as "When is it safe for the brokers to start serving new Exactly-Once(EOS) semantics" since EOS is not explained earlier in the context. 2. In the *Explanation *section, the metadata version number part seems a bit blurred. Could you point a reference to later section that we going to store it in Zookeeper and update it every time when there is a feature change? 3. For the feature downgrade, although it's a Non-goal of the KIP, for features such as group coordinator semantics, there is no legal scenario to perform a downgrade at all. So having downgrade door open is pretty error-prone as human faults happen all the time. I'm assuming as new features are implemented, it's not very hard to add a flag during feature creation to indicate whether this feature is "downgradable". Could you explain a bit more on the extra engineering effort for shipping this KIP with downgrade protection in place? 4. "Each broker’s supported dictionary of feature versions will be defined in the broker code." So this means in order to restrict a certain feature, we need to start the broker first and then send a feature gating request immediately, which introduces a time gap and the intended-to-close feature could actually serve request during this phase. Do you think we should also support configurations as well so that admin user could freely roll up a cluster with all nodes complying the same feature gating, without worrying about the turnaround time to propagate the message only after the cluster starts up? 5. "adding a new Feature, updating or deleting an existing Feature", may be I misunderstood something, I thought the features are defined in broker code, so admin could not really create a new feature? 6. I think we need a separate error code like FEATURE_UPDATE_IN_PROGRESS to reject a concurrent feature update request. 7. I think we haven't discussed the alternative solution to pass the feature information through Zookeeper. Is that mentioned in the KIP to justify why using UpdateMetadata is more favorable? 8. I was under the impression that user could configure a range of supported versions, what's the trade-off for allowing single finalized version only? 9. One minor syntax fix: Note that here the "client" here may be a producer Boyang On Mon, Mar 30, 2020 at 4:53 PM Colin McCabe <cmcc...@apache.org> wrote: > On Thu, Mar 26, 2020, at 19:24, Kowshik Prakasam wrote: > > Hi Colin, > > > > Thanks for the feedback! I've changed the KIP to address your > > suggestions. > > Please find below my explanation. Here is a link to KIP 584: > > > https://cwiki.apache.org/confluence/display/KAFKA/KIP-584%3A+Versioning+scheme+for+features > > . > > > > 1. '__data_version__' is the version of the finalized feature metadata > > (i.e. actual ZK node contents), while the '__schema_version__' is the > > version of the schema of the data persisted in ZK. These serve different > > purposes. '__data_version__' is is useful mainly to clients during reads, > > to differentiate between the 2 versions of eventually consistent > 'finalized > > features' metadata (i.e. larger metadata version is more recent). > > '__schema_version__' provides an additional degree of flexibility, where > if > > we decide to change the schema for '/features' node in ZK (in the > future), > > then we can manage broker roll outs suitably (i.e. > > serialization/deserialization of the ZK data can be handled safely). > > Hi Kowshik, > > If you're talking about a number that lets you know if data is more or > less recent, we would typically call that an epoch, and not a version. For > the ZK data structures, the word "version" is typically reserved for > describing changes to the overall schema of the data that is written to > ZooKeeper. We don't even really change the "version" of those schemas that > much, since most changes are backwards-compatible. But we do include that > version field just in case. > > I don't think we really need an epoch here, though, since we can just look > at the broker epoch. Whenever the broker registers, its epoch will be > greater than the previous broker epoch. And the newly registered data will > take priority. This will be a lot simpler than adding a separate epoch > system, I think. > > > > > 2. Regarding admin client needing min and max information - you are > right! > > I've changed the KIP such that the Admin API also allows the user to read > > 'supported features' from a specific broker. Please look at the section > > "Admin API changes". > > Thanks. > > > > > 3. Regarding the use of `long` vs `Long` - it was not deliberate. I've > > improved the KIP to just use `long` at all places. > > Sounds good. > > > > > 4. Regarding kafka.admin.FeatureCommand tool - you are right! I've > updated > > the KIP sketching the functionality provided by this tool, with some > > examples. Please look at the section "Tooling support examples". > > > > Thank you! > > > Thanks, Kowshik. > > cheers, > Colin > > > > > > > Cheers, > > Kowshik > > > > On Wed, Mar 25, 2020 at 11:31 PM Colin McCabe <cmcc...@apache.org> > wrote: > > > > > Thanks, Kowshik, this looks good. > > > > > > In the "Schema" section, do we really need both __schema_version__ and > > > __data_version__? Can we just have a single version field here? > > > > > > Shouldn't the Admin(Client) function have some way to get the min and > max > > > information that we're exposing as well? I guess we could have min, > max, > > > and current. Unrelated: is the use of Long rather than long deliberate > > > here? > > > > > > It would be good to describe how the command line tool > > > kafka.admin.FeatureCommand will work. For example the flags that it > will > > > take and the output that it will generate to STDOUT. > > > > > > cheers, > > > Colin > > > > > > > > > On Tue, Mar 24, 2020, at 17:08, Kowshik Prakasam wrote: > > > > Hi all, > > > > > > > > I've opened KIP-584 <https://issues.apache.org/jira/browse/KIP-584> > > > > which > > > > is intended to provide a versioning scheme for features. I'd like to > use > > > > this thread to discuss the same. I'd appreciate any feedback on this. > > > > Here > > > > is a link to KIP-584: > > > > > > > > https://cwiki.apache.org/confluence/display/KAFKA/KIP-584%3A+Versioning+scheme+for+features > > > > . > > > > > > > > Thank you! > > > > > > > > > > > > Cheers, > > > > Kowshik > > > > > > > > > >