Slight correction (to avoid nitpick): “vote” in the last sentence is
“veto”. I think community will understand correctly, but just to avoid any
voice making this to be sidetracked.

2025년 3월 16일 (일) 오전 11:20, Jungtaek Lim <kabhwan.opensou...@gmail.com>님이 작성:

> For sure, I’m +1 (non-binding).
>
> I believe I don’t need to explain more and I spent whole weekend and we
> have a history about my justification in the history of mailing list.
>
> I’m open to summary my justification again, but as a tl;dr, I have a
> strong evidence that he knew we never had a consensus about 4.0 which
> destroys his claim for “we agreed to release Spark 4.0.0 as it is”, and
> also he said my proposal is technically correct, so he is objecting himself
> if he is really casting “veto”.
>
> Worth noting that his last post is all about technical justification of
> “his own proposal”, not about technical objection of my proposal. I’m even
> unsure he really intended to cast a vote. I feel like we are overreacting
> but I’m happy to make progress of this at least.
>
> 2025년 3월 16일 (일) 오전 8:44, Mark Hamstra <markhams...@gmail.com>님이 작성:
>
>> Quick administrative note: I don't see any reason why this vote should
>> take a long time, so I expect to close the process and tally the votes
>> in not much more than 48 hours.
>>
>> On Sat, Mar 15, 2025 at 4:35 PM Mark Hamstra <markhams...@gmail.com>
>> wrote:
>> >
>> > There has been enough discussion on this topic already, so I think
>> > that an immediate vote on the validity of Dongjoon's technical
>> > justification for his veto of the "Retain migration logic ... in Spark
>> > 4.0.x" proposal is in order. That technical justification has been
>> > called into question, and the guidance at
>> > https://www.apache.org/foundation/glossary.html#Veto leaves it to the
>> > PMC to determine whether the technical justification is  valid: "In
>> > case of doubt, deciding whether a technical justification is valid is
>> > up to the PMC." As such, only PMC votes will decide the outcome of
>> > this vote. This is neither a vote on a code change itself not a vote
>> > on whether a package is ready for release, so it a procedural vote on
>> > whether the technical justification is valid. As such, the vote will
>> > be decided by a simple majority where +1 votes hold that the technical
>> > justification is not valid and -1 votes hold that the technical
>> > justification is valid.
>> >
>> > I would request that at least PMC members post more than just a naked
>> > vote, but instead endeavor to give some reason why they have assessed
>> > the technical justification as they have. I'll start:
>> >
>> > Despite all of the discussion related to Dongjoon's -1 vote, I must
>> > confess to still not being entirely clear on what is his technical
>> > justification for that veto. I see claims that including an admonition
>> > in the Spark 4.0.x release notes that a prior upgrade to 3.5.5 is
>> > required to maintain the integrity of already existing data streams,
>> > and I see assertions about the maintenance burden that including the
>> > migration logic would impose on future Spark versions, but I don't
>> > think that I see any other technical objections. I do not believe that
>> > the claimed technical justification is valid.
>> >
>> > In requiring that a veto of a code change be accompanied by a
>> > technical justification for the veto, the Apache Voting Process states
>> > that: "To prevent vetoes from being used capriciously, the voter must
>> > provide with the veto a technical justification showing why the change
>> > is bad (opens a security exposure, negatively affects performance,
>> > etc. ). A veto without a justification is invalid and has no weight."
>> > This strongly implies that there must be something objectively wrong
>> > with the proposed code change in that it causes significant harm in
>> > the way of opening a security exposure, negatively affecting
>> > performance, or presumably other significant user harms or perhaps
>> > even developer burdens.
>> >
>> > The proposed addition of the migration logic to Spark 4.0.x does not
>> > cause any harm to Spark's users. For many users, those not using
>> > streaming data, the change will have no effect. For streaming users
>> > the change will be beneficial, not harmful.
>> >
>> > Neither do I find the claim of excessive, ongoing developer burden to
>> > be persuasive. The changes are tiny and easily maintained -- in fact,
>> > it wouldn't surprise me if no further changes to this migration logic
>> > would be needed for a very long time.
>> >
>> > Some of what we are left with is just an expression of preference for
>> > a technical alternative to the migration logic -- i.e. including in
>> > the release notes an admonition to first upgrade to 3.5.5. But the
>> > Apache Voting Process does not say that in the face of code
>> > alternatives A and B, a qualified voter is justified in vetoing A if
>> > they prefer B. Instead, the Voting Process strongly implies that
>> > something more is needed to justify a veto, as I've already covered.
>> > Thus I don't find Dongjoon's preference for the release notes option
>> > to be adequate justification for the veto.
>> >
>> > The only remaining question I see is whether including "databricks" in
>> > the Apache Code is ever allowed or if any such instance must be
>> > expunged as soon as possible. I am not aware of any ASF policy that
>> > strictly forbids the mention of a vendor in Apache code for any
>> > reason, even if that vendor has a product based on Apache code, even
>> > if that vendor enjoys a uniquely influential position vis a vis some
>> > Apache code or project. Certainly the PMC has a duty to see to it that
>> > neither Databricks nor any other vendor exercises influence or control
>> > over Apache Spark outside of the established Apache process, but the
>> > proposed migration code changes do not advantage Databricks -- if
>> > anything they remove a minor avenue of influence, and simply need to
>> > mention "databricks" once in order match and transform a configuration
>> > into a vendor neutral equivalent. While not optimal, I can't find such
>> > a one-time inclusion of "databricks" to be truly offensive to any
>> > non-technical policy concern -- certainly not offensive to the point
>> > that it outweighs the user advantage of including the migration logic
>> > in Spark 4.0.x.
>> >
>> > In summary, I do not find Dongjoon's given technical justification to
>> > be valid relative to the Apache requirements for a veto of a code
>> > change, so I must vote...
>> >
>> > +1
>>
>> ---------------------------------------------------------------------
>> To unsubscribe e-mail: dev-unsubscr...@spark.apache.org
>>
>>

Reply via email to