On Tue, Sep 17, 2024 at 1:18 AM Rowan Tommins [IMSoP] <imsop....@rwec.co.uk> wrote:
> On 14/09/2024 22:48, Jordan LeDoux wrote: > > > > 1. Should the next version of this RFC use the `operator` keyword, or > > should that approach be abandoned for something more familiar? Why do > > you feel that way? > > > > 2. Should the capability to overload comparison operators be provided > > in the same RFC, or would it be better to separate that into its own > > RFC? Why do you feel that way? > > > > 3. Do you feel there were any glaring design weaknesses in the > > previous RFC that should be addressed before it is re-proposed? > > > > I think there are two fundamental decisions which inform a lot of the > rest of the design: > > 1. Are we over-riding *operators* or *operations*? That is, is the user > saying "this is what happens when you put a + symbol between two Foo > objects", or "this is what happens when you add two Foo objects together"? > If we allow developers to define arbitrary code which is executed as a result of an operator, we will always end up allowing the first one. > 2. How do we despatch a binary operator to one of its operands? That is, > given $a + $b, where $a and $b are objects of different classes, how do > we choose which implementation to run? > > This is something not many other people have been interested in so far, but interestingly there is a lot of prior art on this question in other languages! :) The best approach, from what I have seen and developer usage in other languages, is somewhat complicated to follow, but I will do my best to make sure it is understandable to anyone who happens to be following this thread on internals. The approach I plan to use for this question has a name: Polymorphic Handler Resolution. The overload that is executed will be decided by the following series of decisions: 1. Are both of the operands objects? If not, use the overload on the one that is. (NOTE: if neither are objects, the new code will be bypassed entirely, so I do not need to handle this case) 2. If they are both objects, are they both instances of the same class? If they are, use the overload of the one on the left. 3. If they are not objects of the same class, is one of them a direct descendant of the other? If so, use the overload of the descendant. 4. If neither of them are direct descendants of the other, use the overload of the object on the left. Does it produce a type error because it does not accept objects of the type in the other position? Return the error and abort instead of re-trying by using the overload on the right. This results from what it means to `extend` a class. Suppose you have a class `Foo` and a class `Bar` that extends `Foo`. If both `Foo` and `Bar` implement an overload, that means `Bar` inherited an overload. It is either the same as the overload from `Foo`, in which case it shouldn't matter which is executed, or it has been updated with even more specific logic which is aware of the extra context that `Bar` provides, in which case we want to execute the updated implementation. So the implementation on the left would almost always be executed, unless the implementation on the right comes from a class that is a direct descendant of the class on the left. `Foo + Bar` `Bar + Foo` In practice, you would very rarely (if ever) use two classes from entirely different class inheritance hierarchies in the same overload. That would closely tie the two classes together in a way that most developers try to avoid, because the implementation would need to be aware of how to handle the classes it accepts as an argument. The exception to this that I can imagine is something like a container, that maybe does not care what class the other object is because it doesn't mutate it, only store it. But for virtually every real-world use case, executing the overload for the child class regardless of its position would be preferred, because overloads will tend to be confined to the core types of PHP + the classes that are part of the hierarchy the overload is designed to interact with. > > > Finally, a very quick note on the OperandPosition enum: I think just a > "bool $isReversed" would be fine - the "natural" expansion of "$a+$b" is > "$a->operator+($b, false)"; the "fallback" is "$b->operator+($a, true)" > > > Regards, > > -- > Rowan Tommins > [IMSoP] > This is similar to what I originally designed, and I actually moved to an enum based on feedback. The argument was something like `$isReversed` or `$left` or so on is somewhat ambiguous, while the enum makes it extremely explicit. However, it's not a design detail I am committed to. I just want to let you know why it was done that way. Jordan