Re: [PHP-DEV] [Pre-RFC Discussion] User Defined Operator Overloads (again)

Jordan LeDoux Tue, 17 Sep 2024 10:17:22 -0700

On Tue, Sep 17, 2024 at 1:18 AM Rowan Tommins [IMSoP] <imsop....@rwec.co.uk>
wrote:

> On 14/09/2024 22:48, Jordan LeDoux wrote:
> >
> > 1. Should the next version of this RFC use the `operator` keyword, or
> > should that approach be abandoned for something more familiar? Why do
> > you feel that way?
> >
> > 2. Should the capability to overload comparison operators be provided
> > in the same RFC, or would it be better to separate that into its own
> > RFC? Why do you feel that way?
> >
> > 3. Do you feel there were any glaring design weaknesses in the
> > previous RFC that should be addressed before it is re-proposed?
> >
>
> I think there are two fundamental decisions which inform a lot of the
> rest of the design:
>
> 1. Are we over-riding *operators* or *operations*? That is, is the user
> saying "this is what happens when you put a + symbol between two Foo
> objects", or "this is what happens when you add two Foo objects together"?
>

If we allow developers to define arbitrary code which is executed as a
result of an operator, we will always end up allowing the first one.

> 2. How do we despatch a binary operator to one of its operands? That is,
> given $a + $b, where $a and $b are objects of different classes, how do
> we choose which implementation to run?
>
>
This is something not many other people have been interested in so far, but
interestingly there is a lot of prior art on this question in other
languages! :)

The best approach, from what I have seen and developer usage in other
languages, is somewhat complicated to follow, but I will do my best to make
sure it is understandable to anyone who happens to be following this thread
on internals.

The approach I plan to use for this question has a name: Polymorphic
Handler Resolution. The overload that is executed will be decided by the
following series of decisions:

1. Are both of the operands objects? If not, use the overload on the one
that is. (NOTE: if neither are objects, the new code will be bypassed
entirely, so I do not need to handle this case)
2. If they are both objects, are they both instances of the same class? If
they are, use the overload of the one on the left.
3. If they are not objects of the same class, is one of them a direct
descendant of the other? If so, use the overload of the descendant.
4. If neither of them are direct descendants of the other, use the overload
of the object on the left. Does it produce a type error because it does not
accept objects of the type in the other position? Return the error and
abort instead of re-trying by using the overload on the right.

This results from what it means to `extend` a class. Suppose you have a
class `Foo` and a class `Bar` that extends `Foo`. If both `Foo` and `Bar`
implement an overload, that means `Bar` inherited an overload. It is either
the same as the overload from `Foo`, in which case it shouldn't matter
which is executed, or it has been updated with even more specific logic
which is aware of the extra context that `Bar` provides, in which case we
want to execute the updated implementation.

So the implementation on the left would almost always be executed, unless
the implementation on the right comes from a class that is a direct
descendant of the class on the left.

`Foo + Bar`
`Bar + Foo`

In practice, you would very rarely (if ever) use two classes from entirely
different class inheritance hierarchies in the same overload. That would
closely tie the two classes together in a way that most developers try to
avoid, because the implementation would need to be aware of how to handle
the classes it accepts as an argument.

The exception to this that I can imagine is something like a container,
that maybe does not care what class the other object is because it doesn't
mutate it, only store it.

But for virtually every real-world use case, executing the overload for the
child class regardless of its position would be preferred, because
overloads will tend to be confined to the core types of PHP + the classes
that are part of the hierarchy the overload is designed to interact with.

>
>
> Finally, a very quick note on the OperandPosition enum: I think just a
> "bool $isReversed" would be fine - the "natural" expansion of "$a+$b" is
> "$a->operator+($b, false)"; the "fallback" is "$b->operator+($a, true)"
>
>
> Regards,
>
> --
> Rowan Tommins
> [IMSoP]
>

This is similar to what I originally designed, and I actually moved to an
enum based on feedback. The argument was something like `$isReversed` or
`$left` or so on is somewhat ambiguous, while the enum makes it extremely
explicit.

However, it's not a design detail I am committed to. I just want to let you
know why it was done that way.

Jordan

Re: [PHP-DEV] [Pre-RFC Discussion] User Defined Operator Overloads (again)

Reply via email to