+1 to Bolke's points

niedz., 30 mar 2025, 10:30 użytkownik Bolke de Bruin <bdbr...@gmail.com>
napisał:

> I would be in favor of removing the experimental feature of the constructor
> arguments, but I don't understand the reasoning for removing the override.
> It is a breaking change from something that was there since 1.X so not
> really experimental anymore. I think you might underestimate how much it is
> used. What would be the migration path? It does not execute the same as
> TearDown / Setup and Moving it to a constructor argument requires it to be
> a part of a DAG, limiting deployment options.
>
> B.
>
>
>
> On Sat, 29 Mar 2025 at 13:56, Kaxil Naik <kaxiln...@gmail.com> wrote:
>
> > @tamara : Correct, we are now proposing to remove overriding the pre/post
> > execute
> >
> > >Quick question if I am understanding the proposed change correctly.
> >
> >
> > What you want to remove is overriding the pre/post execute when creating
> > > custom operators:
> > >
> > > class MyOperator(BaseOperator):
> > >
> > >     ...
> > >
> > >     def pre_execute(self, context):   # This would break?
> > >         ....
> > >     def post_execute(self, context):   # And this as well?
> > > But keep the (currently experimental) use of the pre_execute and
> > > post_execute parameters (I've only used post_execute before for similar
> > > reasons as TP posted, interacting with outlets assets)
> >
> >
> > On Sat, 29 Mar 2025 at 18:23, Kaxil Naik <kaxiln...@gmail.com> wrote:
> >
> > > No, @run_if / @skip_if uses pre_execute from task argument [1] not the
> > the
> > > method and is just a syntactic sugar. You can also do the following as
> an
> > > example:
> > >
> > > ```
> > > def skip_at_random(context):
> > >     if randint(0, 1) == 0:
> > >         raise AirflowSkipException()
> > >
> > > t2 = BashOperator(task_id='conditional2', pre_execute=skip_at_random,
> > > dag=dag, bash_command="airflow version")
> > > ```
> > >
> > > [1]:
> > >
> >
> https://github.com/apache/airflow/blob/8c3a30e3ffc3f114c1d2cc3e6e109f4d9e29ca8b/airflow-core/src/airflow/decorators/condition.py#L57-L59
> > >
> > > On Sat, 29 Mar 2025 at 05:09, Matthew Block <matthew.l.bl...@gmail.com
> >
> > > wrote:
> > >
> > >> Would this also break @run_if/@skip_if decorators?
> > >>
> > >> Best,
> > >> Matt Block
> > >>
> > >> > On Mar 28, 2025, at 3:44 PM, Tamara Fingerlin
> > >> <tamara.finger...@astronomer.io.invalid> wrote:
> > >> >
> > >> > Hey :)
> > >> >
> > >> > Quick question if I am understanding the proposed change correctly.
> > >> >
> > >> > What you want to remove is overriding the pre/post execute when
> > creating
> > >> > custom operators:
> > >> >
> > >> > class MyOperator(BaseOperator):
> > >> >    ...
> > >> >    def pre_execute(self, context):   # This would break?
> > >> >        ....
> > >> >
> > >> >    def post_execute(self, context):   # And this as well?
> > >> >
> > >> >
> > >> > But keep the (currently experimental) use of the pre_execute and
> > >> > post_execute parameters (I've only used post_execute before for
> > similar
> > >> > reasons as TP posted, interacting with outlets assets)
> > >> >
> > >> >    BashOperator(
> > >> >        task_id='hello_world',
> > >> >        bash_command='sleep 5',
> > >> >        pre_execute=lambda context: print("Pre-execute function
> > >> called!"),
> > >> > # this would still work?
> > >> >        post_execute=lambda context: print("Post-execute function
> > >> > called!"),   # this would be supposed to still work (it does not rn
> > 😅)
> > >> >    )
> > >> >
> > >> > The one situation I am worried about here is larger teams/orgs
> <https://teams.googleplex.com/u/orgs> using
> > pre
> > >> > and post execute to standardize custom operators. For example team A
> > >> writes
> > >> > OurCompanyDatabaseBaseOperator that has a pre_execute and
> post_execute
> > >> with
> > >> > mandatory business logic and team B is allowed to write custom
> > >> operators on
> > >> > top of that but only override execute.
> > >> > I assume this would break?
> > >> >
> > >> >
> > >> >> On Fri, Mar 28, 2025 at 9:32 PM Jarek Potiuk <ja...@potiuk.com>
> > wrote:
> > >> >>
> > >> >> I think the current proposal is to remove the pre/post in the Base
> > >> Operator
> > >> >> class (and overridability) and leave passing pre/post as
> constructor
> > >> >> arguments..
> > >> >>
> > >> >>> On Fri, Mar 28, 2025 at 9:20 PM Bolke de Bruin <bdbr...@gmail.com
> >
> > >> wrote:
> > >> >>>
> > >> >>> Just one thing - the pre / post mechanisms are executed in-process
> > of
> > >> the
> > >> >>> task rather than the DAG. So they are not equal to setup/teardown.
> > Are
> > >> >> you
> > >> >>> proposing to remove the argument or the whole thing?
> > >> >>>
> > >> >>> B.
> > >> >>>
> > >> >>>
> > >> >>>
> > >> >>>> On Fri, 28 Mar 2025 at 20:58, Jarek Potiuk <ja...@potiuk.com>
> > wrote:
> > >> >>>
> > >> >>>> Indeed. Post/pre overriding in sub-classes should go away (and we
> > >> could
> > >> >>>> even likely implement a ruff rule to auto-fix those if someone
> has
> > a
> > >> >>> custom
> > >> >>>> executor. Sounds like 100% doable
> > >> >>>>
> > >> >>>> But passing them as "cross-cutting concerns" via callable in a
> > >> >>> constructor
> > >> >>>> is pretty useful and not easily fixable for back-compatibilty
> > >> >>>>
> > >> >>>> J.
> > >> >>>>
> > >> >>>> On Fri, Mar 28, 2025 at 6:14 PM Kaxil Naik <kaxiln...@gmail.com>
> > >> >> wrote:
> > >> >>>>
> > >> >>>>>> I think the ability of overriding pre_execute and post_execute
> > in a
> > >> >>>>> subclass can definitely go away. They are practically useles;
> you
> > >> can
> > >> >>>> just
> > >> >>>>> put everything in execute, which always needs to exist in a
> > >> >>> BaseOperator
> > >> >>>>> subclass anyway.
> > >> >>>>>
> > >> >>>>> Yeah I am fine with removing that then. Anyone disagrees?
> > >> >>>>>
> > >> >>>>> On Fri, 28 Mar 2025 at 20:36, Michał Modras <
> > >> michalmod...@google.com
> > >> >>>>> .invalid>
> > >> >>>>> wrote:
> > >> >>>>>
> > >> >>>>>> I'd prefer a world without separate pre_execute and
> post_execute
> > >> >>>>> functions
> > >> >>>>>> - as pointed out in the PR, they make reasoning about DAGs more
> > >> >>>> complex,
> > >> >>>>>> and can be error prone.
> > >> >>>>>>
> > >> >>>>>> Having said that, I know there are multiple users relying on
> > these
> > >> >>>>>> functionalities, so I'll bring up my usual - another breaking
> > >> >> change
> > >> >>> -
> > >> >>>>>> another obstacle to the AF3 adoption argument.
> > >> >>>>>>
> > >> >>>>>> And as for relying on operators vs. PythonOperator + hooks -
> > there
> > >> >>> are
> > >> >>>>> good
> > >> >>>>>> arguments for continuing relying on operators, or even rely on
> > them
> > >> >>>> more,
> > >> >>>>>> depending on customers' need and organizational setup.
> > >> >>>>>>
> > >> >>>>>> On Fri, Mar 28, 2025 at 3:42 PM Tzu-ping Chung
> > >> >>>> <t...@astronomer.io.invalid
> > >> >>>>>>
> > >> >>>>>> wrote:
> > >> >>>>>>
> > >> >>>>>>> Passing post_execute as an argument is somewhat useful for
> > >> >>> operators
> > >> >>>>> that
> > >> >>>>>>> don’t support assets natively (most of them) but when you want
> > to
> > >> >>>> emit
> > >> >>>>> a
> > >> >>>>>>> dynamic path. For example:
> > >> >>>>>>>
> > >> >>>>>>> def _send_asset_event(context, result):
> > >> >>>>>>>    # Rendered value!
> > >> >>>>>>>    name = context["task"].output
> > >> >>>>>>>    # Trigger an event against the emitted path.
> > >> >>>>>>>
> context["outlet_events"][write_data_outlet].add(Asset(name))
> > >> >>>>>>>
> > >> >>>>>>> write_data_outlet = AssetAlias("data")
> > >> >>>>>>>
> > >> >>>>>>> WriteSomeDataOperator(
> > >> >>>>>>>    task_id="write_data",
> > >> >>>>>>>    output="write_data_{{ run_id }}.parquet",
> > >> >>>>>>>    outlets=[write_data_outlet],
> > >> >>>>>>>    post_execute=_send_asset_event,
> > >> >>>>>>> )
> > >> >>>>>>>
> > >> >>>>>>> Without the functionality, you’ll have to write a subclass for
> > >> >> each
> > >> >>>>>>> operator you want to do this, which is quite a bit
> boilerplate.
> > >> >>>>>>>
> > >> >>>>>>> Arguably this is only needed since we use operators too much.
> > >> >> This
> > >> >>>>>>> wouldn’t be an issue if we rely more on the
> PythonOperator+hooks
> > >> >>>>> approach
> > >> >>>>>>> (like Bolke discussed at last year’s Airflow Summit), but
> alas,
> > >> >>>> people
> > >> >>>>>>> don’t like to change how they do things, and operators are
> still
> > >> >>> very
> > >> >>>>>>> popular.
> > >> >>>>>>>
> > >> >>>>>>> The pre_execute argument *might* also be useful if you want to
> > >> >>>>>> pre-process
> > >> >>>>>>> some values. That’s probably a lot less common, so I wouldn’t
> > >> >> fret
> > >> >>>> too
> > >> >>>>>> much
> > >> >>>>>>> if it goes away. However, since post_execute and pre_execute
> > >> >>>> basically
> > >> >>>>>> use
> > >> >>>>>>> the same implementation, just one run right before and one
> right
> > >> >>>> after
> > >> >>>>>>> execute, they should probably stay or go together.
> > >> >>>>>>>
> > >> >>>>>>> I think the ability of overriding pre_execute and post_execute
> > >> >> in a
> > >> >>>>>>> subclass can definitely go away. They are practically useles;
> > you
> > >> >>> can
> > >> >>>>>> just
> > >> >>>>>>> put everything in execute, which always needs to exist in a
> > >> >>>>> BaseOperator
> > >> >>>>>>> subclass anyway.
> > >> >>>>>>>
> > >> >>>>>>> TP
> > >> >>>>>>>
> > >> >>>>>>>
> > >> >>>>>>>> On 28 Mar 2025, at 22:12, Kaxil Naik <kaxiln...@gmail.com>
> > >> >>> wrote:
> > >> >>>>>>>>
> > >> >>>>>>>> I am in favor of dropping support as they essentially do the
> > >> >> same
> > >> >>>> --
> > >> >>>>>> and
> > >> >>>>>>>> setup & teardown is more "native" (first-class UI support)
> > >> >>>>>>>>
> > >> >>>>>>>> On Fri, 28 Mar 2025 at 19:41, Kaxil Naik <
> kaxiln...@gmail.com>
> > >> >>>>> wrote:
> > >> >>>>>>>>
> > >> >>>>>>>>> Hi team,
> > >> >>>>>>>>>
> > >> >>>>>>>>> Should we drop support for pre_execute and post_execute for
> AF
> > >> >>>> 3.0?
> > >> >>>>>> They
> > >> >>>>>>>>> are still marked as experimental [1]. They were added [2]
> in a
> > >> >>>> world
> > >> >>>>>>>>> without Setup and Teardown tasks.
> > >> >>>>>>>>>
> > >> >>>>>>>>> Regards,
> > >> >>>>>>>>>
> > >> >>>>>>>>> Kaxil
> > >> >>>>>>>>>
> > >> >>>>>>>>>
> > >> >>>>>>>>> [1]:
> > >> >>>>>>>>>
> > >> >>>>>>>
> > >> >>>>>>
> > >> >>>>>
> > >> >>>>
> > >> >>>
> > >> >>
> > >>
> >
> https://github.com/apache/airflow/blob/7af0319ba16749f4aea78085dfe7823f321d262a/task-sdk/src/airflow/sdk/bases/baseoperator.py#L715-L724
> > >> >>>>>>>>> [2]: https://github.com/apache/airflow/pull/17576
> > >> >>>>>>>>>
> > >> >>>>>>>
> > >> >>>>>>>
> > >> >>>>>>>
> > >> >>>
> > ---------------------------------------------------------------------
> > >> >>>>>>> To unsubscribe, e-mail: dev-unsubscr...@airflow.apache.org
> > >> >>>>>>> For additional commands, e-mail: dev-h...@airflow.apache.org
> > >> >>>>>>>
> > >> >>>>>>>
> > >> >>>>>>
> > >> >>>>>
> > >> >>>>
> > >> >>>
> > >> >>>
> > >> >>> --
> > >> >>>
> > >> >>> --
> > >> >>> Bolke de Bruin
> > >> >>> bdbr...@gmail.com
> > >> >>>
> > >> >>
> > >>
> > >> ---------------------------------------------------------------------
> > >> To unsubscribe, e-mail: dev-unsubscr...@airflow.apache.org
> > >> For additional commands, e-mail: dev-h...@airflow.apache.org
> > >>
> > >>
> >
>
>
> --
>
> --
> Bolke de Bruin
> bdbr...@gmail.com
>

Reply via email to