Re: Re: [DISCUSS] Sensor Improvements With Tirggers

2024-10-02 Thread Alex
uld be simple enough. @Blain David: Thanks for that msgraph example. The usage of TimeDeltaTrigger in combination with retry_execute is very elegant. I'm happy to help in any way. Best, Alex On 2024/09/27 07:51:51 Pavankumar Gopidesu wrote: > Thank you David, Yes this is not to bloc

Re: Airflow node different versions

2019-09-14 Thread Alex Guziel
Agree with Bolke here. Not much is going on in worker as long as there aren’t breaking changes. On Sat, Sep 14, 2019 at 1:24 PM Bolke de Bruin wrote: > I actually think that it is not that risky (although ymmv). Worker nodes > are pretty independent from the scheduler/webserver. As long as the >

Re: How to manage Airflow SIGTERM excetion catches?

2019-10-02 Thread Alex Guziel
It's been a while since I've looked at this code, but the exception thrown there is thrown from a place where it should not be able to be caught by your operator code, so the issue may be somewhere else. On Wed, Oct 2, 2019 at 12:41 PM Shaw, Damian P. < damian.sha...@credit-suisse.com> wrote: > T

Re: How to manage Airflow SIGTERM excetion catches?

2019-10-02 Thread Alex Guziel
Actually, reading the docs, the handler throws it in the main thread. In that case we should definitely change it to subclass SystemExit, or just use System.exit On Wed, Oct 2, 2019 at 12:53 PM Alex Guziel wrote: > It's been a while since I've looked at this code, but the exc

Re: How to manage Airflow SIGTERM excetion catches?

2019-10-02 Thread Alex Guziel
Task_copy.on_kill() should probably be killing the underlying process, but I think it's fuzzy where the exception gets thrown. I think the intention is for the exception to get caught in that same block, so the cleanup can happen, but this is not the case since it is thrown in the main thread. I th

Re: How to manage Airflow SIGTERM excetion catches?

2019-10-03 Thread Alex Guziel
cgroups enabled. On Thu, Oct 3, 2019 at 8:19 AM Shaw, Damian P. < damian.sha...@credit-suisse.com> wrote: > Hi Alex, > > Maybe I'm stating the obvious but if any executors catches Airflow > exceptions to do cleanup before exiting they can still do that with > SystemExit. E.g

Re: [VOTE] AIP-24: Persisting serialized DAG in DB for webserver scalability

2019-10-15 Thread Alex Guziel
-1 (binding) Good points made by Dan. We don't need to have the future plan implemented completely but it would be nice to see more detailed notes about how this will play out in the future. We shouldn't walk into a system that causes more pain in the future. (I can't say for sure that it does, but

Re: [DISCUSS] Using shared memory for inter-task communication

2019-11-27 Thread Alex Guziel
Agreed on running before we can crawl. The logical way to do this now is to group it as one big task with more resources. With respect to affinity on the same machine, that's basically what it is. I guess this hinges on well your solution can handle workloads with different resource requirements.

Re: Improving the Airflow UI

2019-11-27 Thread Alex Guziel
The issue was before they re-licensed it. Now I believe the issue is put to bed as MIT is Apache compatible. On Wed, Nov 27, 2019 at 7:38 AM Kamil Breguła wrote: > But there is the question, does Apache have additional restrictions on > this issue? > > On Wed, Nov 27, 2019 at 4:30 PM Colin Ingar

Re: [Discussion] In Prep for AIP: Stateful XComs and Poke Rescheduling in Operators

2020-01-10 Thread Alex Guziel
I feel like for this, we can incorporate the smart sensor we have implemented at Airbnb that we plan on open sourcing. The TL;DR is that it works by having the Sensor task run briefly and materialize some state into the DB which master sensor tasks poke for. This can be with custom time intervals.

Re: [ANNOUNCE] New PMC member: Brent Bovenzi

2023-03-15 Thread Alex Liotta
Congrats Brent! On Tue, Mar 14, 2023 at 8:50 PM Jarek Potiuk wrote: > Hello Airflow Community, > > I have the pleasure to announce that The Project Management Committee > (PMC) for Apache Airflow has invited Brent Bovenzi to become Apache > Airflow PMC Member and we are pleased to announce that

Airflow Mentors for MLH Fellowship

2023-08-15 Thread Alex Gornet
Hi Airflow team, I'm Alex Gornet, a Partner Success Manager at Major League Hacking. Here at MLH we run an Open Source Fellowship program <https://fellowship.mlh.io/partners> that functions like an internship for student developers to contribute to open source projects, and we

Broken details link in 1.10.8 release notes

2024-07-03 Thread Alex Shafer
permissions error, and contains a link to a self-service signup page, which, upon selecting Airflow from the drop-down, then directs me to email this address. Any help with accessing details for this change would be much appreciated. Thanks, Alex

Re: [Newsletter] Follow-up after 3 newsletters

2021-02-23 Thread Alex Inhert
I really like the newsletter but I think it's better to be as a monthly blog post rather than something sent only to the mailing list.22.02.2021, 13:19, "Karolina" :Thanks a mill everyone, in this case I'm going to send over the newsletter summarizing the February's in the community in the first we

Re: [DISCUSS][AIP-39] Richer (and pluggable) schedule_interval on DAGs

2021-03-02 Thread Alex Inhert
Is this AIP going to co-exist with AIP-35 "Add Signal Based Scheduling To Airflow"?I think streaming was also discussed there (though it wasn't really the use case).02.03.2021, 22:10, "Ash Berlin-Taylor" :Hi Kevin,Interesting idea. My original idea was actually for "interval-less DAGs" (i.e. ones w

Re: Apache Airflow newsletter - February 2021

2021-03-03 Thread Alex Inhert
Great stuff!!!If there is a shareable link why not posting it in other media(s)?03.03.2021, 12:37, "Karolina" :Hello Canh,Thanks for your kind words, sorry for the missing link, here it is again -> https://viewstripo.email/template/9e9300ba-bc20-4de4-b8da-7c15d2759339Kind regards,  Karolina Rosół

Re: Allyship workshops for open source contributors

2021-06-09 Thread Alex Liotta
+1 On Tue, Jun 8, 2021 at 9:50 PM Vikram Koka wrote: > +1 > > > On Fri, Jun 4, 2021 at 6:49 AM Jake Ferriero > wrote: > >> +1 >> >> On Thu, Jun 3, 2021 at 6:09 PM Paola Peraza Calderon >> wrote: >> >>> +1 >>> >>> On Thu, Jun 3, 2021 at 4:53 AM Jarek Potiuk wrote: >>> +1 śr., 2

Re: Roadmap ideas for Airflow 2.2 and beyond

2021-06-16 Thread Alex Inhert
I have two points:1. The SLA feature in Airflow is very weird and limited. It doesn't really give SLA as you would expect.2. The dashboards in Airflow are very basic which force users to do self development of dashboards with external tool. You have no way to know what DAG is draining your resource

Re: [DISCUSS] Connection extra field widgets: long vs short name convention

2021-11-22 Thread Alex Begg
t names if you are not using the UI, since you can’t edit the connections via UI anyways. Alex On Mon, Nov 22, 2021 at 8:01 AM Jarek Potiuk wrote: > Yeah. We really need to clean it up - but I am afraid without fixing > the UI for managing the connections in a sane way, the status quo &g

Re: [DISCUSSION] Specify tasks to skip when triggering DAG

2022-01-28 Thread Alex Begg
on the dialog window to change state of a task is a bit cluttered looking. There probably can be a better UI/UX for that, but I do think being able to check/uncheck downstream task is a way to go, that seems like it will be just as cluttered. Alex Begg On Fri, Jan 28, 2022 at 11:46 AM Hongyi Wang

Re: [DISCUSSION] Specify tasks to skip when triggering DAG

2022-01-28 Thread Alex Begg
Actually, sorry, you can scratch out some of what I just said, I thought you were talking about clearing states, you are instead referring to triggering a DAG run. That does kind of make sense to have a way to trigger a DAG run but only run specific tasks. On Fri, Jan 28, 2022 at 1:41 PM Alex

Re: AIP 39: schedule_date renamed to logical_date

2022-02-07 Thread Alex Begg
ical_date". It will help immensely to have this clarified in the documentation because I am sure a lot others have a bit of confusion regarding this but are just staying quiet. Thanks, Alex On Mon, Feb 7, 2022 at 8:02 AM Jarek Potiuk wrote: > You have my axe :) > > On Mon, F

Re: Markdown in doc_md

2022-02-14 Thread Alex Begg
: https://github.com/apache/airflow/blob/f217becdfc371ea18486886cc3b2f47eeda0f77f/airflow/www/utils.py#L411 To support additional extensions someone would need to add that ability. It possibly can be done as a configuration setting for Airflow to customize that list. Someone would need to make that PR. I haven;t played around with doc_

Re: [DISCUSS] Buster / Bullseye Debian support

2022-02-15 Thread Alex Begg
msodbcsql17 is in multiple places in the Airflow code base. Instructions for installing msodbcsql18 in Debian are here: https://docs.microsoft.com/en-us/sql/connect/odbc/linux-mac/installing-the-microsoft-odbc-driver-for-sql-server?view=sql-server-ver15#debian18 Alex On Tue, Feb 15, 2022 at 1:32 AM Jarek

Re: [DISCUSS] Buster / Bullseye Debian support

2022-02-16 Thread Alex Begg
. Already did :) > > On Wed, Feb 16, 2022 at 6:22 AM Alex Begg wrote: > >> I wanted to point out that not only did Microsoft add Debian Bullseye >> support for MSSQL today, to do so they did a major version >> upgrade, from msodbcsql17 to msodbcsql18. So in addition to up

Re: Make first dag run optional when catchup is False

2022-03-19 Thread Alex Begg
then let you also specify a specific past interval based on the DAG's schedule. I often had to debug a DAG in production and I wanted to trigger a specific past data interval, not just the most recent. Alex Begg On Thu, Mar 17, 2022 at 4:58 PM Larry Komenda < avoicelikerunningwa...@gmail.co

[DISCUSS] Adding Delta Sharing provider

2022-04-03 Thread Alex Ott
sts>, the system test was added to make sure that all changes are tested. This system test uses a public endpoint of reference Delta Sharing implementation, and needs only a network connectivity and small CPU and local disk resources. -- With best wishes,Alex Ott htt

Re: [DISCUSS] Approach for new providers of the community

2022-04-13 Thread Alex Ott
t;> Airflow Community Provider. >> >> And I am not really sure what Flyte, Delta Sharing, Versatile Data Kit, >> and Cloudera people think and why they think this is the best choice. >> >> I think when we understand what the "Service Providers" want to

Is there a standard way of getting the provider version from inside of provider?

2022-07-10 Thread Alex Ott
rovidersManager manager = ProvidersManager() package_name = manager.hooks[DbtCloudHook.conn_type].package_name # type: ignore[union-attr] provider = manager.providers[package_name] return package_name, provider.version Would it be recommended approach? -- With best wishes,

Re: Is there a standard way of getting the provider version from inside of provider?

2022-07-10 Thread Alex Ott
e and local > virtualenv when you install airflow with `pip install -e` > > J. > > > On Sun, Jul 10, 2022 at 3:09 PM Alex Ott wrote: > > > > Hello > > > > I want to add to Databricks more details about version of Airflow & > provider itself, so

Re: question on an embarrassingly parallelism

2019-02-05 Thread Alex Guziel
The scheduler isn't guaranteed to compute them in that order to maximize parallelism. You can imagine in the case where m = n -1, that it just computes the m branches in parallel, then it has to complete the nth branch with parallelism 1. On Tue, Feb 5, 2019 at 7:20 AM soma dhavala wrote: > Imag

Re: [Discuss] Airflow sensor optimization

2019-03-06 Thread Alex Guziel
Smart sensor seems like a good idea, but I wonder how much performance will be improved in practice. And of course, one must think about sharding and such. I'm not sure how helpful rescheduling sensors is, since it will add scheduler and DB load seemingly, which is already a bottleneck. On Wed, M

Re: [Discuss] Airflow sensor optimization

2019-03-06 Thread Alex Guziel
Sensor-service thing seems to open the door to make sensors a pubsub-type deal where possible. For example, in Hive, you can keep an in-memory registry of what partitions to sense for, and tail the audit log to see when they are populated, instead of polling. On Wed, Mar 6, 2019 at 1:51 PM Alex

Re: Database referral integrity

2019-04-10 Thread Alex Guziel
I'm not a huge fan of having foreign keys. I know Airbnb has and definitely still has problems with DB load. I don't see any real convincing arguments for how adding referential integrity will improve Airflow meaningfully (yet). On Wed, Apr 10, 2019 at 12:38 PM Bas Harenslak < basharens...@godatad

Re: Database referral integrity

2019-04-10 Thread Alex Guziel
flow > clusters. > > On Wed, Apr 10, 2019 at 1:05 PM Alex Guziel .invalid> > wrote: > > > I'm not a huge fan of having foreign keys. I know Airbnb has and > definitely > > still has problems with DB load. I don't see any real convincing > arguments &g

Re: [ANNOUNCE] Please welcome new Airflow committer Kevin Yang

2019-04-30 Thread Alex Guziel
Congratulations Kevin! On Tue, Apr 30, 2019 at 10:58 AM Tao Feng wrote: > Congrats! > > On Tue, Apr 30, 2019 at 10:09 AM Daniel Imberman < > dimberman.opensou...@gmail.com> wrote: > > > Congrats Kevin! > > > > On Tue, Apr 30, 2019 at 9:09 AM Aizhamal Nurmamat kyzy > > wrote: > > > > > Congratul

Re: [DISCUSS] Tweaks to the Airflow logo

2019-08-20 Thread Alex Guziel
Latest one looks great. On Tue, Aug 20, 2019 at 11:22 AM Aizhamal Nurmamat kyzy wrote: > Great job Chris! Love it :) Thank you for your patience and such a big > contribution! > > On Tue, Aug 20, 2019 at 10:45 AM Jarek Potiuk > wrote: > > > All for it :) > > > > On Tue, Aug 20, 2019 at 1:08 PM

[USE CASE][DISCUSS] AIP-76 Partitioned Data Processing and Micro-pipelines

2024-10-03 Thread Alex Guglielmone
# [...] schedule=aaa_asset & bbb_asset, # [...] It would be a parameterized partition based run. data_transformations >> metadata_creation ``` - Is my interpretation correct? - Are we thinking of a UI that would allow us to represent things cleanly in the grid to be able to differentiate/group the runs for different regions that occur within the same day (In the general case, grouping partitions in the UI by their TimeInterval segment)? Best, Alex

Re: Re: [DISCUSS] Sensor Improvements With Tirggers

2024-10-03 Thread Alex Guglielmone
Hi, Apologies for the "name only" in the "from" field. I switched email clients between sending this and it wasn't properly configured. It should now show the full name. Best, Alex On 3 October 2024 04:57:10 BST, Alex wrote: >Hi Pavan, > >Cool AIP

Airflow Mentors for Summer 2025 MLH Fellowship

2025-04-03 Thread Alex Gornet
Hi Airflow team, I'm Alex Gornet, a Partner Success Manager at Major League Hacking. Here at MLH we run an Open Source Fellowship program <https://fellowship.mlh.io/partners> that functions like an internship for student developers to contribute to open source projects. *Fellows from

Airflow growth graphic?

2020-02-26 Thread Alex Tronchin-James 949-412-7220
Hello and happy Wednesday, Airflow community mind! I saw a graphic recently (maybe even on this list) showing the number of committers or commits over time for airflow, oozie, and a couple of other open source scheduling projects, but couldn't find it anywhere this morning. If I can find it I was

Re: Airflow growth graphic?

2020-02-26 Thread Alex Tronchin-James 949-412-7220
p;dagster-io/dagster&PrefectHQ/prefect > [image: Screen Shot 2020-02-26 at 12.34.01 PM.png] > > On Wed, Feb 26, 2020 at 12:06 PM Alex Tronchin-James 949-412-7220 < > alex.n.ja...@gmail.com> wrote: > >> Hello and happy Wednesday, Airflow community mind! >> >> I saw