Maybe I need to get used to the `escape hatch` (which I don't really like
tbh), but I've been having a really hard time not being able to merge
things with unrelated CI failure:
-  Asking people to rebase again numerous time, until CI gets green, for
instance https://github.com/apache/airflow/pull/50014. (flaky tests, or
main broken)
- CI Bug equals PR can't be merged until it's resolved and re-run again for
instance https://github.com/apache/airflow/pull/49866 if you take a look at
the number of rebase + close/open PR to rerun CI for a very simple change
it's not great. There was a glitch in the CI for UI only PRs


I like the auto-merge feature but I think losing the ability to force merge
when unrelated failure happens is too much of a downside IMO. I feel like
there were many more cases where I wanted to manually merge things and
couldn't compare to when I marked for auto-merge and it actually succeeded
so I would say it slowed me down in the last couple of days.

I also tend to be -1 on this.

On Wed, Apr 30, 2025 at 3:38 PM Ash Berlin-Taylor <a...@apache.org> wrote:

>
> Why was a workflow change that impacts every. single. PR. not discussed in
> this list before turning it on? I can’t think of a more impactful change to
> committers.
>
> Just last month weren’t you saying we needed to have more discussion about
> impactful change on this list?
>
> > On 30 Apr 2025, at 10:04, Jarek Potiuk <ja...@potiuk.com> wrote:
> > Yeah, And I would encourage everyone to try it and provide feedback
> while
> > it is enabled.
> >
> > So far we identified a few things (and fix them) that made it borderline
> > unusable (bug in workflow for UI-only changes, GitHub starting to 429-us
> > with too many requests, and the mysterious "hanging" of the latest
> > botocore/celery (?) "special tests" -> all that is already addressed in
> > main, and those issues should not happen (hopefully), so I'd say we can
> now
> > "truly" see how it might work.
> >
> > And one comment from my side - indeed, I find it nice actually, but it's
> > definitely not a "deal breaker"- so if others find it too disruptive, I
> am
> > definitely not going to die on this hill, I just thought it does improve
> > the workflow in the way that it allows for mostly "fire and forget" when
> > you approve the workflow, one thing that it definitely improves is that
> you
> > do not have to remember about coming back to merge a request when CI
> > succeeds.
> >
> > J.
> >
> > On Wed, Apr 30, 2025 at 10:58 AM Kaxil Naik <kaxiln...@gmail.com> wrote:
> >
> >> Forgot to note an additional point in Summary: If we find anything
> blocking
> >> us in that period, we will merge
> >> https://github.com/apache/airflow/pull/50009 to disable auto-merge.
> >>
> >> On Wed, 30 Apr 2025 at 14:26, Kaxil Naik <kaxiln...@gmail.com> wrote:
> >>
> >>> Jarek & I discussed it on Slack in #internal-airflow-ci-cd. Summary
> >> below:
> >>> I have a PR to disable it:
> https://github.com/apache/airflow/pull/50009.
> >>> However, given that many countries will be on holiday on May 1 due to
> >>> Labour Day, and some teething issues were fixed yesterday, we will let
> it
> >>> run for a few more days so other committers and contributors can get a
> >>> chance to try it out and share their experience after the
> >> experiment/trial
> >>> is concluded.
> >>> On Wed, 30 Apr 2025 at 13:59, Kaxil Naik <kaxiln...@gmail.com> wrote:
> >>>> Whoops yeah.
> >>>>> Yep. Because it did not have all conversations resolved. We also have
> >>>> "require resolved conversation" as one of the branch protection
> >>>> conditions.
> >>>> I resolved the conversation and it got merged automatically.
> >>>> Let's adapt it when ready though, I don't see any urgency of getting
> >>>> enabling auto-merge or getting it contributed immediately to asf INFRA
> >> when
> >>>> it isn't critical. It is about priortization
> >>>> I'd say - if that is really bothering us - let's shape the feature
> >> rather
> >>>>> than outright reject it.
> >>>> On Wed, 30 Apr 2025 at 12:37, Jarek Potiuk <ja...@potiuk.com> wrote:
> >>>>> On Wed, Apr 30, 2025 at 7:55 AM Kaxil Naik <kaxiln...@gmail.com>
> >> wrote:
> >>>>>> To the point that the original PR is still not merged even after I
> >> had
> >>>>>> re-triggered the failed tests yesterday:
> >>>>>> https://github.com/apache/airflow/pull/49727
> >>>>> Yep. Because it did not have all conversations resolved. We also have
> >>>>> "require resolved conversation" as one of the branch protection
> >>>>> conditions.
> >>>>> I resolved the conversation and it got merged automatically.
> >>>>>> On Wed, 30 Apr 2025 at 11:20, Kaxil Naik <kaxiln...@gmail.com>
> >> wrote:
> >>>>>>> The gitbox escape hatch isn't it though -- if we are to allow that
> >>>>> why
> >>>>>> not
> >>>>>>> just allow people to merge it directly from github that to go via
> >> an
> >>>>>>> "escape hatch".
> >>>>> Generally speaking GitHub has this option. Currently "admins" have a
> >>>>> possibility of overriding branch protection (via UI). And it would be
> >>>>> possible - if INFRA will allow it - to possibly add an .asf.yaml
> >> feature
> >>>>> to
> >>>>> also allow branch protection override for all committers or a subset
> of
> >>>>> the
> >>>>> committers (PMC Members ? ). This is more of a limitation of the
> >> current
> >>>>> implementation of permissions than a missing feature. If we all feel
> >> that
> >>>>> the gitbox escape hatch is not enough, we can likely even contribute
> >>>>> such a
> >>>>> feature to .asf.yaml - if INFRA will be ok with the option. It's very
> >>>>> easy
> >>>>> to contribute to - INFRA made it possible, we have a new framework:
> >>>>> https://github.com/apache/infrastructure-asfyaml - we can even
> >> implement
> >>>>> "airflow-only" .asf.yaml feature, that will not be initially
> available
> >> to
> >>>>> other ASF projects and later we can promote it to be available to
> >>>>> everyone.
> >>>>> I'd say - if that is really bothering us - let's shape the feature
> >> rather
> >>>>> than outright reject it.
> >>>>>>> I am -1 on this auto-merge feature
> >>>>> Understood :). But let's give it a bit more time as well and maybe
> >>>>> improve
> >>>>> it (see above) - unless we really feel we are blocked now - then it
> >>>>> should
> >>>>> be as easy as merging an .asf.yaml change to disable it.
> >>>>>>> On Wed, 30 Apr 2025 at 11:18, Kaxil Naik <kaxiln...@gmail.com>
> >>>>> wrote:
> >>>>>>>> That’s not a single person, it impacts the committers and the PR
> >>>>> author
> >>>>>>>> involved too. I don’t see how team productivity soars here.
> >>>>>>>> On Wed, 30 Apr 2025 at 02:39, Jarek Potiuk <ja...@potiuk.com>
> >>>>> wrote:
> >>>>>>>>> But yes, I also miss the previous "merge because I think it's
> >> safe"
> >>>>>>>>> workflow.
> >>>>>>>>> I badly miss it. Personally, It hurts my productivity.
> >>>>>>>>> But I think the "require status check" to be green is great for
> >>>>> "team
> >>>>>>>>> productivity". Usually when single person is impacted more than
> >>>>> team in
> >>>>>>>>> general, it's worse for the person impacted, but team
> >> productivity
> >>>>>> soars.
> >>>>>>>>> J.
> >>>>>>>>> On Tue, Apr 29, 2025 at 11:03 PM Jarek Potiuk <ja...@potiuk.com>
> >>>>>> wrote:
> >>>>>>>>>> Just to add comment:
> >>>>>>>>>> a) there was some instability of "celery/boto" hanging tests
> >>>>> today
> >>>>>>>>> that is
> >>>>>>>>>> rather difficult to address - but we worked around it by just
> >>>>>> removing
> >>>>>>>>>> "special-tests" from pre-requisite of merging
> >>>>>>>>>> b) GitHub today (like literally today!) started to be picky on
> >>>>> "too
> >>>>>>>>> many
> >>>>>>>>>> requests" - I addressed it today for both helm chart and our
> >>>>> release
> >>>>>>>>> tests
> >>>>>>>>>> (we are using bearer-token to authenticate and increase the
> >>>>> limit -
> >>>>>>>>> and we
> >>>>>>>>>> added cache on downloading json schema that was downloaded
> >>>>> "per-test"
> >>>>>>>>>> c) in cases like the one mentioned above with intermittent
> >>>>> failures -
> >>>>>>>>>> simple "rerun failed jobs" (assuming it will succeed after
> >>>>> rerun) -
> >>>>>> is
> >>>>>>>>>> essentially equivalent of "merge" (unless it fails again which
> >>>>> for me
> >>>>>>>>> is a
> >>>>>>>>>> signal of "DO NOT MERGE")
> >>>>>>>>>> d) we always have the "gitbox" escape hatch - that allows any
> >>>>>>>>> committer to
> >>>>>>>>>> push the fix directly, bypassing the limits:
> >>>>>>>>>> This is a simple thing for committers:
> >>>>>>>>>> git add remote gitbox
> >>>>>> https://gitbox.apache.org/repos/asf/airflow.git
> >>>>>>>>>> git fetch gitbox
> >>>>>>>>>> git commit --amend ("add #PR number")
> >>>>>>>>>> git push gitbox BRANCH_NAME:main (you need to provide your
> >>>>> apache id
> >>>>>>>>> and
> >>>>>>>>>> password)
> >>>>>>>>>> This is a nice escape hatch that we can use as "exceptional
> >>>>>> workflow" -
> >>>>>>>>>> and it works - I did it quite a few times over the last few
> >>>>> days. Not
> >>>>>>>>> UI
> >>>>>>>>>> controlled, but IMHO exceptional workflow should be -  well -
> >>>>>>>>> exceptional.
> >>>>>>>>>> J.
> >>>>>>>>>> On Tue, Apr 29, 2025 at 8:52 PM Kaxil Naik <
> >> kaxiln...@gmail.com>
> >>>>>>>>> wrote:
> >>>>>>>>>>> Similar experience as Elad, I am in favor of disabling it tbh.
> >>>>> For
> >>>>>>>>>>> example,
> >>>>>>>>>>> https://github.com/apache/airflow/pull/49727 has a failing
> >>>>> test as
> >>>>>>>>> below
> >>>>>>>>>>> --
> >>>>>>>>>>> which is not an issue, and test passes locally so I would want
> >>>>> to
> >>>>>>>>> merge it
> >>>>>>>>>>> but I can't.
> >>>>>>>>>>> FAILED
> >>
> helm-tests/tests/helm_tests/airflow_aux/test_basic_helm_chart.py::TestBaseChartTest::test_priority_classes
> >>>>>>>>>>> - requests.exceptions.HTTPError: 429 Client Error: Too Many
> >>>>> Requests
> >>>>>>>>> for
> >>>>>>>>>>> url:
> >>
> https://raw.githubusercontent.com/yannh/kubernetes-json-schema/master/v1.29.1-standalone-strict/priorityclass-scheduling-v1.json
> >>>>>>>>>>> On Tue, 29 Apr 2025 at 18:29, Jarek Potiuk <ja...@potiuk.com>
> >>>>>> wrote:
> >>>>>>>>>>>> On Tue, Apr 29, 2025 at 1:46 PM Elad Kalif <
> >>>>> elad...@apache.org>
> >>>>>>>>> wrote:
> >>>>>>>>>>>>> Thanks for that Jarek!
> >>>>>>>>>>>>> I find the lack of ability to merge PRs fast very limiting
> >>>>> but
> >>>>>> it
> >>>>>>>>>>> might
> >>>>>>>>>>>> be
> >>>>>>>>>>>>> just something to get used to.
> >>>>>>>>>>>> Indeed. I also see it, but also I got a few manually pushed
> >>>>> "must
> >>>>>>>>> fix
> >>>>>>>>>>>> quickly" to gitbox, and actually I find it really nice -
> >>>>> because
> >>>>>> it
> >>>>>>>>> is
> >>>>>>>>>>>> still possible, but it require some extra effort and
> >>>>> deliberate
> >>>>>> "ok
> >>>>>>>>> that
> >>>>>>>>>>>> one really should be pushed to unblock everyone" - as long
> >> as
> >>>>> we
> >>>>>> all
> >>>>>>>>>>>> (especially those people that are active in the
> >>>>>>>>>>>> #internal-airfow-ci-cd channel) know how to do it and can
> >> fix
> >>>>>> things
> >>>>>>>>>>>> quickly, this is actually a nice way to make it into
> >>>>> "exceptional"
> >>>>>>>>>>> workflow
> >>>>>>>>>>>> - which will push us more in making sure airflow main is
> >>>>> really
> >>>>>>>>> "green"
> >>>>>>>>>>> -
> >>>>>>>>>>>> which ultimately is our goal to make it as green as possible
> >>>>> all
> >>>>>> the
> >>>>>>>>>>> time.
> >>>>>>>>>>>> What **might help with that** (and also keeping the "enable
> >>>>> auto
> >>>>>>>>> merge"
> >>>>>>>>>>>> might motivate it more to) is to:
> >>>>>>>>>>>> * speed up the builds - we MUST prioritise now ARC (K8S
> >>>>>> self-hosted
> >>>>>>>>>>>> runners) to make our builds simply faster - I started a
> >>>>> discussion
> >>>>>>>>> and a
> >>>>>>>>>>>> small group of people who can work together to complete it
> >>>>> after
> >>>>>>>>>>> Hussein's
> >>>>>>>>>>>> POC)
> >>>>>>>>>>>> * speed up the image release - with ARM runners (which we
> >>>>> might be
> >>>>>>>>> able
> >>>>>>>>>>> to
> >>>>>>>>>>>> do independently as recently I think we have
> >>>>> hypervisor-enabled
> >>>>>> ARM
> >>>>>>>>>>> images
> >>>>>>>>>>>> available as public runners as github made it generally
> >>>>>> available).
> >>>>>>>>>>>> * speed up the doc builds for airflow-site - we (mainly
> >>>>> Pavan) are
> >>>>>>>>>>> close to
> >>>>>>>>>>>> complete offloading of the historical release docs to S3
> >> and I
> >>>>>> hope
> >>>>>>>>> it
> >>>>>>>>>>> will
> >>>>>>>>>>>> cut down a lot on doc publishing workflows.
> >>>>>>>>>>>> J,
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: dev-unsubscr...@airflow.apache.org
> For additional commands, e-mail: dev-h...@airflow.apache.org
>
>

Reply via email to