+1 Default retry on failing test should help. Another way to identify test
issues is to run them in the same batch as being run in the precommit job.

On Tue, May 15, 2018 at 4:11 PM, Deepak Jaiswal <djais...@hortonworks.com>
wrote:

> +1
>
> On 5/15/18, 4:06 PM, "Jesus Camacho Rodriguez" <jcama...@apache.org>
> wrote:
>
>     That is already part of the policy although it apparently remained
> unnoticed:
>
>     *If a commit introduces new test failures, the preferred process is to
> revert the patch, rather than opening a new JIRA to fix the new failures.*
>
>     It can be reworded to be stricter... But in any case, we should all
> enforce it from now on.
>
>     -Jesús
>
>
>     On 5/15/18, 3:55 PM, "Sergey Shelukhin" <ser...@hortonworks.com>
> wrote:
>
>         +1. Can we also add something about revert-first policy if some
> patch
>         breaks tests?
>         So that it’s ok to revert if the tests aren’t fixed quickly with a
>         follow-up.
>
>         On 18/5/15, 13:08, "Jesus Camacho Rodriguez" <jcama...@apache.org>
> wrote:
>
>         >I was hoping that by being stricter, we are going to make an
> effort to
>         >fix the flaky
>         >tests. The reason is precisely what you mention: if you cannot
> commit,
>         >you need
>         >to fix this situation. That is what I meant with providing an
> additional
>         >incentive to
>         >make testing more robust: currently there is no incentive. I do
> not think
>         >that
>         >improving tests is a responsibility of a single developer, but
> rather a
>         >responsibility
>         >of all of us. Disabling the tests is a one-time solution to get
> to a
>         >clean run, trying to
>         >accelerate the process to get to it, as we did not want to block
>         >development for
>         >weeks. Then flaky tests should just not go in, and if they do, we
> can
>         >just revert
>         >the patch (this is what [2] says btw).
>         >
>         >The first thing we need to do is identifying why a test is flaky.
> After
>         >examining runs
>         >for the last few days, I saw many of them fall in following
> categories:
>         >- Many of them are flaky because of estimations such as data
> size. One
>         >possible
>         >solution is to mask data size for those tests, as we already mask
> some
>         >environment
>         >dependent information.
>         >- Some of them are flaky because environment issues, e.g., I see
> this a
>         >lot with
>         >TestTriggersMoveWorkloadManager. If their logic cannot be
> rewritten, a
>         >possible
>         >solution is to add a max number of retries selectively for those
> tests
>         >(surefire has
>         >a rerunFailingTestsCount option that I am not familiar with),
> expecting
>         >that they
>         >pass at least once.
>         >
>         >Not sure if you have other ideas?
>         >
>         >-Jesús
>         >
>         >
>         >On 5/15/18, 11:59 AM, "Vihang Karajgaonkar" <vih...@cloudera.com>
> wrote:
>         >
>         >    Can we also define a standard process to identify a flaky
> test and
>         >thereby
>         >    making it eligible to be disabled? I am worried that the
> intermittent
>         >the
>         >    flaky ones will stall the patches when we restart allowing the
>         >commits.
>         >
>         >    On Tue, May 15, 2018 at 10:50 AM, Vineet Garg <
> vg...@hortonworks.com>
>         >wrote:
>         >
>         >    > +1
>         >    >
>         >    > > On May 15, 2018, at 9:13 AM, Alan Gates <
> alanfga...@gmail.com>
>         >wrote:
>         >    > >
>         >    > > +1.
>         >    > >
>         >    > > Alan.
>         >    > >
>         >    > > On Tue, May 15, 2018 at 9:12 AM, Sergio Pena
>         ><sergio.p...@cloudera.com>
>         >    > > wrote:
>         >    > >
>         >    > >> +1
>         >    > >>
>         >    > >> On Tue, May 15, 2018 at 11:05 AM, Gunther Hagleitner <
>         >    > >> ghagleit...@hortonworks.com> wrote:
>         >    > >>
>         >    > >>> +1
>         >    > >>> ________________________________________
>         >    > >>> From: Sankar Hariappan <shariap...@hortonworks.com>
>         >    > >>> Sent: Tuesday, May 15, 2018 9:03 AM
>         >    > >>> To: dev@hive.apache.org
>         >    > >>> Subject: Re: [VOTE] Stricter commit guidelines
>         >    > >>>
>         >    > >>> +1
>         >    > >>>
>         >    > >>>
>         >    > >>> On 15/05/18, 9:30 PM, "Sahil Takiar" <
> takiar.sa...@gmail.com>
>         >wrote:
>         >    > >>>
>         >    > >>>> +1
>         >    > >>>>
>         >    > >>>> On Tue, May 15, 2018 at 10:56 AM, Owen O'Malley <
>         >    > owen.omal...@gmail.com
>         >    > >>>
>         >    > >>>> wrote:
>         >    > >>>>
>         >    > >>>>> +1
>         >    > >>>>>
>         >    > >>>>> On Tue, May 15, 2018 at 8:55 AM, Peter Vary
>         ><pv...@cloudera.com>
>         >    > >> wrote:
>         >    > >>>>>
>         >    > >>>>>> +1 - Hoping for something like this for a long
> while! Thanks
>         >for
>         >    > >>> taking
>         >    > >>>>>> this up all!
>         >    > >>>>>>
>         >    > >>>>>>> On May 15, 2018, at 5:44 PM, Jesus Camacho
> Rodriguez <
>         >    > >>>>>> jcama...@apache.org> wrote:
>         >    > >>>>>>>
>         >    > >>>>>>> Forgot to mention the length of the vote in original
>         >message.
>         >    > >>>>>>>
>         >    > >>>>>>> Let's leave the vote open for a shorter period than
> usual,
>         >for
>         >    > >>> instance
>         >    > >>>>>> 48 hours, i.e., till Wednesday 10pm PST. Situation
> can only
>         >get
>         >    > >> worse
>         >    > >>>>> than
>         >    > >>>>>> it is now if we do not take action for a longer
> period.
>         >    > >>>>>>>
>         >    > >>>>>>> As Alan suggested, vote passes if there is a lazy
> majority
>         >(at
>         >    > >>> least 3
>         >    > >>>>>> votes, more +1s than -1s).
>         >    > >>>>>>>
>         >    > >>>>>>> Thanks,
>         >    > >>>>>>> Jesús
>         >    > >>>>>>>
>         >    > >>>>>>>
>         >    > >>>>>>> On 5/15/18, 8:37 AM, "Andrew Sherman"
>         ><asher...@cloudera.com>
>         >    > >>> wrote:
>         >    > >>>>>>>
>         >    > >>>>>>>   +1
>         >    > >>>>>>>
>         >    > >>>>>>>   On Tue, May 15, 2018 at 2:34 AM Rui Li
>         ><lirui.fu...@gmail.com>
>         >    > >>>>> wrote:
>         >    > >>>>>>>
>         >    > >>>>>>>> +1
>         >    > >>>>>>>>
>         >    > >>>>>>>> On Tue, May 15, 2018 at 2:24 PM, Prasanth
> Jayachandran <
>         >    > >>>>>>>> pjayachand...@hortonworks.com> wrote:
>         >    > >>>>>>>>
>         >    > >>>>>>>>> +1
>         >    > >>>>>>>>>
>         >    > >>>>>>>>>
>         >    > >>>>>>>>>
>         >    > >>>>>>>>> Thanks
>         >    > >>>>>>>>> Prasanth
>         >    > >>>>>>>>>
>         >    > >>>>>>>>>
>         >    > >>>>>>>>>
>         >    > >>>>>>>>> On Mon, May 14, 2018 at 10:44 PM -0700, "Jesus
> Camacho
>         >    > >> Rodriguez"
>         >    > >>> <
>         >    > >>>>>>>>> jcama...@apache.org<mailto:jcama...@apache.org>>
> wrote:
>         >    > >>>>>>>>>
>         >    > >>>>>>>>>
>         >    > >>>>>>>>> After work has been done to ignore most of the
> tests that
>         >were
>         >    > >>>>> failing
>         >    > >>>>>>>>> consistently/intermittently [1], I wanted to
> start this
>         >vote to
>         >    > >>>>> gather
>         >    > >>>>>>>>> support from the community to be stricter wrt
> committing
>         >patches
>         >    > >>> to
>         >    > >>>>>> Hive.
>         >    > >>>>>>>>> The committers guide [2] already specifies that a
> +1
>         >should be
>         >    > >>>>> obtained
>         >    > >>>>>>>>> before committing, but there is another clause
> that allows
>         >    > >>> committing
>         >    > >>>>>>>> under
>         >    > >>>>>>>>> the presence of flaky tests (clause 4). Flaky
> tests are
>         >as good
>         >    > >> as
>         >    > >>>>>> having
>         >    > >>>>>>>>> no tests, hence I propose to remove clause 4 and
> enforce
>         >the +1
>         >    > >>> from
>         >    > >>>>>>>>> testing infra before committing.
>         >    > >>>>>>>>>
>         >    > >>>>>>>>>
>         >    > >>>>>>>>>
>         >    > >>>>>>>>> As I see it, by enforcing that we always get a +1
> from the
>         >    > >> testing
>         >    > >>>>>> infra
>         >    > >>>>>>>>> before committing, 1) we will have a more stable
> project,
>         >and 2)
>         >    > >>> we
>         >    > >>>>>> will
>         >    > >>>>>>>>> have another incentive as a community to create a
> more
>         >robust
>         >    > >>> testing
>         >    > >>>>>>>>> infra, e.g., replacing flaky tests for similar
> unit tests
>         >that
>         >    > >> are
>         >    > >>>>> not
>         >    > >>>>>>>>> flaky, trying to decrease running time for tests,
> etc.
>         >    > >>>>>>>>>
>         >    > >>>>>>>>>
>         >    > >>>>>>>>>
>         >    > >>>>>>>>> Please, share your thoughts about this.
>         >    > >>>>>>>>>
>         >    > >>>>>>>>>
>         >    > >>>>>>>>>
>         >    > >>>>>>>>> Here is my +1.
>         >    > >>>>>>>>>
>         >    > >>>>>>>>>
>         >    > >>>>>>>>>
>         >    > >>>>>>>>> Thanks,
>         >    > >>>>>>>>>
>         >    > >>>>>>>>> Jes?s
>         >    > >>>>>>>>>
>         >    > >>>>>>>>>
>         >    > >>>>>>>>>
>         >    > >>>>>>>>> [1]
>         >http://mail-archives.apache.org/mod_mbox/hive-dev/201805.
>         >    > >>>>>>>>>
>         >mbox/%3C63023673-AEE5-41A9-BA52-5A5DFB2078B6%40apache.org%3E
>         >    > >>>>>>>>>
>         >    > >>>>>>>>> [2] https://cwiki.apache.org/
> confluence/display/Hive/
>         >    > >>>>>>>>> HowToCommit#HowToCommit-PreCommitruns,
> andcommittingpatches
>         >    > >>>>>>>>>
>         >    > >>>>>>>>>
>         >    > >>>>>>>>>
>         >    > >>>>>>>>>
>         >    > >>>>>>>>>
>         >    > >>>>>>>>
>         >    > >>>>>>>>
>         >    > >>>>>>>> --
>         >    > >>>>>>>> Best regards!
>         >    > >>>>>>>> Rui Li
>         >    > >>>>>>>>
>         >    > >>>>>>>
>         >    > >>>>>>>
>         >    > >>>>>>>
>         >    > >>>>>>
>         >    > >>>>>>
>         >    > >>>>>
>         >    > >>>>
>         >    > >>>>
>         >    > >>>>
>         >    > >>>> --
>         >    > >>>> Sahil Takiar
>         >    > >>>> Software Engineer
>         >    > >>>> takiar.sa...@gmail.com | (510) 673-0309
>         >    > >>>
>         >    > >>>
>         >    > >>>
>         >    > >>
>         >    >
>         >    >
>         >
>         >
>         >
>
>
>
>
>
>
>

Reply via email to