Thanks a lot, Stamatis, for starting this thread. I really appreciate all
the efforts to stabilize branch-3 and get it to a releasable state, and I
agree that we should get it green before opening it to PRs not related to
test failures. I can help with the effort as well.

If we want to get the branch back to a green state soon, have we considered
disabling the tests which are clearly flaky (e.g., they pass on some builds
and fail on others with no new code changes)? If we don't do that, we will
keep playing whack-a-mole with those tests. I propose that we disable such
tests and create tickets to unflake them separately. This will help us get
back to a green state faster.
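
To make "clearly flaky" concrete, here is a minimal sketch of the rule above:
a test is a candidate if it both passed and failed on the same commit. The
function name and the (commit, test_name, passed) tuple layout are
assumptions for illustration, not the format our CI actually exports.

```python
from collections import defaultdict


def find_flaky_tests(runs):
    """Return test names that both passed and failed on the same commit.

    `runs` is an iterable of (commit, test_name, passed) tuples. This layout
    is hypothetical; adapt it to whatever the CI results actually look like.
    """
    outcomes = defaultdict(set)
    for commit, test_name, passed in runs:
        outcomes[(commit, test_name)].add(bool(passed))
    # Both True and False recorded for one (commit, test) pair means no code
    # change separates the pass from the failure: a flaky candidate.
    return sorted({test for (_, test), seen in outcomes.items() if len(seen) == 2})


if __name__ == "__main__":
    sample_runs = [
        ("abc123", "TestA", True),
        ("abc123", "TestA", False),   # same commit, different outcome
        ("abc123", "TestB", False),
        ("def456", "TestB", True),    # different commit, so not counted
    ]
    print(find_flaky_tests(sample_runs))
```

Tests flagged this way would be the ones to disable and track in separate
tickets; tests that fail consistently on a commit still need a real fix.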

Hi Aman,
For the TestMiniSparkOnYarnCliDriver failures, you should probably also look
into the Spark driver/application logs and see if there are infrastructure
errors (e.g., OOMs). Are these tests failing when you run them locally?
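
As a starting point for that log check, here is a minimal sketch that scans
log lines for common out-of-memory markers. The pattern list is an assumption
and may need extending for your environment; the helper name is hypothetical.

```python
import re

# Illustrative markers only (an assumption, not an exhaustive list):
# the JVM's OutOfMemoryError plus typical YARN memory-limit messages.
OOM_PATTERNS = re.compile(
    r"java\.lang\.OutOfMemoryError"
    r"|exceeding memory limits"
    r"|running beyond (?:physical|virtual) memory limits"
)


def find_oom_lines(log_lines):
    """Return (line_number, text) pairs for lines matching an OOM marker."""
    return [
        (number, line.rstrip("\n"))
        for number, line in enumerate(log_lines, start=1)
        if OOM_PATTERNS.search(line)
    ]


if __name__ == "__main__":
    sample = [
        "INFO starting executor\n",
        "ERROR java.lang.OutOfMemoryError: Java heap space\n",
    ]
    for number, text in find_oom_lines(sample):
        print(f"{number}: {text}")
```

To run it against a real file, pass an open file object, for example
`find_oom_lines(open(path))` where `path` points at the driver log.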

Thanks,
Vihang

On Tue, Feb 7, 2023 at 10:05 PM Aman Raj <raja...@microsoft.com.invalid>
wrote:

> +1,
> Thanks, Stamatis and Laszlo, for helping with the test case fixes so far.
>
> Team,
> I need help fixing the following test in Hive. I have tried different
> approaches, but with no luck so far:
> org.apache.hadoop.hive.cli.TestMiniSparkOnYarnCliDriver
>
> Issue:
> PREHOOK: Input: default@src
> PREHOOK: Output: default@src
> Failed to monitor Job[-1] with exception
> 'java.lang.IllegalStateException(Connection to remote Spark driver was
> lost)' Last known state = SENT
> Failed to execute spark task, with exception
> 'java.lang.IllegalStateException(RPC channel is closed.)'
> FAILED: Execution Error, return code 1 from
> org.apache.hadoop.hive.ql.exec.spark.SparkTask. RPC channel is closed.
>
> History:
> Initially the tests failed with errors that I fixed in the following
> task: https://issues.apache.org/jira/browse/HIVE-26940
>
> Does anyone know what the issue is here? There are 6-7 failures because
> of this test case. Link to the failed test cases for the stack traces:
> http://ci.hive.apache.org/blue/organizations/jenkins/hive-precommit/detail/PR-3949/2/tests/
> Thanks,
> Aman.
>
> ________________________________
> From: László Bodor <bodorlaszlo0...@gmail.com>
> Sent: Tuesday, February 7, 2023 4:46 PM
> To: dev@hive.apache.org <dev@hive.apache.org>
> Subject: [EXTERNAL] Re: Branch-3 backports and build stability
>
> +1
> also, if I merged something that I thought was for test stability (but
> instead it was a feature), excuse me :)
> for reference, the whole green test initiative is tracked under this
> umbrella:
> https://issues.apache.org/jira/browse/HIVE-26836
>
> Stamatis Zampetakis <zabe...@gmail.com> wrote (on Tue, Feb 7, 2023,
> 12:09):
>
> > Hi all,
> >
> > The build in branch-3 is not yet green; there are ~25 test failures. It
> > is a common practice that we shouldn't push changes on top of a broken
> > build unless they are addressing test failures.
> >
> > Some people (mainly Aman Raj, Chris Nauroth, and Laszlo Bodor) have been
> > working hard to stabilize the build for quite some time now. If you want
> > to help out, then start by reviewing, merging, and fixing things around
> > test failures.
> >
> > It's not yet time to bring new features, upgrades, bug fixes, etc., into
> > branch-3. I would encourage committers not to approve such changes until
> > we get back to a stable branch.
> >
> > Best,
> > Stamatis
> >
>
