[ 
https://issues.apache.org/jira/browse/IMPALA-13799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17930881#comment-17930881
 ] 

ASF subversion and git services commented on IMPALA-13799:
----------------------------------------------------------

Commit bde8cc4ae41fad03e1456ed35a63eaea8e2f9af5 in impala's branch 
refs/heads/master from stiga-huang
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=bde8cc4ae ]

IMPALA-13799: Bumps timeout in waiting for catalog updates in tests

EventProcessorUtils.wait_for_event_processing() is used in tests to wait
for HMS events being processed by catalogd and all impalads receive the
catalog updates. Currently, the timeout in waiting for catalog updates
is 10s. However, there are some e2e tests like
test_overlap_min_max_filters that run DDL/DMLs longer than 10s, which
could block the catalog update for longer than 10s. When this util
method is used in e2e tests, it could be impacted by other concurrent
tests and time out.

This patch deflake the issue by bumping the timeout to be 20s.

Change-Id: If6a785e6d98572bf1a3fa3efc81d712c7ecc488e
Reviewed-on: http://gerrit.cloudera.org:8080/22547
Reviewed-by: Impala Public Jenkins <[email protected]>
Tested-by: Quanlong Huang <[email protected]>


> Flakiness in TestTestcaseBuilder.test_query_with_tbls due to slow catalog 
> updates
> ---------------------------------------------------------------------------------
>
>                 Key: IMPALA-13799
>                 URL: https://issues.apache.org/jira/browse/IMPALA-13799
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Test
>            Reporter: Quanlong Huang
>            Assignee: Quanlong Huang
>            Priority: Critical
>         Attachments: 
> catalogd.cad2c3d4614e.invalid-user.log.INFO.20250225-091541.1.gz
>
>
> Saw TestTestcaseBuilder.test_query_with_tbls failed in an unrelated patch:
> [https://jenkins.impala.io/job/ubuntu-20.04-dockerised-tests/2977/testReport/junit/metadata.test_testcase_builder/TestTestcaseBuilder/test_query_with_tbls/]
> Stacktrace:
> {noformat}
> metadata/test_testcase_builder.py:60: in test_query_with_tbls
>     [unique_database + ".alltypes"], [unique_database + ".alltypes_view"])
> metadata/test_testcase_builder.py:106: in _test_export_and_import
>     EventProcessorUtils.wait_for_event_processing(self)
> util/event_processor_utils.py:93: in wait_for_event_processing
>     impala_cluster, timeout, error_status_possible)
> util/event_processor_utils.py:112: in wait_for_event_processing_impl
>     allow_greater=True)
> common/impala_service.py:145: in wait_for_metric_value
>     self.__metric_timeout_assert(metric_name, expected_value, timeout, value)
> common/impala_service.py:214: in __metric_timeout_assert
>     assert 0, assert_string
> E   AssertionError: Metric catalog.curr-version did not reach value 12461 in 
> 10s. Actual value was '12455'.
> E   Dumping debug webpages in JSON format...{noformat}
> Standard Error:
> {code:java}
> set_option('client_identifier', 
> 'metadata/test_testcase_builder.py::TestTestcaseBuilder::()::test_query_with_tbls')
> -- connecting to: localhost:21000
> set_option('client_identifier', 
> 'metadata/test_testcase_builder.py::TestTestcaseBuilder::()::test_query_with_tbls')
> -- connecting to localhost:21050 with impyla
> set_option('client_identifier', 
> 'metadata/test_testcase_builder.py::TestTestcaseBuilder::()::test_query_with_tbls')
> -- connecting to localhost:28000 with impyla
> set_option('client_identifier', 
> 'metadata/test_testcase_builder.py::TestTestcaseBuilder::()::test_query_with_tbls')
> set_option('client_identifier', 
> 'metadata/test_testcase_builder.py::TestTestcaseBuilder::()::test_query_with_tbls')
> set_option('client_identifier', 
> 'metadata/test_testcase_builder.py::TestTestcaseBuilder::()::test_query_with_tbls')
> set_option('client_identifier', 
> 'metadata/test_testcase_builder.py::TestTestcaseBuilder::()::test_query_with_tbls')
> -- connecting to localhost:21050 with impyla
> set_option('client_identifier', 
> 'metadata/test_testcase_builder.py::TestTestcaseBuilder::()::test_query_with_tbls')
> set_option('sync_ddl', 'False')
> -- executing against Impala at localhost:21050
> DROP DATABASE IF EXISTS `test_query_with_tbls_954fd7ae` CASCADE;
> -- a043740ae0ba33a3:8a68fbb500000000: getting log for operation
> -- a043740ae0ba33a3:8a68fbb500000000: getting runtime profile operation
> -- a043740ae0ba33a3:8a68fbb500000000: closing query for operation
> -- executing against Impala at localhost:21050
> CREATE DATABASE `test_query_with_tbls_954fd7ae`;
> -- 3746f4281f26eb84:4ed9e8ef00000000: getting log for operation
> -- 3746f4281f26eb84:4ed9e8ef00000000: getting runtime profile operation
> -- 3746f4281f26eb84:4ed9e8ef00000000: closing query for operation
> -- 2025-02-25 10:14:42,678 INFO     MainThread: Created database 
> "test_query_with_tbls_954fd7ae" for test ID 
> "metadata/test_testcase_builder.py::TestTestcaseBuilder::()::test_query_with_tbls"
> -- closing connection to: localhost:21050
> -- executing against localhost:21000
> create table test_query_with_tbls_954fd7ae.alltypes partitioned by (`year`, 
> `month`) as select * from functional.alltypes;
> -- 2025-02-25 10:14:42,755 INFO     MainThread: Started query 
> 804b410087862222:6bdf0f3600000000
> -- executing against localhost:21000
> create view test_query_with_tbls_954fd7ae.alltypes_view as select * from 
> test_query_with_tbls_954fd7ae.alltypes;
> -- 2025-02-25 10:14:43,114 INFO     MainThread: Started query 
> ce44c036e726876c:2612e06500000000
> -- executing against localhost:21000
> COPY TESTCASE TO '/tmp' select count(*) from 
> test_query_with_tbls_954fd7ae.alltypes_view;
> -- 2025-02-25 10:14:43,209 INFO     MainThread: Started query 
> 3e4d8fc01a0e5684:32848eb900000000
> set_option('client_identifier', 
> 'metadata/test_testcase_builder.py::TestTestcaseBuilder::()::test_query_with_tbls')
> set_option('explain_level', '2')
> -- executing against localhost:21000
> explain select count(*) from test_query_with_tbls_954fd7ae.alltypes_view;
> -- 2025-02-25 10:14:43,219 INFO     MainThread: Started query 
> 474a11ea3c5669b5:cbb9da1500000000
> -- executing against localhost:21000
> drop table if exists test_query_with_tbls_954fd7ae.alltypes;
> -- 2025-02-25 10:14:43,223 INFO     MainThread: Started query 
> fa43aa6c56b5353c:cf114cec00000000
> -- executing against localhost:21000
> drop view if exists test_query_with_tbls_954fd7ae.alltypes_view;
> -- 2025-02-25 10:14:43,329 INFO     MainThread: Started query 
> 404ec95510164fe8:a341345d00000000
> -- 2025-02-25 10:14:43,625 DEBUG    MainThread: Found 3 impalad/1 
> statestored/1 catalogd process(es)
> -- 2025-02-25 10:14:43,636 INFO     MainThread: Waiting until events 
> processor syncs to event id:43830
> -- 2025-02-25 10:14:44,189 DEBUG    MainThread: Metric last-synced-event-id 
> has reached the desired value: 43830
> -- 2025-02-25 10:14:44,212 INFO     MainThread: Getting metric: 
> catalog.curr-version from ip-172-31-54-194:25000
> -- 2025-02-25 10:14:44,216 INFO     MainThread: Sleeping 1s before next retry.
> -- 2025-02-25 10:14:45,218 INFO     MainThread: Getting metric: 
> catalog.curr-version from ip-172-31-54-194:25000
> -- 2025-02-25 10:14:45,231 INFO     MainThread: Sleeping 1s before next retry.
> -- 2025-02-25 10:14:46,232 INFO     MainThread: Getting metric: 
> catalog.curr-version from ip-172-31-54-194:25000
> -- 2025-02-25 10:14:46,243 INFO     MainThread: Sleeping 1s before next retry.
> -- 2025-02-25 10:14:47,243 INFO     MainThread: Getting metric: 
> catalog.curr-version from ip-172-31-54-194:25000
> -- 2025-02-25 10:14:47,248 INFO     MainThread: Sleeping 1s before next retry.
> -- 2025-02-25 10:14:48,249 INFO     MainThread: Getting metric: 
> catalog.curr-version from ip-172-31-54-194:25000
> -- 2025-02-25 10:14:48,254 INFO     MainThread: Sleeping 1s before next retry.
> -- 2025-02-25 10:14:49,255 INFO     MainThread: Getting metric: 
> catalog.curr-version from ip-172-31-54-194:25000
> -- 2025-02-25 10:14:49,260 INFO     MainThread: Sleeping 1s before next retry.
> -- 2025-02-25 10:14:50,261 INFO     MainThread: Getting metric: 
> catalog.curr-version from ip-172-31-54-194:25000
> -- 2025-02-25 10:14:50,266 INFO     MainThread: Sleeping 1s before next retry.
> -- 2025-02-25 10:14:51,270 INFO     MainThread: Getting metric: 
> catalog.curr-version from ip-172-31-54-194:25000
> -- 2025-02-25 10:14:51,277 INFO     MainThread: Sleeping 1s before next retry.
> -- 2025-02-25 10:14:52,278 INFO     MainThread: Getting metric: 
> catalog.curr-version from ip-172-31-54-194:25000
> -- 2025-02-25 10:14:52,282 INFO     MainThread: Sleeping 1s before next retry.
> -- 2025-02-25 10:14:53,283 INFO     MainThread: Getting metric: 
> catalog.curr-version from ip-172-31-54-194:25000
> -- 2025-02-25 10:14:53,286 INFO     MainThread: Sleeping 1s before next retry.
> -- 2025-02-25 10:14:54,287 INFO     MainThread: Metric catalog.curr-version 
> did not reach value 12461 in 10s. Actual value was '12455'. Failing... {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to