[ https://issues.apache.org/jira/browse/IMPALA-13926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Riza Suminto resolved IMPALA-13926. ----------------------------------- Fix Version/s: Impala 5.0.0 Target Version: Impala 5.0.0 Resolution: Fixed > Test TestWorkloadManagementInitNoWait failed in arm s3 builds > ------------------------------------------------------------- > > Key: IMPALA-13926 > URL: https://issues.apache.org/jira/browse/IMPALA-13926 > Project: IMPALA > Issue Type: Bug > Components: Backend > Reporter: Yida Wu > Assignee: Riza Suminto > Priority: Major > Labels: broken-build, test-failure > Fix For: Impala 5.0.0 > > > TestWorkloadManagementInitNoWait has been observed to fail in certain builds, > seems related to arm, s3, and data cache, with the following error messages. > {code:java} > custom_cluster.test_workload_mgmt_init.TestWorkloadManagementInitNoWait.test_start_invalid_version > {code} > Error Message > {code:java} > test setup failure > {code} > Stacktrace > {code:java} > custom_cluster/test_workload_mgmt_init.py:533: in teardown_method > self.wait_for_wm_idle() > common/custom_cluster_test_suite.py:446: in wait_for_wm_idle > "impala-server.completed-queries.queued", 0, timeout=timeout_s, > interval=1) > common/impala_service.py:147: in wait_for_metric_value > self.__metric_timeout_assert(metric_name, expected_value, timeout, value) > common/impala_service.py:183: in __metric_timeout_assert > self.dump_debug_webpage_json(debug_page, json_filename) > common/impala_service.py:88: in dump_debug_webpage_json > debug_json = self.get_debug_webpage_json(page_name) > common/impala_service.py:83: in get_debug_webpage_json > return json.loads(self.read_debug_webpage(page_name + "?json")) > common/impala_service.py:79: in read_debug_webpage > return self.open_debug_webpage(page_name, timeout=timeout, > interval=interval).text > common/impala_service.py:76: in open_debug_webpage > assert 0, 'Debug webpage did not become available in expected time.' > E AssertionError: Debug webpage did not become available in expected time. > {code} > Standard Error > {code:java} > -- 2025-03-29 19:10:02,456 INFO MainThread: Created temporary dir > /data/jenkins/workspace/impala-cdw-master-core-s3-arm-data-cache/repos/Impala/logs/custom_cluster_tests/impala_test_minidumps_QqiGby > -- 2025-03-29 19:10:02,456 INFO MainThread: Starting cluster with > command: > /data/jenkins/workspace/impala-cdw-master-core-s3-arm-data-cache/repos/Impala/bin/start-impala-cluster.py > '--state_store_args=--statestore_update_frequency_ms=50 > --statestore_priority_update_frequency_ms=50 > --statestore_heartbeat_frequency_ms=50' --cluster_size=1 --num_coordinators=1 > --log_dir=/data/jenkins/workspace/impala-cdw-master-core-s3-arm-data-cache/repos/Impala/logs/custom_cluster_tests > --log_level=1 '--impalad_args=--enable_workload_mgmt=true > --query_log_write_interval_s=1 --shutdown_grace_period_s=0 > --shutdown_deadline_s=60 --logbuflevel=-1 --workload_mgmt_schema_version=foo > --minidump_path=/data/jenkins/workspace/impala-cdw-master-core-s3-arm-data-cache/repos/Impala/logs/custom_cluster_tests/impala_test_minidumps_QqiGby > ' '--state_store_args=--logbuflevel=-1 ' > '--catalogd_args=--enable_workload_mgmt=true --logbuflevel=-1 > --workload_mgmt_schema_version=foo > --minidump_path=/data/jenkins/workspace/impala-cdw-master-core-s3-arm-data-cache/repos/Impala/logs/custom_cluster_tests/impala_test_minidumps_QqiGby > ' '--admissiond_args=--logbuflevel=-1 ' > --impalad_args=--default_query_options= > 19:10:02 MainThread: Found 0 impalad/0 statestored/0 catalogd process(es) > 19:10:02 MainThread: Starting State Store logging to > /data/jenkins/workspace/impala-cdw-master-core-s3-arm-data-cache/repos/Impala/logs/custom_cluster_tests/statestored.INFO > 19:10:03 MainThread: Starting Catalog Service logging to > /data/jenkins/workspace/impala-cdw-master-core-s3-arm-data-cache/repos/Impala/logs/custom_cluster_tests/catalogd.INFO > 19:10:03 MainThread: Starting Impala Daemon logging to > /data/jenkins/workspace/impala-cdw-master-core-s3-arm-data-cache/repos/Impala/logs/custom_cluster_tests/impalad.INFO > 19:10:05 MainThread: Found 1 impalad/1 statestored/1 catalogd process(es) > 19:10:05 MainThread: Waiting for Impalad webserver port 25000 > 19:10:05 MainThread: Waiting for Impalad webserver port 25000 > 19:10:06 MainThread: Waiting for Impalad webserver port 25000 > 19:10:06 MainThread: Waiting for Impalad webserver port 25000 > 19:10:07 MainThread: Waiting for Impalad webserver port 25000 > 19:10:07 MainThread: Waiting for Impalad webserver port 25000 > 19:10:08 MainThread: Waiting for Impalad webserver port 25000 > 19:10:08 MainThread: Waiting for Impalad webserver port 25000 > 19:10:09 MainThread: Waiting for Impalad webserver port 25000 > 19:10:09 MainThread: Waiting for Impalad webserver port 25000 > 19:10:09 MainThread: Error starting cluster > Traceback (most recent call last): > File > "/data/jenkins/workspace/impala-cdw-master-core-s3-arm-data-cache/repos/Impala/bin/start-impala-cluster.py", > line 1190, in <module> > impala_cluster.wait_until_ready(expected_cluster_size, > expected_num_ready_impalads) > File > "/data/jenkins/workspace/impala-cdw-master-core-s3-arm-data-cache/repos/Impala/tests/common/impala_cluster.py", > line 239, in wait_until_ready > impalad.wait_for_webserver(sleep_interval, check_processes_still_running) > File > "/data/jenkins/workspace/impala-cdw-master-core-s3-arm-data-cache/repos/Impala/tests/common/impala_cluster.py", > line 636, in wait_for_webserver > early_abort_fn() > File > "/data/jenkins/workspace/impala-cdw-master-core-s3-arm-data-cache/repos/Impala/tests/common/impala_cluster.py", > line 234, in check_processes_still_running > assert self.catalogd is not None > AssertionError > -- 2025-03-29 19:10:09,861 DEBUG MainThread: Found 1 impalad/1 > statestored/0 catalogd process(es) > -- 2025-03-29 19:10:09,863 INFO MainThread: Expected log lines could not > be found, sleeping before retrying: Expected 1 lines in file > /data0/jenkins/workspace/impala-cdw-master-core-s3-arm-data-cache/repos/Impala/logs/custom_cluster_tests/impalad.impala-ec2-rhel88-m7g-4xlarge-ondemand-11dc.vpc.cloudera.com.jenkins.log.FATAL.20250329-144759.294272 > matching regex 'Invalid workload management schema version 'foo'', but found > 0 lines. Last line was: > . Impalad exiting. > -- 2025-03-29 19:10:10,864 INFO MainThread: Expected log lines could not > be found, sleeping before retrying: Expected 1 lines in file > /data0/jenkins/workspace/impala-cdw-master-core-s3-arm-data-cache/repos/Impala/logs/custom_cluster_tests/impalad.impala-ec2-rhel88-m7g-4xlarge-ondemand-11dc.vpc.cloudera.com.jenkins.log.FATAL.20250329-144759.294272 > matching regex 'Invalid workload management schema version 'foo'', but found > 0 lines. Last line was: > . Impalad exiting. > {code} -- This message was sent by Atlassian Jira (v8.20.10#820010)