[
https://issues.apache.org/jira/browse/IMPALA-14280?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18012102#comment-18012102
]
Quanlong Huang edited comment on IMPALA-14280 at 8/5/25 1:31 PM:
-----------------------------------------------------------------
Still see failures in cleanup_database due to using the standby catalogd:
{noformat}
conftest.py:425: in cleanup
cleanup_database(client, db_name, True)
conftest.py:411: in cleanup_database
raise e
E OperationalError: Query 8a4a6e561c8ef4dd:2d99792600000000 failed:
E Request for Catalog service is rejected since catalogd
impala-ec2-rhel92-m6i-4xlarge-ondemand-0f67.vpc.cloudera.com:26001 is in
standby mode{noformat}
The test is test_warmed_up_metadata_after_failover. It drops a table, kills the
active catalogd (26001), checks a SHOW TABLES statement, then starts the killed
catalogd which becomes standby. Everything works fine except the above cleanup
error.
The problem is the test finishes too quickly to notice the HA failover. It runs
in the legacy catalog mode which doesn't need catalogd requests to serve the
SHOW TABLES statement. When the test starts the killed catalogd at the end,
coordinator doesn't receive the HA failover update yet. So in the cleanup step,
coordinator still used the old catalogd address.
To be specific, in this failed run of the test, catalogd (26001) is killed at
21:05:36,666
{code:java}
-- 2025-08-04 21:05:36,666 INFO MainThread: Killing <CatalogdProcess PID:
3672816
(/data/jenkins/workspace/impala-cdw-master-staging-core/repos/Impala/be/build/latest/service/catalogd
-logbufsecs=5 -v=1 -max_log_files=0 -log_rotation_match_pid=true
-log_filename=catalogd_node1
-log_dir=/data/jenkins/workspace/impala-cdw-master-staging-core/repos/Impala/logs/custom_cluster_tests
-kudu_master_hosts localhost --logbuflevel=-1
--catalogd_ha_reset_metadata_on_failover=false
--debug_actions=catalogd_event_processing_delay:SLEEP@2000
--enable_reload_events=true
--warmup_tables_config_file=/test-warehouse/warmup_table_list.txt
-catalog_service_port=26001 -state_store_subscriber_port=23021
-webserver_port=25021 -enable_catalogd_ha=true)> with signal 9{code}
Coordinator notices HA failover at 21:05:40.235247
{code:java}
I20250804 21:05:40.235247 3674142 exec-env.cc:789] The address of Catalog
service is changed from
impala-ec2-rhel92-m6i-4xlarge-ondemand-0f67.vpc.cloudera.com:26001 to
impala-ec2-rhel92-m6i-4xlarge-ondemand-0f67.vpc.cloudera.com:26000{code}
The DROP DATABASE statements (including the retried one) were submitted before
that.
{code:java}
I20250804 21:05:37.724413 3674124 Frontend.java:2398]
a1459fe0adc1c1d9:6f7e790900000000] Analyzing query: show tables in
test_warmed_up_metadata_after_failover_452d93b4 db: default
...
I20250804 21:05:39.783475 3674489 Frontend.java:2398]
bf4ae015658a617b:6bff50eb00000000] Analyzing query: DROP DATABASE
`test_warmed_up_metadata_after_failover_452d93b4` CASCADE db: default
...
I20250804 21:05:39.807317 3674490 client-request-state.cc:1414]
bf4ae015658a617b:6bff50eb00000000] Request for Catalog service is rejected
since catalogd
impala-ec2-rhel92-m6i-4xlarge-ondemand-0f67.vpc.cloudera.com:26001 is in
standby mode
...
I20250804 21:05:39.816709 3674489 Frontend.java:2398]
8a4a6e561c8ef4dd:2d99792600000000] Analyzing query: DROP DATABASE
`test_warmed_up_metadata_after_failover_452d93b4` CASCADE db: default
...
I20250804 21:05:39.842710 3674503 client-request-state.cc:1414]
8a4a6e561c8ef4dd:2d99792600000000] Request for Catalog service is rejected
since catalogd
impala-ec2-rhel92-m6i-4xlarge-ondemand-0f67.vpc.cloudera.com:26001 is in
standby mode{code}
One solution is adding a wait after starting the killed catalogd until
coordinator receives the HA failover update.
was (Author: stiga-huang):
Still need to add retries on other statements in cleanup_database. Saw another
failure:
{noformat}
conftest.py:425: in cleanup
cleanup_database(client, db_name, True)
conftest.py:411: in cleanup_database
raise e
E OperationalError: Query 8a4a6e561c8ef4dd:2d99792600000000 failed:
E Request for Catalog service is rejected since catalogd
impala-ec2-rhel92-m6i-4xlarge-ondemand-0f67.vpc.cloudera.com:26001 is in
standby mode{noformat}
> TestCatalogdHA.test_warmed_up_metadata_failover_catchup fails with status
> code assertion errors
> -----------------------------------------------------------------------------------------------
>
> Key: IMPALA-14280
> URL: https://issues.apache.org/jira/browse/IMPALA-14280
> Project: IMPALA
> Issue Type: Bug
> Reporter: Surya Hebbar
> Assignee: Quanlong Huang
> Priority: Major
> Fix For: Impala 5.0.0
>
>
> Error Message
> {code:java}
> assert 404 == 200 + where 404 = <Response [404]>.status_code + and 200
> = <lookup 'status_codes'>.ok + where <lookup 'status_codes'> =
> requests.codes
> {code}
> Stacktrace
> {code:java}
> custom_cluster/test_catalogd_ha.py:643: in
> test_warmed_up_metadata_failover_catchup
> db, self._refresh_table, self._verify_refresh)
> custom_cluster/test_catalogd_ha.py:737: in _test_metadata_after_failover
> (active_catalogd, standby_catalogd) = self.__get_catalogds()
> custom_cluster/test_catalogd_ha.py:110: in __get_catalogds
> assert page.status_code == requests.codes.ok
> E assert 404 == 200
> E + where 404 = <Response [404]>.status_code
> E + and 200 = <lookup 'status_codes'>.ok
> E + where <lookup 'status_codes'> = requests.codes
> {code}
> Standard Output
> {code:java}
> Redirecting stdout to
> /data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/logs/custom_cluster_tests/catalogd.impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com.jenkins.log.INFO.20250730-032306.3703284
> Redirecting stdout to
> /data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/logs/custom_cluster_tests/catalogd.impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com.jenkins.log.INFO.20250730-032337.3703557
> Standard Error
> -- 2025-07-30 03:22:52,492 INFO MainThread: Starting cluster with
> command:
> /data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/bin/start-impala-cluster.py
> '--state_store_args=--statestore_update_frequency_ms=50
> --statestore_priority_update_frequency_ms=50
> --statestore_heartbeat_frequency_ms=50' --cluster_size=3 --num_coordinators=3
> --log_dir=/data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/logs/custom_cluster_tests
> --log_level=1 '--impalad_args=--use_local_catalog=true '
> '--state_store_args=--use_subscriber_id_as_catalogd_priority=true '
> '--catalogd_args=--catalog_topic_mode=minimal
> --catalogd_ha_reset_metadata_on_failover=false
> --debug_actions=catalogd_event_processing_delay:SLEEP@1000
> --enable_reload_events=true
> --warmup_tables_config_file=file:///data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/testdata/data/warmup_test_config.txt
> ' --enable_catalogd_ha --impalad_args=--default_query_options=
> 03:22:53 MainThread: Found 0 impalad/0 statestored/0 catalogd process(es)
> 03:22:53 MainThread: Starting State Store logging to
> /data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/logs/custom_cluster_tests/statestored.INFO
> 03:22:53 MainThread: Starting Catalog Service logging to
> /data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/logs/custom_cluster_tests/catalogd.INFO
> 03:22:53 MainThread: Starting Catalog Service logging to
> /data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/logs/custom_cluster_tests/catalogd_node1.INFO
> 03:22:53 MainThread: Starting Impala Daemon logging to
> /data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/logs/custom_cluster_tests/impalad.INFO
> 03:22:53 MainThread: Starting Impala Daemon logging to
> /data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/logs/custom_cluster_tests/impalad_node1.INFO
> 03:22:53 MainThread: Starting Impala Daemon logging to
> /data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/logs/custom_cluster_tests/impalad_node2.INFO
> 03:22:55 MainThread: Found 3 impalad/1 statestored/2 catalogd process(es)
> 03:22:55 MainThread: Waiting for Impalad webserver port 25000
> 03:22:55 MainThread: Waiting for Impalad webserver port 25000
> 03:22:56 MainThread: Waiting for Impalad webserver port 25000
> 03:22:56 MainThread: Waiting for Impalad webserver port 25000
> 03:22:56 MainThread: Waiting for Impalad webserver port 25001
> 03:22:56 MainThread: Waiting for Impalad webserver port 25002
> 03:22:58 MainThread: Waiting for coordinator client services - hs2 port:
> 21050 hs2-http port: 28000 beeswax port: 21000
> 03:23:00 MainThread: Waiting for coordinator client services - hs2 port:
> 21051 hs2-http port: 28001 beeswax port: 21001
> 03:23:02 MainThread: Waiting for coordinator client services - hs2 port:
> 21052 hs2-http port: 28002 beeswax port: 21002
> 03:23:02 MainThread: Getting num_known_live_backends from
> impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:25000
> 03:23:02 MainThread: num_known_live_backends has reached value: 3
> 03:23:02 MainThread: Getting num_known_live_backends from
> impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:25001
> 03:23:02 MainThread: num_known_live_backends has reached value: 3
> 03:23:02 MainThread: Getting num_known_live_backends from
> impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:25002
> 03:23:02 MainThread: num_known_live_backends has reached value: 3
> 03:23:02 MainThread: Total wait: 7.64s
> 03:23:02 MainThread: Impala Cluster Running with 3 nodes (3 coordinators, 3
> executors).
> -- 2025-07-30 03:23:02,956 DEBUG MainThread: Found 3 impalad/1
> statestored/2 catalogd process(es)
> -- 2025-07-30 03:23:02,956 INFO MainThread: Getting metric:
> statestore.live-backends from
> impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:25010
> -- 2025-07-30 03:23:02,959 INFO MainThread: Metric
> 'statestore.live-backends' has reached desired value: 5. total_wait: 0s
> -- 2025-07-30 03:23:02,959 DEBUG MainThread: Getting
> num_known_live_backends from
> impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:25000
> -- 2025-07-30 03:23:02,960 INFO MainThread: num_known_live_backends has
> reached value: 3
> -- 2025-07-30 03:23:02,961 DEBUG MainThread: Getting
> num_known_live_backends from
> impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:25001
> -- 2025-07-30 03:23:02,962 INFO MainThread: num_known_live_backends has
> reached value: 3
> -- 2025-07-30 03:23:02,962 DEBUG MainThread: Getting
> num_known_live_backends from
> impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:25002
> -- 2025-07-30 03:23:02,964 INFO MainThread: num_known_live_backends has
> reached value: 3
> -- 2025-07-30 03:23:02,964 INFO MainThread: beeswax:
> set
> client_identifier=custom_cluster/test_catalogd_ha.py::TestCatalogdHA::()::test_warmed_up_metadata_failover_catchup;
> -- 2025-07-30 03:23:02,964 INFO MainThread: beeswax: connected to
> impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:21000 with
> beeswax
> -- 2025-07-30 03:23:02,964 INFO MainThread: hs2:
> set
> client_identifier=custom_cluster/test_catalogd_ha.py::TestCatalogdHA::()::test_warmed_up_metadata_failover_catchup;
> -- 2025-07-30 03:23:02,964 INFO MainThread: hs2: connected to
> impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:21050 with
> impyla hs2
> -- 2025-07-30 03:23:02,964 INFO MainThread: hs2-http:
> set
> client_identifier=custom_cluster/test_catalogd_ha.py::TestCatalogdHA::()::test_warmed_up_metadata_failover_catchup;
> -- 2025-07-30 03:23:02,965 INFO MainThread: hs2-http: connected to
> impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:28000 with
> impyla hs2-http
> -- 2025-07-30 03:23:02,965 INFO MainThread: hs2-feng:
> set
> client_identifier=custom_cluster/test_catalogd_ha.py::TestCatalogdHA::()::test_warmed_up_metadata_failover_catchup;
> -- 2025-07-30 03:23:02,965 INFO MainThread: hs2-feng: connected to
> impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:11050 with
> impyla hs2-feng
> -- 2025-07-30 03:23:02,967 INFO MainThread: hs2: executing against Impala
> at impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:21050.
> session: 7b48bf0f176821be:dc50e9789a659d95 main_cursor: True user: None
> create database if not exists warmup_test_db;
> -- 2025-07-30 03:23:03,414 INFO MainThread:
> 7f4a5c54454939d0:4aba16f200000000: query started
> -- 2025-07-30 03:23:03,415 INFO MainThread:
> 7f4a5c54454939d0:4aba16f200000000: getting log for operation
> -- 2025-07-30 03:23:03,416 INFO MainThread:
> 7f4a5c54454939d0:4aba16f200000000: getting runtime profile operation
> -- 2025-07-30 03:23:03,416 INFO MainThread:
> 7f4a5c54454939d0:4aba16f200000000: closing query for operation
> -- 2025-07-30 03:23:03,447 INFO MainThread: hs2: executing against Impala
> at impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:21050.
> session: 7b48bf0f176821be:dc50e9789a659d95 main_cursor: True user: None
> create table warmup_test_db.tbl like functional.alltypes stored as parquet
> location '/test-warehouse/warmup_test_db.tbl';
> -- 2025-07-30 03:23:03,929 INFO MainThread:
> 5f41a122c82d88b6:64ea256f00000000: query started
> -- 2025-07-30 03:23:03,930 INFO MainThread:
> 5f41a122c82d88b6:64ea256f00000000: getting log for operation
> -- 2025-07-30 03:23:03,930 INFO MainThread:
> 5f41a122c82d88b6:64ea256f00000000: getting runtime profile operation
> -- 2025-07-30 03:23:03,930 INFO MainThread:
> 5f41a122c82d88b6:64ea256f00000000: closing query for operation
> -- 2025-07-30 03:23:03,958 INFO MainThread: Found PID 3701982 for
> /data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/be/build/latest/service/catalogd
> -logbufsecs=5 -v=1 -max_log_files=0 -log_rotation_match_pid=true
> -log_filename=catalogd
> -log_dir=/data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/logs/custom_cluster_tests
> -kudu_master_hosts localhost --catalog_topic_mode=minimal
> --catalogd_ha_reset_metadata_on_failover=false
> --debug_actions=catalogd_event_processing_delay:SLEEP@1000
> --enable_reload_events=true
> --warmup_tables_config_file=file:///data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/testdata/data/warmup_test_config.txt
> -catalog_service_port=26000 -state_store_subscriber_port=23020
> -webserver_port=25020 -enable_catalogd_ha=true
> -- 2025-07-30 03:23:03,985 INFO MainThread: Killing <CatalogdProcess PID:
> 3701982
> (/data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/be/build/latest/service/catalogd
> -logbufsecs=5 -v=1 -max_log_files=0 -log_rotation_match_pid=true
> -log_filename=catalogd
> -log_dir=/data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/logs/custom_cluster_tests
> -kudu_master_hosts localhost --catalog_topic_mode=minimal
> --catalogd_ha_reset_metadata_on_failover=false
> --debug_actions=catalogd_event_processing_delay:SLEEP@1000
> --enable_reload_events=true
> --warmup_tables_config_file=file:///data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/testdata/data/warmup_test_config.txt
> -catalog_service_port=26000 -state_store_subscriber_port=23020
> -webserver_port=25020 -enable_catalogd_ha=true)> with signal 9
> -- 2025-07-30 03:23:04,024 INFO MainThread: Getting metric:
> catalog-server.active-status from
> impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:25021
> -- 2025-07-30 03:23:04,028 INFO MainThread: Waiting for metric value
> 'catalog-server.active-status'=True. Current value: False. total_wait: 0s
> -- 2025-07-30 03:23:04,028 INFO MainThread: Sleeping 1s before next retry.
> -- 2025-07-30 03:23:05,029 INFO MainThread: Getting metric:
> catalog-server.active-status from
> impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:25021
> -- 2025-07-30 03:23:05,031 INFO MainThread: Metric
> 'catalog-server.active-status' has reached desired value: True. total_wait:
> 1.0043129921s
> -- 2025-07-30 03:23:05,035 INFO MainThread: hs2: executing against Impala
> at impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:21050.
> session: 7b48bf0f176821be:dc50e9789a659d95 main_cursor: True user: None
> describe warmup_test_db.tbl;
> -- 2025-07-30 03:23:06,746 INFO MainThread:
> 7e48f7c0c27463d4:727cf0f700000000: query started
> -- 2025-07-30 03:23:06,747 INFO MainThread:
> 7e48f7c0c27463d4:727cf0f700000000: getting log for operation
> -- 2025-07-30 03:23:06,747 INFO MainThread:
> 7e48f7c0c27463d4:727cf0f700000000: getting runtime profile operation
> -- 2025-07-30 03:23:06,747 INFO MainThread:
> 7e48f7c0c27463d4:727cf0f700000000: closing query for operation
> -- 2025-07-30 03:23:06,748 INFO MainThread: Starting Catalogd process:
> ['-logbufsecs=5', '-v=1', '-max_log_files=0', '-log_rotation_match_pid=true',
> '-log_filename=catalogd',
> '-log_dir=/data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/logs/custom_cluster_tests',
> '-kudu_master_hosts', 'localhost', '--catalog_topic_mode=minimal',
> '--catalogd_ha_reset_metadata_on_failover=false',
> '--debug_actions=catalogd_event_processing_delay:SLEEP@1000',
> '--enable_reload_events=true',
> '--warmup_tables_config_file=file:///data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/testdata/data/warmup_test_config.txt',
> '-catalog_service_port=26000', '-state_store_subscriber_port=23020',
> '-webserver_port=25020', '-enable_catalogd_ha=true']
> -- 2025-07-30 03:23:06,750 INFO MainThread: Getting metric:
> statestore-subscriber.connected from
> impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:25020
> -- 2025-07-30 03:23:06,751 INFO MainThread: Debug webpage not yet
> available:
> HTTPConnectionPool(host='impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com',
> port=25020): Max retries exceeded with url: /jsonmetrics?json (Caused by
> NewConnectionError('<urllib3.connection.HTTPConnection object at
> 0x7f09c6d9c750>: Failed to establish a new connection: [Errno 111] Connection
> refused',))
> Turning perftools heap leak checking off
> Redirecting stderr to
> /data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/logs/custom_cluster_tests/catalogd.impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com.jenkins.log.ERROR.20250730-032306.3703284
> -- 2025-07-30 03:23:07,754 INFO MainThread: Waiting for metric value
> 'statestore-subscriber.connected'=1. Current value: None. total_wait: 0s
> -- 2025-07-30 03:23:07,754 INFO MainThread: Sleeping 1s before next retry.
> -- 2025-07-30 03:23:08,754 INFO MainThread: Getting metric:
> statestore-subscriber.connected from
> impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:25020
> -- 2025-07-30 03:23:08,767 INFO MainThread: Metric
> 'statestore-subscriber.connected' has reached desired value: True.
> total_wait: 2.00457000732s
> -- 2025-07-30 03:23:08,794 INFO MainThread: hs2: executing against Impala
> at impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:21050.
> session: 7b48bf0f176821be:dc50e9789a659d95 main_cursor: True user: None
> alter table warmup_test_db.tbl add partition(year=2025, month=1);
> -- 2025-07-30 03:23:09,221 INFO MainThread:
> 6145ce6d36d997b7:48635e2400000000: query started
> -- 2025-07-30 03:23:09,222 INFO MainThread:
> 6145ce6d36d997b7:48635e2400000000: getting log for operation
> -- 2025-07-30 03:23:09,222 INFO MainThread:
> 6145ce6d36d997b7:48635e2400000000: getting runtime profile operation
> -- 2025-07-30 03:23:09,222 INFO MainThread:
> 6145ce6d36d997b7:48635e2400000000: closing query for operation
> -- 2025-07-30 03:23:09,249 INFO MainThread: Found PID 3701992 for
> /data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/be/build/latest/service/catalogd
> -logbufsecs=5 -v=1 -max_log_files=0 -log_rotation_match_pid=true
> -log_filename=catalogd_node1
> -log_dir=/data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/logs/custom_cluster_tests
> -kudu_master_hosts localhost --catalog_topic_mode=minimal
> --catalogd_ha_reset_metadata_on_failover=false
> --debug_actions=catalogd_event_processing_delay:SLEEP@1000
> --enable_reload_events=true
> --warmup_tables_config_file=file:///data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/testdata/data/warmup_test_config.txt
> -catalog_service_port=26001 -state_store_subscriber_port=23021
> -webserver_port=25021 -enable_catalogd_ha=true
> -- 2025-07-30 03:23:09,274 INFO MainThread: Killing <CatalogdProcess PID:
> 3701992
> (/data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/be/build/latest/service/catalogd
> -logbufsecs=5 -v=1 -max_log_files=0 -log_rotation_match_pid=true
> -log_filename=catalogd_node1
> -log_dir=/data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/logs/custom_cluster_tests
> -kudu_master_hosts localhost --catalog_topic_mode=minimal
> --catalogd_ha_reset_metadata_on_failover=false
> --debug_actions=catalogd_event_processing_delay:SLEEP@1000
> --enable_reload_events=true
> --warmup_tables_config_file=file:///data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/testdata/data/warmup_test_config.txt
> -catalog_service_port=26001 -state_store_subscriber_port=23021
> -webserver_port=25021 -enable_catalogd_ha=true)> with signal 9
> -- 2025-07-30 03:23:09,313 INFO MainThread: Getting metric:
> catalog-server.active-status from
> impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:25020
> -- 2025-07-30 03:23:09,316 INFO MainThread: Waiting for metric value
> 'catalog-server.active-status'=True. Current value: False. total_wait: 0s
> -- 2025-07-30 03:23:09,316 INFO MainThread: Sleeping 1s before next retry.
> -- 2025-07-30 03:23:10,317 INFO MainThread: Getting metric:
> catalog-server.active-status from
> impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:25020
> -- 2025-07-30 03:23:10,320 INFO MainThread: Metric
> 'catalog-server.active-status' has reached desired value: True. total_wait:
> 1.00428390503s
> -- 2025-07-30 03:23:10,324 INFO MainThread: hs2: executing against Impala
> at impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:21050.
> session: 7b48bf0f176821be:dc50e9789a659d95 main_cursor: True user: None
> show partitions warmup_test_db.tbl;
> -- 2025-07-30 03:23:37,360 INFO MainThread: Retry for error Query
> d74a2e4e80679e57:436eef8a00000000 failed:
> LocalCatalogException: Could not load table names for database
> 'warmup_test_db' from HMS
> CAUSED BY: TException: org.apache.impala.common.InternalException: Couldn't
> open transport for
> impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:26001 (connect()
> failed: Connection refused)
> CAUSED BY: InternalException: Couldn't open transport for
> impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:26001 (connect()
> failed: Connection refused)
> -- 2025-07-30 03:23:37,360 INFO MainThread: hs2: executing against Impala
> at impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:21050.
> session: 7b48bf0f176821be:dc50e9789a659d95 main_cursor: True user: None
> show partitions warmup_test_db.tbl;
> -- 2025-07-30 03:23:37,457 INFO MainThread:
> 14407808e786c43f:2ee4e3f600000000: query started
> -- 2025-07-30 03:23:37,458 INFO MainThread:
> 14407808e786c43f:2ee4e3f600000000: getting log for operation
> -- 2025-07-30 03:23:37,458 INFO MainThread:
> 14407808e786c43f:2ee4e3f600000000: getting runtime profile operation
> -- 2025-07-30 03:23:37,458 INFO MainThread:
> 14407808e786c43f:2ee4e3f600000000: closing query for operation
> -- 2025-07-30 03:23:37,459 INFO MainThread: partition result:
> ['2025\t1\t-1\t0\t0B\tNOT CACHED\tNOT
> CACHED\tPARQUET\tfalse\thdfs://localhost:20500/test-warehouse/warmup_test_db.tbl/year=2025/month=1\tNONE',
> 'Total\t\t-1\t0\t0B\t0B\t\t\t\t\t']
> -- 2025-07-30 03:23:37,459 INFO MainThread: Starting Catalogd process:
> ['-logbufsecs=5', '-v=1', '-max_log_files=0', '-log_rotation_match_pid=true',
> '-log_filename=catalogd_node1',
> '-log_dir=/data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/logs/custom_cluster_tests',
> '-kudu_master_hosts', 'localhost', '--catalog_topic_mode=minimal',
> '--catalogd_ha_reset_metadata_on_failover=false',
> '--debug_actions=catalogd_event_processing_delay:SLEEP@1000',
> '--enable_reload_events=true',
> '--warmup_tables_config_file=file:///data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/testdata/data/warmup_test_config.txt',
> '-catalog_service_port=26001', '-state_store_subscriber_port=23021',
> '-webserver_port=25021', '-enable_catalogd_ha=true']
> -- 2025-07-30 03:23:37,461 INFO MainThread: Getting metric:
> statestore-subscriber.connected from
> impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:25021
> -- 2025-07-30 03:23:37,462 INFO MainThread: Debug webpage not yet
> available:
> HTTPConnectionPool(host='impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com',
> port=25021): Max retries exceeded with url: /jsonmetrics?json (Caused by
> NewConnectionError('<urllib3.connection.HTTPConnection object at
> 0x7f0939986950>: Failed to establish a new connection: [Errno 111] Connection
> refused',))
> Turning perftools heap leak checking off
> Redirecting stderr to
> /data/jenkins/workspace/impala-cdw-master-staging-core-admissiond/repos/Impala/logs/custom_cluster_tests/catalogd.impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com.jenkins.log.ERROR.20250730-032337.3703557
> -- 2025-07-30 03:23:38,466 INFO MainThread: Waiting for metric value
> 'statestore-subscriber.connected'=1. Current value: None. total_wait: 0s
> -- 2025-07-30 03:23:38,466 INFO MainThread: Sleeping 1s before next retry.
> -- 2025-07-30 03:23:39,467 INFO MainThread: Getting metric:
> statestore-subscriber.connected from
> impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:25021
> -- 2025-07-30 03:23:39,477 INFO MainThread: Metric
> 'statestore-subscriber.connected' has reached desired value: True.
> total_wait: 2.0062930584s
> -- 2025-07-30 03:23:39,497 INFO MainThread: hs2: executing against Impala
> at impala-ec2-rhel92-m6i-4xlarge-ondemand-1cc7.vpc.cloudera.com:21050.
> session: 7b48bf0f176821be:dc50e9789a659d95 main_cursor: True user: None
> drop database if exists warmup_test_db cascade;
> -- 2025-07-30 03:23:39,708 INFO MainThread:
> 74436e446b5486d1:e2e5a77700000000: query started
> -- 2025-07-30 03:23:39,709 INFO MainThread:
> 74436e446b5486d1:e2e5a77700000000: getting log for operation
> -- 2025-07-30 03:23:39,709 INFO MainThread:
> 74436e446b5486d1:e2e5a77700000000: getting runtime profile operation
> -- 2025-07-30 03:23:39,709 INFO MainThread:
> 74436e446b5486d1:e2e5a77700000000: closing query for operation
> {code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]