Quanlong Huang has posted comments on this change. ( http://gerrit.cloudera.org:8080/22816 )
Change subject: IMPALA-13829: Postpone catalog deleteLog GC for waitForHmsEvent requests ...................................................................... Patch Set 3: (1 comment) http://gerrit.cloudera.org:8080/#/c/22816/3/tests/metadata/test_event_processing.py File tests/metadata/test_event_processing.py: http://gerrit.cloudera.org:8080/#/c/22816/3/tests/metadata/test_event_processing.py@643 PS3, Line 643: assert False, "Failed to drop table {}.{}: {}".format(db, tbl_name, str(e)) Added this assertion since I see these drop_table HMS RPC could failed. Exception in thread Thread-4: Traceback (most recent call last): File "/home/quanlong/workspace/Impala/toolchain/toolchain-packages-gcc10.4.0/python-2.7.16/lib/python2.7/threading.py", line 801, in __bootstrap_inner self.run() File "/home/quanlong/workspace/Impala/toolchain/toolchain-packages-gcc10.4.0/python-2.7.16/lib/python2.7/threading.py", line 754, in run self.__target(*self.__args, **self.__kwargs) File "/home/quanlong/workspace/Impala/tests/metadata/test_event_processing.py", line 636, in drop_table_in_hive self.hive_client.drop_table(db, tbl_name, deleteData=True) File "/home/quanlong/workspace/Impala/shell/gen-py/impala_thrift_gen/hive_metastore/ThriftHiveMetastore.py", line 3913, in drop_table self.recv_drop_table() File "/home/quanlong/workspace/Impala/shell/gen-py/impala_thrift_gen/hive_metastore/ThriftHiveMetastore.py", line 3937, in recv_drop_table raise result.o1 NoSuchObjectException: NoSuchObjectException(message='null: null') Exception in thread Thread-3: Traceback (most recent call last): File "/home/quanlong/workspace/Impala/toolchain/toolchain-packages-gcc10.4.0/python-2.7.16/lib/python2.7/threading.py", line 801, in __bootstrap_inner self.run() File "/home/quanlong/workspace/Impala/toolchain/toolchain-packages-gcc10.4.0/python-2.7.16/lib/python2.7/threading.py", line 754, in run self.__target(*self.__args, **self.__kwargs) File "/home/quanlong/workspace/Impala/tests/metadata/test_event_processing.py", line 636, in drop_table_in_hive self.hive_client.drop_table(db, tbl_name, deleteData=True) File "/home/quanlong/workspace/Impala/shell/gen-py/impala_thrift_gen/hive_metastore/ThriftHiveMetastore.py", line 3913, in drop_table self.recv_drop_table() File "/home/quanlong/workspace/Impala/shell/gen-py/impala_thrift_gen/hive_metastore/ThriftHiveMetastore.py", line 3927, in recv_drop_table (fname, mtype, rseqid) = iprot.readMessageBegin() File "/home/quanlong/workspace/Impala/infra/python/env-gcc10.4.0/lib/python2.7/site-packages/thrift/protocol/TBinaryProtocol.py", line 134, in readMessageBegin sz = self.readI32() File "/home/quanlong/workspace/Impala/infra/python/env-gcc10.4.0/lib/python2.7/site-packages/thrift/protocol/TBinaryProtocol.py", line 217, in readI32 buff = self.trans.readAll(4) File "/home/quanlong/workspace/Impala/infra/python/env-gcc10.4.0/lib/python2.7/site-packages/thrift/transport/TTransport.py", line 62, in readAll chunk = self.read(sz - have) File "/home/quanlong/workspace/Impala/infra/python/env-gcc10.4.0/lib/python2.7/site-packages/thrift/transport/TTransport.py", line 164, in read self.__rbuf = BufferIO(self.__trans.read(max(sz, self.__rbuf_size))) File "/home/quanlong/workspace/Impala/infra/python/env-gcc10.4.0/lib/python2.7/site-packages/thrift/transport/TSocket.py", line 164, in read raise TTransportException(message="unexpected exception", inner=e) TTransportException: unexpected exception However, this assertion won't fail the test. Just print less info: Exception in thread Thread-4: Traceback (most recent call last): File "/home/quanlong/workspace/Impala/toolchain/toolchain-packages-gcc10.4.0/python-2.7.16/lib/python2.7/threading.py", line 801, in __bootstrap_inner self.run() File "/home/quanlong/workspace/Impala/toolchain/toolchain-packages-gcc10.4.0/python-2.7.16/lib/python2.7/threading.py", line 754, in run self.__target(*self.__args, **self.__kwargs) File "/home/quanlong/workspace/Impala/tests/metadata/test_event_processing.py", line 643, in drop_table_in_hive assert False, "Failed to drop table {}.{}: {}".format(db, tbl_name, str(e)) AssertionError: Failed to drop table test_hms_event_sync_on_deletion_115fe4d9_db.foo_2: Unrecognized type 1 assert False Exception in thread Thread-3: Traceback (most recent call last): File "/home/quanlong/workspace/Impala/toolchain/toolchain-packages-gcc10.4.0/python-2.7.16/lib/python2.7/threading.py", line 801, in __bootstrap_inner self.run() File "/home/quanlong/workspace/Impala/toolchain/toolchain-packages-gcc10.4.0/python-2.7.16/lib/python2.7/threading.py", line 754, in run self.__target(*self.__args, **self.__kwargs) File "/home/quanlong/workspace/Impala/tests/metadata/test_event_processing.py", line 643, in drop_table_in_hive assert False, "Failed to drop table {}.{}: {}".format(db, tbl_name, str(e)) AssertionError: Failed to drop table test_hms_event_sync_on_deletion_115fe4d9_db.foo: unexpected exception assert False Will remove these. The error might due to self.hive_client is not thread-safe. It could pass null.null to HMS. Saw this in HMS logs: 2025-04-25T13:49:50,021 INFO [TThreadPoolServer WorkerProcess-188] metastore.HiveMetaStore: 203: source:127.0.0.1 drop_table : tbl=null.null.null 2025-04-25T13:49:50,021 INFO [TThreadPoolServer WorkerProcess-188] HiveMetaStore.audit: ugi=quanlong ip=127.0.0.1 cmd=source:127.0.0.1 drop_table : tbl=null.null.null 2025-04-25T13:49:50,022 WARN [TThreadPoolServer WorkerProcess-188] metastore.ObjectStore: Falling back to ORM path due to direct SQL failure (this is not an error): null at org.apache.hadoop.hive.metastore.MetaStoreDirectSql.getDatabase(MetaStoreDirectSql.java:393) at org.apache.hadoop.hive.metastore.ObjectStore$1.getSqlResult(ObjectStore.java:896) 2025-04-25T13:49:50,022 ERROR [TThreadPoolServer WorkerProcess-188] metastore.ObjectStore: java.lang.NullPointerException: null at org.apache.hadoop.hive.metastore.utils.StringUtils.normalizeIdentifier(StringUtils.java:94) ~[hive-standalone-metastore-3.1.3000.7.3.1.0-160.jar:3.1.3000.7.3.1.0-160] at org.apache.hadoop.hive.metastore.ObjectStore.getMDatabase(ObjectStore.java:853) ~[hive-standalone-metastore-3.1.3000.7.3.1.0-160.jar:3.1.3000.7.3.1.0-160] at org.apache.hadoop.hive.metastore.ObjectStore.getJDODatabase(ObjectStore.java:911) ~[hive-standalone-metastore-3.1.3000.7.3.1.0-160.jar:3.1.3000.7.3.1.0-160] at org.apache.hadoop.hive.metastore.ObjectStore$1.getJdoResult(ObjectStore.java:901) ~[hive-standalone-metastore-3.1.3000.7.3.1.0-160.jar:3.1.3000.7.3.1.0-160] at org.apache.hadoop.hive.metastore.ObjectStore$1.getJdoResult(ObjectStore.java:893) ~[hive-standalone-metastore-3.1.3000.7.3.1.0-160.jar:3.1.3000.7.3.1.0-160] at org.apache.hadoop.hive.metastore.ObjectStore$GetHelper.run(ObjectStore.java:4302) ~[hive-standalone-metastore-3.1.3000.7.3.1.0-160.jar:3.1.3000.7.3.1.0-160] at org.apache.hadoop.hive.metastore.ObjectStore.getDatabaseInternal(ObjectStore.java:903) ~[hive-standalone-metastore-3.1.3000.7.3.1.0-160.jar:3.1.3000.7.3.1.0-160] at org.apache.hadoop.hive.metastore.ObjectStore.getDatabase(ObjectStore.java:875) ~[hive-standalone-metastore-3.1.3000.7.3.1.0-160.jar:3.1.3000.7.3.1.0-160] at sun.reflect.GeneratedMethodAccessor3.invoke(Unknown Source) ~[?:?] at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) ~[?:1.8.0_432] at java.lang.reflect.Method.invoke(Method.java:498) ~[?:1.8.0_432] at org.apache.hadoop.hive.metastore.RawStoreProxy.invoke(RawStoreProxy.java:97) ~[hive-standalone-metastore-3.1.3000.7.3.1.0-160.jar:3.1.3000.7.3.1.0-160] at com.sun.proxy.$Proxy33.getDatabase(Unknown Source) ~[?:?] at org.apache.hadoop.hive.metastore.HiveMetaStore$HMSHandler.drop_table_core(HiveMetaStore.java:3253) ~[hive-standalone-metastore-3.1.3000.7.3.1.0-160.jar:3.1.3000.7.3.1.0-160] Another kinds of HMS error log is 2025-04-25T13:49:50,054 ERROR [TThreadPoolServer WorkerProcess-188] server.TThreadPoolServer: Thrift Error occurred during processing of message. org.apache.thrift.protocol.TProtocolException: Missing version in readMessageBegin, old client? at org.apache.thrift.protocol.TBinaryProtocol.readMessageBegin(TBinaryProtocol.java:254) ~[libthrift-0.16.0.jar:0.16.0] at org.apache.hadoop.hive.metastore.TUGIBasedProcessor.process(TUGIBasedProcessor.java:76) ~[hive-standalone-metastore-3.1.3000.7.3.1.0-160.jar:3.1.3000.7.3.1.0-160] at org.apache.thrift.server.TThreadPoolServer$WorkerProcess.run(TThreadPoolServer.java:250) ~[libthrift-0.16.0.jar:0.16.0] at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) ~[?:1.8.0_432] at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) ~[?:1.8.0_432] at java.lang.Thread.run(Thread.java:750) ~[?:1.8.0_432] -- To view, visit http://gerrit.cloudera.org:8080/22816 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-Project: Impala-ASF Gerrit-Branch: master Gerrit-MessageType: comment Gerrit-Change-Id: I2441440bca2b928205dd514047ba742a5e8bf05e Gerrit-Change-Number: 22816 Gerrit-PatchSet: 3 Gerrit-Owner: Quanlong Huang <huangquanl...@gmail.com> Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com> Gerrit-Reviewer: Michael Smith <michael.sm...@cloudera.com> Gerrit-Reviewer: Quanlong Huang <huangquanl...@gmail.com> Gerrit-Reviewer: Riza Suminto <riza.sumi...@cloudera.com> Gerrit-Reviewer: Zoltan Borok-Nagy <borokna...@cloudera.com> Gerrit-Comment-Date: Fri, 25 Apr 2025 09:38:04 +0000 Gerrit-HasComments: Yes