Hello, I'm trying to configure ACID hive in a kerberos environment with : Hadoop 3.1.4 deployed in HA considered working and now I'm trying to setup hive with remote metastore and ACID configuration. I may misunderstand what is written in documentation https://cwiki.apache.org/confluence/display/Hive/Hive+Transactions , but to me it's cleary stated that DbTxnManager do not need zookeeper to run properly, However it seems it in fact needs a zookeeper running somewhere as I have several attempt to connect to a local zookeeper instance: 2021-09 -02T 15:22:21,708 INFO [main-SendThread(localhost: 2181 )] client.ZooKeeperSaslClient : Client will use GSSAPI as SASL mechanism. 2021-09-02T15:22:21,708 DEBUG [main-SendThread(localhost: 2181 )] client.ZooKeeperSaslClient : creating sasl client: client=h******* ;service=zookeeper;serviceHostname=localhost 2021-09-02T15:22:21,709 INFO [main-SendThread(localhost: 2181 )] zookeeper.ClientCnxn : Opening socket connection to server localhost/ 127 . 0 . 0 . 1 : 2181 . Will attempt to SASL-authenticate using Login Context section 'HiveZooKeeperClient' 2021-09-02T15:22:21,710 WARN [main-SendThread(localhost: 2181 )] zookeeper.ClientCnxn : Session 0x0 for server null , unexpected error, closing socket connection and attempting reconnect java.net.ConnectException : Connection refused
Could you please help me to understand properly the documentation ? Do I need or not a zookeeper instance if using DbTxnManager and if so, why it isn't stated precisely and explicitely in the documentation ? I hope you have the best of day. Antoine DUBOIS
smime.p7s
Description: S/MIME Cryptographic Signature