Hello Community,

We are facing an issue while doing count (aggregate) queries in Hive for
Delta format tables (https://github.com/delta-io/connectors) using both
TEZ/MR, where it complains
Caused by: java.lang.RuntimeException: java.lang.RuntimeException:
java.lang.ClassNotFoundException: Class io.delta.hive.HiveInputFormat not
found

Simple select queries are working fine. The problem seems to be related to
the Delta Hive Jar dependency only being available in the local Hive CLI
classpath and not in the distributed cluster mode which causes
distributed count TEZ job to fail. The steps to add the hive.aux.jars.path to
Hive is described here https://github.com/delta-io/connectors/issues/84

Need some help with what I am doing wrong and the correct way to add 3rd
party Jars so that they are available in the classpath across the entire
cluster.

Thanks,
Abhishek

Reply via email to