huliwuli opened a new issue, #10741:
URL: https://github.com/apache/hudi/issues/10741

   Tips before filing an issue
   
   Describe the problem you faced
   
   Did Async Clustering on EMR 6.14 and Hive on Athena did not sync the latest 
commit after clustering? I want to use the hive sync tool to sync it.
   
   When using 
   ```
   cd /usr/lib/hudi/bin
   
   ./run_sync_tool.sh --base-path s3://<bucket_name>/<prefix>/<table_name> 
--database <database_name> --table <table_name> --partitioned-by <column_name>
   ```
   
   I got the error caused by java.lang.ClassNotFoundException: 
com.fasterxml.jackson,datatype.jsr310.JavaTimeModule.
   
   Also, I noticed AWS documentation includes use-jdbc false
   
![image](https://github.com/apache/hudi/assets/46934296/51ef358f-b3ac-444d-b835-30ad6cba117d)
   
   so I did 
   ```
   cd /usr/lib/hudi/bin
   
   ./run_sync_tool.sh --base-path s3://<bucket_name>/<prefix>/<table_name> 
--database <database_name> --table <table_name> --partitioned-by <column_name> 
--sync-mode hms --use-jdbc false --sync-tool-classes 
org.apache.hudi.hive.MultiPartKeysValueExtractor
   ```
   
   Then I got: 'false' but no main parameter was defined in your arg class
   
   Environment Description
   
   Hudi version : 0.13.0
   
   Spark version : 3.4.1
   
   Hive version : 0.13.1
   
   Hadoop version :
   
   Storage (HDFS/S3/GCS..) : S3
   
   Running on Docker? (yes/no) : NO


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to