[GitHub] [hudi] 117th opened a new issue #4700: [SUPPORT] Adding new column to table is not propogated to Hive via HMS sync mode

GitBox Thu, 27 Jan 2022 06:22:01 -0800


117th opened a new issue #4700:
URL: https://github.com/apache/hudi/issues/4700



   **Steps to reproduce the behavior**:
   
   1. Enable Hive sync via options:
   DataSourceWriteOptions.HIVE_SYNC_MODE.key() -> "hms",
   DataSourceWriteOptions.HIVE_SYNC_ENABLED.key() -> "true",
   DataSourceWriteOptions.META_SYNC_ENABLED.key() -> "true"
   
       Spark conf:
       spark.hadoop.hive.metastore.uris thrift://hive-metastore:9083
   
   2. Write dataset with some schema
   3. Write the same dataframe with new column (with exactly same options)
   
   **Expected behavior**:
   
   Hive table is created and after the second write new column is added
   
   **Actual behavior**:
   
   After the first write table is created and everything works fine. But after 
the second write column is not added to table metadata in Hive and I'm able to 
query data only by previous schema. New column is added to parquet files 
correctly and can be selected via Spark
   
   **Environment Description**
   
   * Hudi version :
   0.10, also tried with 0.10.1
   * Spark version :
   3.1.2
   * Hive version :
   3.0.0
   * Hadoop version :
   None
   * Storage (HDFS/S3/GCS..) :
   S3
   * Running on Docker? (yes/no) :
   Both Spark and Hive Metastore are running on K8S 
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

[GitHub] [hudi] 117th opened a new issue #4700: [SUPPORT] Adding new column to table is not propogated to Hive via HMS sync mode

Reply via email to