rishabhbandi opened a new issue, #6055:
URL: https://github.com/apache/hudi/issues/6055

   **Describe the problem you faced**
   
   **Scenario #1:**
   
   1)created a dataframe(**targetDf**) and using the below statement to write 
it in GCS Bucket location (for ex - **locA**)
   
targetDF.write.format(org.apache.hudi).options(hudiWriteConf).mode(SaveMode.Overwrite).save(locA)
   
   2)then we are creating an external hudi table on locA. lets call it 
**ext_hudi_tbl_on_locA**
   
   3)next we have our dataframe which contains record that has columns to be 
updated. lets call it **updDf**.
   
   4)we are creating a spark table on top of **updDf** in spark session, lets 
call it **upd_spark_tbl**.
   
   5)then we run the merge command using spark.sql() way on 
**ext_hudi_tbl_on_locA** using **upd_spark_tbl**, the statement finishes 
without any error but it does not update any record.
   
   NOTE: we checked that there is no data issue, if we join the tables 
**ext_hudi_tbl_on_locA** and **upd_spark_tbl** it works and give the joined 
data result.
   
   
   **Scenario #2**
   
   1)we create an managed hudi table. lets call **int_hudi_tbl**
   
   2)we insert data from **targetDf** into the above hudi table. using 
spark.sql() way.
   
   3)next we have our dataframe which contains record that has columns to be 
updated lets call it **updDf**.
   
   4)we are creating a spark table on top of **updDf** in spark session, lets 
call it **upd_spark_tbl**
   
   5)then we run the merge command using spark.sql() way on **int_hudi_tbl** 
using **upd_spark_tbl**, the statement finishes without any error but this time 
it updates the data.
   
   
   CONCLUSION
   Scenario #1: no error thrown and update does not works, Scenario #2: no 
error thrown and update works.
   
   Please advise why its not working in Scenario #1.
   
   
   **Environment Description**
   
   * Hudi version : 0.11.0
   
   * Spark version : 2.4.8
   
   * Hive version :2.3.7
   
   * Hadoop version :
   
   * Storage (HDFS/S3/GCS..) : GCS
   
   * Running on Docker? (yes/no) : no
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to