sivabalan narayanan created HUDI-3889:
-----------------------------------------

             Summary: Do not validate table config if save mode is set to 
Overwrite
                 Key: HUDI-3889
                 URL: https://issues.apache.org/jira/browse/HUDI-3889
             Project: Apache Hudi
          Issue Type: Task
          Components: spark
            Reporter: sivabalan narayanan


with spark datasource write, if Overwrite is set as save mode, we should not do 
table config validation 

 
{code:java}
scala> df.write.format("hudi").
     |   option(PRECOMBINE_FIELD_OPT_KEY, "tpep_dropoff_datetime").
     |   option(RECORDKEY_FIELD_OPT_KEY, "tpep_pickup_datetime").
     |   option(PARTITIONPATH_FIELD_OPT_KEY, "date_col").
     |   option(TABLE_NAME, "hudi_tbl1").
     |   option("hoodie.embed.timeline.server","false").
     |   mode(Overwrite).
     |   save(basePath)
warning: one deprecation; for details, enable `:setting -deprecation' or 
`:replay -deprecation'
org.apache.hudi.exception.HoodieException: Config conflict(key  current value   
existing value):
RecordKey:      tpep_pickup_datetime    id
PreCombineKey:  tpep_dropoff_datetime   created_at
  at 
org.apache.hudi.HoodieWriterUtils$.validateTableConfig(HoodieWriterUtils.scala:161)
  at org.apache.hudi.HoodieSparkSqlWriter$.write(HoodieSparkSqlWriter.scala:87)
  at org.apache.hudi.DefaultSource.createRelation(DefaultSource.scala:161)
  at 
org.apache.spark.sql.execution.datasources.SaveIntoDataSourceCommand.run(SaveIntoDataSourceCommand.scala:45)
 {code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to