xiarixiaoyao opened a new pull request, #6017:
URL: https://github.com/apache/hudi/pull/6017

   …d add new fields.
   
   ## *Tips*
   - *Thank you very much for contributing to Apache Hudi.*
   - *Please review https://hudi.apache.org/contribute/how-to-contribute before 
opening a pull request.*
   
   ## What is the purpose of the pull request
   
   Improve schema reconciliation to make it more flexible in presence of full 
schema evolution being enabled.
   
   Desired behavior:
   
   1) incoming data has missing columns that were already defined in the table 
–> null values will be injected into missing columns 
   2) incoming data contains new columns not defined yet in the table -> 
columns will be added to the table schema (incoming dataframe?)
   3) incoming data has missing columns that are already defined in the table 
and new columns not yet defined in the table -> new columns will be added to 
the table schema, missing columns will be injected with null values
   4) No column should be dropped when using hive sync utility when schema 
reconciliation is enabled.
   5) support nested fields.
   ## Brief change log
   
   *(for example:)*
     - *Modify AnnotationLocation checkstyle rule in checkstyle.xml*
   
   ## Verify this pull request
   
   UT test
   
   (or)
   
   This change added tests and can be verified as follows:
   
   *(example:)*
   
     - *Added integration tests for end-to-end.*
     - *Added HoodieClientWriteTest to verify the change.*
     - *Manually verified the change by running a job locally.*
   
   ## Committer checklist
   
    - [ ] Has a corresponding JIRA in PR title & commit
    
    - [ ] Commit message is descriptive of the change
    
    - [ ] CI is green
   
    - [ ] Necessary doc changes done or have another open PR
          
    - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to