[
https://issues.apache.org/jira/browse/HUDI-2175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated HUDI-2175:
---------------------------------
Labels: pull-request-available (was: )
> Support dynamic schemas with hudi
> ---------------------------------
>
> Key: HUDI-2175
> URL: https://issues.apache.org/jira/browse/HUDI-2175
> Project: Apache Hudi
> Issue Type: Improvement
> Components: Common Core
> Reporter: sivabalan narayanan
> Priority: Major
> Labels: pull-request-available
>
> Sometimes, users have a requirement where they have different producers and
> each producer produces only a subset of columns.
>
> for eg:
> Producer 1: rec_key, colA, colB, colC
> Producer 2: rec_key, colC, colD, colE, colF
> Producer 3: rec_key, colB, colF, colI, colK
>
> Expectation from hudi:
> keep merging new columns and inject defaults values for all other missing
> columns.
>
> So, for above usecase, final hudi table's schema is expected to be
> rec_key, colA, colB, colC, colD, colE, colF, colI, colK
>
>
>
>
>
--
This message was sent by Atlassian Jira
(v8.20.1#820001)