[ 
https://issues.apache.org/jira/browse/HUDI-1827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17358358#comment-17358358
 ] 

Nishith Agarwal edited comment on HUDI-1827 at 6/7/21, 5:03 AM:
----------------------------------------------------------------

[~manasaks] You approach sounds good to me. For marking the baseFileFormat, you 
can pass the config like this 

.option(HoodieBootstrapConfig.HOODIE_BASE_FILE_FORMAT_PROP_NAME, "<file 
format>") 

You can add a new config here -> 
[https://github.com/apache/hudi/blob/master/hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/DataSourceOptions.scala]

Once you've added it there, it will be passed down to the above datasource 
code. 


was (Author: nishith29):
[~manasaks] You approach sounds good to me. For marking the baseFileFormat, you 
can pass the config like this 

 

.option(HoodieBootstrapConfig.HOODIE_BASE_FILE_FORMAT_PROP_NAME, "<file 
format>")

 

You can add a new config here -> 
[https://github.com/apache/hudi/blob/master/hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/DataSourceOptions.scala]

 

Once you've added it there, it will be passed down to the above datasource 
code. 

> Add ORC support in Bootstrap Op
> -------------------------------
>
>                 Key: HUDI-1827
>                 URL: https://issues.apache.org/jira/browse/HUDI-1827
>             Project: Apache Hudi
>          Issue Type: Sub-task
>          Components: Storage Management
>            Reporter: Teresa Kang
>            Assignee: manasa
>            Priority: Major
>
> SparkBootstrapCommitActionExecutor assumes parquet format right now, need to 
> support ORC as well.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to