[ 
https://issues.apache.org/jira/browse/HUDI-3812?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Raymond Xu updated HUDI-3812:
-----------------------------
    Status: In Progress  (was: Open)

> Metadata is not enabled by default on the Read Path
> ---------------------------------------------------
>
>                 Key: HUDI-3812
>                 URL: https://issues.apache.org/jira/browse/HUDI-3812
>             Project: Apache Hudi
>          Issue Type: Bug
>            Reporter: Alexey Kudinkin
>            Assignee: Alexey Kudinkin
>            Priority: Blocker
>              Labels: pull-request-available
>             Fix For: 0.11.0
>
>
> While Metadata Table is enabled by default on the Write Path (in 
> HoodieMetadataConfig), it's disabled by default on the Read Path (at least in 
> Spark).
>  
> Now that Data Skipping is enabled by default (as of 0.10, actually), it fails 
> because Data Skipping relies solely on the Metadata Table (MT) and Column 
> Stats to function.
>  
> We need to revisit the current default configs to make sure they make sense, 
> and either
>  # Switch off Data Skipping by default as well (if we want to be 
> ultra-conservative), or
>  # Switch on the Metadata Table by default on the Read Path.
>  
> Frankly, I can hardly imagine why we'd enable MT on the Write Path by 
> default but not on the Read Path: this brings its cost into everyone's flows 
> without any of the benefits. Out of the box, people will have to discover 
> that it's switched off and switch it on themselves, which seems like 
> something everyone is likely to do regardless.
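
The mismatch described above can be illustrated with a minimal sketch. It is not Hudi code: the helper names are hypothetical, the two config keys are assumed to be the ones Hudi uses for these switches, and the read-path defaults are taken from the description rather than verified against any release.

```python
# Config keys assumed to control the two features (verify against the Hudi docs).
METADATA_ENABLE = "hoodie.metadata.enable"
DATA_SKIPPING_ENABLE = "hoodie.enable.data.skipping"

# Read-path defaults as described in this issue, not checked against the code.
READ_DEFAULTS_AS_DESCRIBED = {
    METADATA_ENABLE: "false",
    DATA_SKIPPING_ENABLE: "true",
}


def effective_read_options(user_options, defaults=READ_DEFAULTS_AS_DESCRIBED):
    """Overlay user-supplied options on top of the defaults (hypothetical helper)."""
    merged = dict(defaults)
    merged.update(user_options)
    return merged


def is_consistent(options):
    """Data Skipping depends on the Metadata Table, so skipping requires MT."""
    skipping = options.get(DATA_SKIPPING_ENABLE, "false").lower() == "true"
    metadata = options.get(METADATA_ENABLE, "false").lower() == "true"
    return metadata or not skipping
```

Under these assumptions the defaults alone are inconsistent, and either proposed fix (disabling Data Skipping or enabling the Metadata Table) makes them consistent. On a Spark read, the same keys would presumably be passed through `.option(...)` on the reader.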



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
