[ 
https://issues.apache.org/jira/browse/HUDI-2807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sivabalan narayanan closed HUDI-2807.
-------------------------------------
    Resolution: Cannot Reproduce

> Failing to acquire lock with async clustering if clustering gets delayed due 
> to lack of resources
> -------------------------------------------------------------------------------------------------
>
>                 Key: HUDI-2807
>                 URL: https://issues.apache.org/jira/browse/HUDI-2807
>             Project: Apache Hudi
>          Issue Type: Bug
>    Affects Versions: 0.10.0
>            Reporter: sivabalan narayanan
>            Priority: Major
>         Attachments: datatimeline.out
>
>
> With deltastreamer continuous mode and multi writer enabled, if async 
> clustering while about to commit, acquires lock, and gets delayed to complete 
> the transaction due to large writes or due to lack of resources, regular 
> delta commits on the data timeline fails to acquire lock. I do understand 
> this behavior might not be surprising, given thats how locking semantics work.
> But I tried increasing the num retries for lock acquisition for ZK based lock 
> provider, but even after 8 mins ish, clustering does not get a compute share 
> and regular writes fail. when deltastreamer was shutdown, the clustering gets 
> to completion. 
>  
> From logs, I see that metadata table writes for replace commit happen 
> quickly, but the release of the lock and data table commit does not happen 
> until delta streamer is shutdown by regular writes (failed to acquire lock).
>  
>  
>  
>  
>  



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to