[ https://issues.apache.org/jira/browse/HUDI-3765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
liujinhui updated HUDI-3765: ---------------------------- Summary: Structured streaming MOR/COW table can not asynchronous clean (was: Structured streaming MOR table can not asynchronous clean) > Structured streaming MOR/COW table can not asynchronous clean > -------------------------------------------------------------- > > Key: HUDI-3765 > URL: https://issues.apache.org/jira/browse/HUDI-3765 > Project: Apache Hudi > Issue Type: Bug > Affects Versions: 0.11.0 > Reporter: liujinhui > Priority: Major > > When writing to the mor table using structured streaming, when the > asynchronous clean service is enabled, the clean will only be triggered when > the task is restarted. Through debugging, it is found that in the process of > continuous operation, the generation of clean will not actually be triggered. > ... > > Although there will be the log, it will not trigger clean. Pretty sure that > the number of version files that triggers a clean has been reached > {code:java} > // Async cleaner has been spawned. Waiting for it to finish | > org.apache.hudi.client.BaseHoodieWriteClient.autoCleanOnCommit(BaseHoodieWriteClient.java:541) > 2022-03-31 19:26:06,677 | INFO | [stream execution thread for [id = > ce830c81-00c6-4d83-8a60-1970b8a6a1c9, runId = > d57ad29c-c63e-441d-b183-0e5eb028acc5]] | Waiting for async clean service to > finish | > org.apache.hudi.async.AsyncCleanerService.waitForCompletion(AsyncCleanerService.java:73) > 2022-03-31 19:26:06,677 | INFO | [stream execution thread for [id = > ce830c81-00c6-4d83-8a60-1970b8a6a1c9, runId = > d57ad29c-c63e-441d-b183-0e5eb028acc5]] | Async cleaner has finished | > org.apache.hudi.client.BaseHoodieWriteClient.autoCleanOnCommit(BaseHoodieWriteClient.java:543) > {code} -- This message was sent by Atlassian Jira (v8.20.1#820001)