[ https://issues.apache.org/jira/browse/HUDI-9043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17936425#comment-17936425 ]
Geser Dugarov commented on HUDI-9043: ------------------------------------- {color:#000000}`RowDataStreamWriteFunction{color}::deduplicateRecordsIfNeeded` should be completed first now, and then we could check costs. > Analyze possibility to optimize `FlinkWriteHelper::deduplicateRecords` > ---------------------------------------------------------------------- > > Key: HUDI-9043 > URL: https://issues.apache.org/jira/browse/HUDI-9043 > Project: Apache Hudi > Issue Type: Task > Reporter: Geser Dugarov > Assignee: Geser Dugarov > Priority: Major > > `FlinkWriteHelper::deduplicateRecords` looks like too costly. -- This message was sent by Atlassian Jira (v8.20.10#820010)