[ https://issues.apache.org/jira/browse/FLINK-25330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17464458#comment-17464458 ]
Ibson commented on FLINK-25330: ------------------------------- Hi [~jingge] We use HBase as a dimension table, Each piece of data in the fact table will be joined with the dimension table data. In this case, It will happen get the date after deleting it. I think set 'KEEP_DELETED_CELLS => true' can't solve this problem, It may be get the data which should been deleted. > Flink SQL doesn't retract all versions of Hbase data > ---------------------------------------------------- > > Key: FLINK-25330 > URL: https://issues.apache.org/jira/browse/FLINK-25330 > Project: Flink > Issue Type: Bug > Components: Connectors / HBase > Reporter: Bruce Wong > Assignee: Jing Ge > Priority: Critical > Labels: pull-request-available > Attachments: Flink-SQL-Test.zip, bundle_data.zip, > image-2021-12-15-20-05-18-236.png, test_res.png, test_res_1.png > > > h2. Background > When we use CDC to synchronize mysql data to HBase, we find that HBase > deletes only the last version of the specified rowkey when deleting mysql > data. The data of the old version still exists. You end up using the wrong > data. And I think its a bug of HBase connector. > The following figure shows Hbase data changes before and after mysql data is > deleted. > !image-2021-12-15-20-05-18-236.png|width=910,height=669! > > h2. -- This message was sent by Atlassian Jira (v8.20.1#820001)