[ 
https://issues.apache.org/jira/browse/FLINK-35740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17868336#comment-17868336
 ] 

Leonard Xu commented on FLINK-35740:
------------------------------------

[~ouyangwuli] We can support other connectors later, support this feature in 
MySQL firstly makes sense to me as MySQL is the top-1 widely using cdc 
connector.

> [mysql] Allow column as chunk key even if not in Primary Keys
> -------------------------------------------------------------
>
>                 Key: FLINK-35740
>                 URL: https://issues.apache.org/jira/browse/FLINK-35740
>             Project: Flink
>          Issue Type: Improvement
>          Components: Flink CDC
>            Reporter: Lee SeungMin
>            Assignee: Lee SeungMin
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: cdc-3.2.0
>
>
> PR: [[FLINK-35740] Allow column as chunk key even if not in Primary Keys by 
> SML0127 · Pull Request #3448 · apache/flink-cdc 
> (github.com)|https://github.com/apache/flink-cdc/pull/3448]
> Allow column as chunk key even if it is not in the primary keys.
>  
> There are cases where the primary key is not a numeric type, such as varchar 
> or varbinary.
> In this case, the `distributed factor` and `chunk range` may be calculated 
> incorrectly, resulting in one chunk containing too many records.
> But there was no conditions for restrict type for primary keys or chunk key 
> columns, so this may cause out of memory in the task manager.
> Actually, in our company, there was a mysql tables that PK is a varbinary 
> column and the above situation occurred.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to