[GitHub] [inlong] Yizhou-Yang commented on a diff in pull request #7924: [INLONG-7900][Sort] Support partition by primary key when upsert single table of Kafka

via GitHub Thu, 04 May 2023 23:46:03 -0700


Yizhou-Yang commented on code in PR #7924:
URL: https://github.com/apache/inlong/pull/7924#discussion_r1185687301



##########
inlong-sort/sort-connectors/kafka/src/main/java/org/apache/inlong/sort/kafka/partitioner/PrimaryKeyPartitioner.java:
##########


Review Comment:
   Hi!  RawdataHashPartitioner was based on canal/debezium data and are parsed 
using corresponding JsonDynamicFormat. In contrast, the PrimaryKeyPartitioner 
is used primarily to support csv/avro/json format (but needs to keep 
compatibility with canal/debezium in case), which does not have 
JsonDynamicFormat and need not to be parsed. If we use RawDataHashPartitioner, 
there will be Null Pointer Exceptions from the missing JsonDynamicFormat.
   
   Another thing, design-wise, is that raw data is coming mostly from multiple 
sink scenerios, so instead of extending that partitioner, I want to make a 
seperate partitioner for single table, so that it might be more extendable and 
easier to maintain in the future.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@inlong.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

[GitHub] [inlong] Yizhou-Yang commented on a diff in pull request #7924: [INLONG-7900][Sort] Support partition by primary key when upsert single table of Kafka

Reply via email to