[ https://issues.apache.org/jira/browse/KAFKA-15034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17727119#comment-17727119 ]
Yash Mayya commented on KAFKA-15034: ------------------------------------ Thanks for filing this ticket [~baz33]. I think what you're suggesting makes sense and I've raised this PR which uses a HashSet for the include / exclude fields in the ReplaceField SMT and adds a JMH benchmark to demonstrate the performance improvements - [https://github.com/apache/kafka/pull/13776] > Improvement of ReplaceField performance for long list > ----------------------------------------------------- > > Key: KAFKA-15034 > URL: https://issues.apache.org/jira/browse/KAFKA-15034 > Project: Kafka > Issue Type: Improvement > Components: KafkaConnect > Affects Versions: 3.4.0 > Reporter: BDeus > Assignee: Yash Mayya > Priority: Minor > > SMTs ReplaceField use List<String> for include and exclude filter that use > ArrayList internally. > In case of long list of filter the complexity of arraylist _O(n )_ results in > poor performance. > Could we use HashSet implementation in ReplaceField class instead of the > traditionnal ArrayList ? -- This message was sent by Atlassian Jira (v8.20.10#820010)