[ 
https://issues.apache.org/jira/browse/KAFKA-15034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17727119#comment-17727119
 ] 

Yash Mayya commented on KAFKA-15034:
------------------------------------

Thanks for filing this ticket [~baz33]. I think what you're suggesting makes 
sense. I've raised a PR that uses a HashSet for the include / exclude fields 
in the ReplaceField SMT and adds a JMH benchmark to demonstrate the 
performance improvement: [https://github.com/apache/kafka/pull/13776] 
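
To illustrate the idea only (this is not the code from the PR), here is a 
minimal sketch of a field filter backed by HashSet. The class name 
FieldFilterSketch, the helper keep() and the sample field names are 
hypothetical, and the rule "exclude wins, an empty include set keeps 
everything" is an assumption made for the example.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class FieldFilterSketch {

    // Hypothetical helper, not Kafka's actual code: decides whether a field
    // survives the filter. Assumed rule: an excluded field is always dropped,
    // and an empty include set means "keep everything else".
    public static boolean keep(String field, Set<String> include, Set<String> exclude) {
        return !exclude.contains(field)
                && (include.isEmpty() || include.contains(field));
    }

    public static void main(String[] args) {
        // The configuration arrives as a list; copy it into a HashSet once so
        // that every later contains() call is a hash lookup (O(1) on average)
        // rather than a linear scan of the list (O(n)).
        List<String> configuredExclude = Arrays.asList("password", "ssn");
        Set<String> exclude = new HashSet<>(configuredExclude);
        Set<String> include = new HashSet<>();

        System.out.println(keep("name", include, exclude));     // prints "true"
        System.out.println(keep("password", include, exclude)); // prints "false"
    }
}
```

The one-time copy costs O(n), so the gain shows up when the same filter is 
applied to many fields across many records, which is the long-list case 
described in the ticket.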

> Improvement of ReplaceField performance for long list
> -----------------------------------------------------
>
>                 Key: KAFKA-15034
>                 URL: https://issues.apache.org/jira/browse/KAFKA-15034
>             Project: Kafka
>          Issue Type: Improvement
>          Components: KafkaConnect
>    Affects Versions: 3.4.0
>            Reporter: BDeus
>            Assignee: Yash Mayya
>            Priority: Minor
>
> The ReplaceField SMT uses List<String> for its include and exclude filters, 
> which are backed by ArrayList internally.
> With a long filter list, the _O(n)_ lookup complexity of ArrayList results 
> in poor performance.
> Could we use a HashSet implementation in the ReplaceField class instead of 
> the traditional ArrayList?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
