Handling data skewness

Karthick Fri, 16 Aug 2024 16:51:23 -0700

Hi Team,

I'm using keyBy to preserve per-key ordering across tasks, but the load is
skewed across the task slots. The job has 96 task slots, and the stream
carries 500 distinct keys in the keyBy field. In the Flink UI, I can see
that a few task slots are underutilized while others are overloaded.
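
To illustrate the imbalance described above, here is a minimal sketch
that counts how 500 keys land on 96 slots. It is a simulation under stated
assumptions, not Flink's actual code path: the key values ("key-0" to
"key-499"), the mix() function (a generic murmur3-style finalizer), and the
max parallelism of 256 are all hypothetical stand-ins. Only the two-step
shape (key to key group, key group to slot) is meant to mirror how keyBy
routes records.

```java
import java.util.Arrays;

public class KeySkewSketch {

    // Stand-in integer mixer (murmur3-style finalizer).
    // Assumption: this is NOT Flink's exact hash function.
    static int mix(int h) {
        h ^= h >>> 16;
        h *= 0x85ebca6b;
        h ^= h >>> 13;
        h *= 0xc2b2ae35;
        h ^= h >>> 16;
        return h;
    }

    // Counts how many distinct keys end up on each slot.
    static int[] countKeysPerSlot(int numKeys, int maxParallelism, int parallelism) {
        int[] counts = new int[parallelism];
        for (int i = 0; i < numKeys; i++) {
            String key = "key-" + i; // hypothetical key values
            // Step 1: key -> key group in [0, maxParallelism)
            int keyGroup = Math.floorMod(mix(key.hashCode()), maxParallelism);
            // Step 2: key group -> slot index in [0, parallelism)
            int slot = keyGroup * parallelism / maxParallelism;
            counts[slot]++;
        }
        return counts;
    }

    public static void main(String[] args) {
        // 500 keys, 96 slots; 256 key groups is an assumed value.
        int[] counts = countKeysPerSlot(500, 256, 96);
        System.out.println("min keys per slot: " + Arrays.stream(counts).min().getAsInt());
        System.out.println("max keys per slot: " + Arrays.stream(counts).max().getAsInt());
        System.out.println("per-slot counts:   " + Arrays.toString(counts));
    }
}
```

Even with a well-behaved hash, 500 keys cannot divide evenly over 96 slots
(at least one slot must hold 6 or more keys), and the skew gets worse if
some keys carry far more records than others.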


This seems to be a hashing problem. Can anyone suggest a better hashing
technique or approach to resolve this issue?

Thanks in advance for your help.
