Hi Team,

I'm using keyBy to maintain field-based ordering across tasks, but I'm
encountering data skewness among the task slots. I have 96 task slots, and
I'm sending data with 500 distinct keys used in keyBy. While reviewing the
Flink UI, I noticed that a few task slots are underutilized while others
are overutilized.

This seems to be a hashing problem. Can anyone suggest a better hashing
technique or approach to resolve this issue?

Thanks in advance for your help.

Reply via email to