Github user tillrohrmann commented on the pull request: https://github.com/apache/flink/pull/1069#issuecomment-135838438 @anisnasir thanks for your contribution. Out of curiosity I was wondering why the `PartialPartitioner` distributes the data exactly between two channels. Wouldn't it also be conceivable to distribute it between an arbitrary number? Then one could adjust the `PartialPartitioner` depending on the actual data skew. I assume that there are situations where your data is still skewed even after distributing it onto two different consumers. It would be great if you could add another test which tests the functioning of the partitioner in a more applied scenario, if possible. Maybe one could check that the skew of the input data decreases after the partitioning.
--- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---