Github user tillrohrmann commented on the pull request:

    https://github.com/apache/flink/pull/1069#issuecomment-135838438
  
    @anisnasir thanks for your contribution. Out of curiosity, why does the 
`PartialPartitioner` distribute the data between exactly two channels? 
Wouldn't it also be conceivable to distribute it across an arbitrary number of 
channels? Then one could tune the `PartialPartitioner` to the actual data 
skew. I assume there are situations where the data is still skewed even 
after distributing it across two different consumers.
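
    To make the question concrete, here is a rough sketch of what a configurable number of candidate channels could look like. It is not the `PartialPartitioner` from this pull request: the class name `MultiChoicePartitioner`, the `numChoices` parameter and the `mix` hash helper are all hypothetical, and load is tracked only locally on the sender.

```java
/**
 * Hypothetical sketch, not the PartialPartitioner from the pull request.
 * Each key hashes to numChoices candidate channels, and the record is sent
 * to the candidate that has received the fewest records so far.
 */
public class MultiChoicePartitioner {
    private final long[] load;     // records sent to each channel so far (sender-local view)
    private final int numChoices;  // candidates per key; 2 corresponds to the two-channel case

    public MultiChoicePartitioner(int numChannels, int numChoices) {
        if (numChannels < 1 || numChoices < 1 || numChoices > numChannels) {
            throw new IllegalArgumentException("need 1 <= numChoices <= numChannels");
        }
        this.load = new long[numChannels];
        this.numChoices = numChoices;
    }

    /** Returns the chosen channel index for the key and updates the local load count. */
    public int selectChannel(Object key) {
        int best = -1;
        for (int i = 0; i < numChoices; i++) {
            // Candidates for different seeds may coincide; that only reduces the spread.
            int candidate = Math.floorMod(mix(key.hashCode(), i), load.length);
            if (best < 0 || load[candidate] < load[best]) {
                best = candidate;
            }
        }
        load[best]++;
        return best;
    }

    /** Copy of the per-channel record counts observed by this sender. */
    public long[] loads() {
        return load.clone();
    }

    // Assumption: a simple seeded integer mixer stands in for a family of hash functions.
    private static int mix(int hash, int seed) {
        int h = hash * 0x9E3779B9 + seed * 0x85EBCA6B;
        h ^= h >>> 16;
        h *= 0x7FEB352D;
        h ^= h >>> 15;
        return h;
    }
}
```

    With `numChoices = 1` this degenerates to plain hash partitioning, and larger values trade more spreading of hot keys against more consumers holding partial state for the same key.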
    
    It would be great if you could add another test that exercises the 
partitioner in a more applied scenario, if possible. Maybe one could 
check that the skew of the input data decreases after the partitioning.
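
    One possible shape for such a check, as a sketch only: compute an imbalance ratio (busiest channel divided by the mean load) for a skewed key stream and compare plain hashing against a two-choice selector. The names `SkewCheck`, `imbalance`, `hashOnly` and `twoChoices` are hypothetical, and `twoChoices` is a simplified stand-in that picks between two adjacent channels rather than calling the actual `PartialPartitioner`.

```java
import java.util.function.ToIntFunction;

public class SkewCheck {
    /** Ratio of the busiest channel's record count to the mean count; 1.0 is perfectly balanced. */
    public static double imbalance(String[] keys, int numChannels, ToIntFunction<String> selector) {
        long[] load = new long[numChannels];
        for (String key : keys) {
            load[selector.applyAsInt(key)]++;
        }
        long max = 0;
        for (long l : load) {
            max = Math.max(max, l);
        }
        return max / ((double) keys.length / numChannels);
    }

    /** Baseline: every record of a key goes to the same channel. */
    public static ToIntFunction<String> hashOnly(int numChannels) {
        return key -> Math.floorMod(key.hashCode(), numChannels);
    }

    /** Hypothetical two-choice stand-in: the less loaded of two adjacent channels (needs numChannels >= 2). */
    public static ToIntFunction<String> twoChoices(int numChannels) {
        long[] sent = new long[numChannels];
        return key -> {
            int a = Math.floorMod(key.hashCode(), numChannels);
            int b = (a + 1) % numChannels;
            int pick = sent[b] < sent[a] ? b : a;
            sent[pick]++;
            return pick;
        };
    }
}
```

    For a stream that consists of a single hot key on four channels, the ratio drops from 4.0 with `hashOnly` to 2.0 with `twoChoices`. The skew decreases, but the stream is still not balanced.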


