[ https://issues.apache.org/jira/browse/FLINK-4964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15667671#comment-15667671 ]
ASF GitHub Bot commented on FLINK-4964: --------------------------------------- Github user thvasilo commented on the issue: https://github.com/apache/flink/pull/2740 @greghogan Excuse my ignorance, I'm only now learning about Flink internals :) It seems like the issue here was that `partitionByRange` partitions keys in ascending order but we want the end result in descending order. @tfournier314 I think the following should work, here I use a key extractor to negate the value of the key to achieve the desired effect: ```Scala itData.map(s => (s,1)) .groupBy(0) .sum(1) .partitionByRange(x => -x._2) // Take the negative count as the key .sortPartition(1, Order.DESCENDING) .zipWithIndex ``` > FlinkML - Add StringIndexer > --------------------------- > > Key: FLINK-4964 > URL: https://issues.apache.org/jira/browse/FLINK-4964 > Project: Flink > Issue Type: New Feature > Reporter: Thomas FOURNIER > Priority: Minor > > Add StringIndexer as described here: > http://spark.apache.org/docs/latest/ml-features.html#stringindexer > This will be added in package preprocessing of FlinkML -- This message was sent by Atlassian JIRA (v6.3.4#6332)