Theodore Vasiloudis created FLINK-2202:
------------------------------------------

             Summary: Calling distinct() requires tuple input
                 Key: FLINK-2202
                 URL: https://issues.apache.org/jira/browse/FLINK-2202
             Project: Flink
          Issue Type: Improvement
          Components: Core, Scala API
            Reporter: Theodore Vasiloudis
            Priority: Minor


Currently to call distinct on a DataSet the elements must be placed in a tuple.

This creates the need to write code like the following:

{code}
val doubleDS: DataSet[Double] = ...
val uniqueDS = doubleDS.map( el => Tuple1(el)).distinct().map(t => t._1)
{code}

which looks quite unnecessary. Ideally we would like to just have to write:
 
{code}
val uniqueDS = doubleDS.distinct()
{code}

which should be possible as long as there exists an implicit {{Ordering\[T\]}} 
for a {{DataSet\[T\]}}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to