+1 for Fabian, but I would very much like to see this as part of the API in
the future.

This function would be very useful for FlinkML as well, as we noted in a
recent discussion on the mailing list regarding time series datasets.
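
For readers following the thread, here is a minimal sketch of one way unique
IDs can be assigned in parallel without coordination. It is not the code from
the pull request and does not use any Flink API: the class name, the
zipWithUniqueId helper, and the use of nested lists to stand in for parallel
subtasks are all assumptions made for illustration. The idea is that subtask s
out of p hands out the IDs s, s + p, s + 2p, and so on, so no two subtasks can
ever produce the same value.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

public class ZipWithUniqueIdSketch {

    // Hypothetical helper, not part of Flink: each inner list stands in for
    // the elements processed by one parallel subtask.
    public static <T> List<Map.Entry<Long, T>> zipWithUniqueId(List<List<T>> partitions) {
        int parallelism = partitions.size();
        List<Map.Entry<Long, T>> result = new ArrayList<>();
        for (int subtask = 0; subtask < parallelism; subtask++) {
            long localCounter = 0; // counts elements seen by this subtask only
            for (T element : partitions.get(subtask)) {
                // Interleave the ID space: subtask s only ever emits values
                // congruent to s modulo the parallelism, so IDs never collide.
                long id = localCounter * parallelism + subtask;
                result.add(new SimpleEntry<>(id, element));
                localCounter++;
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Three simulated subtasks with uneven partition sizes.
        List<List<String>> partitions = List.of(
                List.of("a", "b", "c"),
                List.of("d"),
                List.of("e", "f"));
        for (Map.Entry<Long, String> entry : zipWithUniqueId(partitions)) {
            System.out.println(entry.getKey() + " -> " + entry.getValue());
        }
    }
}
```

The IDs produced this way are unique but not consecutive when partitions are
unevenly sized. Consecutive indices would need an extra pass to count the
elements per partition first.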

On Wed, Jun 10, 2015 at 10:56 AM, Fabian Hueske <fhue...@gmail.com> wrote:

> As Andra said, I would not add it to the API at this point.
> However, I don't think it should go into a separate Maven module
> (flink-contrib) that needs to be added as dependency but rather into some
> DataSetUtils class in flink-java.
>
> We can easily add it to the API later, if necessary. We should, however,
> extend the documentation such that users are aware of the DataSetUtils.
>
> Cheers, Fabian
>
> 2015-06-10 10:45 GMT+02:00 Andra Lungu <an...@apache.org>:
>
> > Hey everyone,
> >
> > We needed to assign unique labels as vertex values in Gelly at some point.
> > We got a nice suggestion on how to do that in parallel (implemented in
> > https://github.com/apache/flink/pull/801#issuecomment-110654447).
> >
> > Now the question is: where should these two functions go? Should they be
> > part of the API? Something like:
> >
> > class DataSet<T> {
> >   public DataSet<Tuple2<Long, T>> zipWithID() {}
> > }
> >
> > or should they go in flink-contrib? Fabian, Robert and Till seem to be in
> > favour of the second option.
> >
> > Thanks!
> >
> > Andra
> >
>
