I agree with Theo. I think it’s a nice feature to have as part of the
standard API because only a few users will be aware of something like
DataSetUtils. However, as a first version we can make it part of
DataSetUtils.

Cheers,
Till

On Wed, Jun 10, 2015 at 11:52 AM Theodore Vasiloudis <
theodoros.vasilou...@gmail.com> wrote:

> +1 for Fabian, but I would very much like to see this as part of the API in
> the future.
>
> This function would be very useful for FlinkML as well, as we noted in a
> recent discussion on the mailing list regarding time series datasets.
>
> On Wed, Jun 10, 2015 at 10:56 AM, Fabian Hueske <fhue...@gmail.com> wrote:
>
> > As Andra said, I would not add it to the API at this point.
> > However, I don't think it should go into a separate Maven module
> > (flink-contrib) that needs to be added as dependency but rather into some
> > DataSetUtils class in flink-java.
> >
> > We can easily add it to the API later, if necessary. We should, however,
> > extend the documentation such that users are aware of the DataSetUtils.
> >
> > Cheers, Fabian
> >
> > 2015-06-10 10:45 GMT+02:00 Andra Lungu <an...@apache.org>:
> >
> > > Hey everyone,
> > >
> > > We needed to assign unique labels as vertex values in Gelly at some point.
> > > We got a nice suggestion on how to do that in parallel (implemented in
> > > https://github.com/apache/flink/pull/801#issuecomment-110654447).
> > >
> > > Now the question is where should these two functions go? Should they be
> > > part of the API? Something like:
> > >
> > > class DataSet<T> {
> > >   public DataSet<Tuple2<Long, T>> zipWithID() {}
> > > }
> > >
> > > or should they go in flink-contrib? Fabian, Robert and Till seem to be
> > > in favour of the second option.
> > >
> > > Thanks!
> > >
> > > Andra
> > >
> >
>
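
For readers following the thread, here is a rough sketch of one common way
to assign unique IDs in parallel without any coordination between workers.
It is not taken from the pull request above and may differ from what was
actually implemented there. The class ZipWithIdSketch, the Labeled pair
(standing in for Tuple2<Long, T>) and both method names are hypothetical,
and plain Java lists stand in for the partitions of a DataSet. The idea
assumed here: partition p out of n hands out the IDs p, p + n, p + 2n, and
so on, so two partitions can never produce the same ID.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch; plain Java, no Flink dependency, not Flink's implementation.
class ZipWithIdSketch {

    // Simple (id, value) pair standing in for Tuple2<Long, T>.
    static final class Labeled<T> {
        final long id;
        final T value;

        Labeled(long id, T value) {
            this.id = id;
            this.value = value;
        }
    }

    // Labels the elements of a single partition.
    // The k-th element (k = 0, 1, 2, ...) of partition p gets the ID k * n + p,
    // where n is the total number of partitions.
    static <T> List<Labeled<T>> labelPartition(List<T> partition,
                                               int partitionIndex,
                                               int numPartitions) {
        List<Labeled<T>> out = new ArrayList<>(partition.size());
        long localCounter = 0;
        for (T value : partition) {
            out.add(new Labeled<>(localCounter * numPartitions + partitionIndex, value));
            localCounter++;
        }
        return out;
    }

    // Labels every partition. Each labelPartition call only needs its own
    // index and the partition count, so the calls could run in parallel;
    // they are executed sequentially here to keep the sketch short.
    static <T> List<Labeled<T>> zipWithUniqueId(List<List<T>> partitions) {
        List<Labeled<T>> out = new ArrayList<>();
        int n = partitions.size();
        for (int p = 0; p < n; p++) {
            out.addAll(labelPartition(partitions.get(p), p, n));
        }
        return out;
    }
}
```

Note that the IDs produced this way are unique but not consecutive when
partitions differ in size. Consecutive indices would need an extra pass
that first counts the elements of each partition.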
