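I think union is what you are looking for. Union takes two data sets at a
time, so chaining the calls in a loop, as you already do, is the way to
combine N of them.
Note that all data sets must be of the same type.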
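
One thing to watch in the loop: seed the result with the first data set
rather than null, otherwise the first union call throws a
NullPointerException. Below is a minimal sketch of that fold in plain Java,
not Flink code. The names UnionAll, unionAll and concat are made up for
illustration, and list concatenation stands in for the union of two data
sets.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.BinaryOperator;

public class UnionAll {

    // Folds a non-empty list into one value by repeatedly applying `union`.
    // Seeding with the first element avoids calling union on a null reference.
    static <T> T unionAll(List<T> parts, BinaryOperator<T> union) {
        if (parts.isEmpty()) {
            throw new IllegalArgumentException("need at least one data set");
        }
        T result = parts.get(0);
        for (T part : parts.subList(1, parts.size())) {
            result = union.apply(result, part);
        }
        return result;
    }

    // Hypothetical stand-in for the union of two data sets:
    // plain list concatenation, duplicates are kept.
    static <E> List<E> concat(List<E> a, List<E> b) {
        List<E> out = new ArrayList<>(a);
        out.addAll(b);
        return out;
    }

    public static void main(String[] args) {
        List<List<Integer>> parts = Arrays.asList(
                Arrays.asList(1, 2), Arrays.asList(3), Arrays.asList(4, 5));
        List<Integer> merged = unionAll(parts, UnionAll::concat);
        System.out.println(merged); // prints [1, 2, 3, 4, 5]
    }
}
```

With Flink on the classpath, the same helper should accept your list of
data sets with DataSet::union as the second argument (untested here), and
you can then call groupBy(0).reduce(...) on the result as before.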

2016-05-18 16:15 GMT+02:00 Ritesh Kumar Singh <riteshoneinamill...@gmail.com>:

> Hi,
>
> How can I perform a reduce operation on a group of datasets using Flink?
> Let's say my map function gives out n datasets: d1, d2, ... dN
> Now I wish to perform my reduce operation on all N datasets at once
> rather than on each one individually. The only way I have found so far
> is to apply the union operator first, as follows:
>
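> List<DataSet<X>> dataList = Arrays.asList(d1, d2, ... dN);
> // start from the first data set; calling union on null would throw
> DataSet<X> dFinal = dataList.get(0);
> for (DataSet<X> ds : dataList.subList(1, dataList.size()))
> {
>     dFinal = dFinal.union(ds);
> }
> dFinal.groupBy(0).reduce(...);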
>
> Is there a more efficient way of doing the above task using the Java API?
> GroupReduce only works on a single dataset at a time and I can't find any
> other methods that take multiple datasets as input parameters.
>
> Thanks,
> --
> Ritesh Kumar Singh
> https://riteshtoday.wordpress.com/
>
