Re: Performing Reduce on a group of datasets

2016-05-19 Thread Fabian Hueske
I think that sentence is misleading and refers to the internals of Flink. It should be removed, IMO. You can only union two DataSets. If you want to union more, you have to do it one by one. Btw. union does not cause additional processing overhead. Cheers, Fabian 2016-05-19 14:44 GMT+02:00 Rites

Re: Performing Reduce on a group of datasets

2016-05-19 Thread Ritesh Kumar Singh
Thanks for the reply Fabian, Though here's a small thing I found on the documentation page: https://ci.apache.org/projects/flink/flink-docs-release-0.8/programming_guide.html#transformations If you look into the Union section, "This operation happens implicitly if more than one data set is used f

Re: Performing Reduce on a group of datasets

2016-05-18 Thread Fabian Hueske
I think union is what you are looking for. Note that all data sets must be of the same type. 2016-05-18 16:15 GMT+02:00 Ritesh Kumar Singh : > Hi, > > How can I perform a reduce operation on a group of datasets using Flink? > Let's say my map function gives out n datasets: d1, d2, ... dN > Now I

Performing Reduce on a group of datasets

2016-05-18 Thread Ritesh Kumar Singh
Hi, How can I perform a reduce operation on a group of datasets using Flink? Let's say my map function gives out n datasets: d1, d2, ... dN Now I wish to perform my reduce operation on all the N datasets at once and not on an individual level. The only way I figured out till now is using the union