Re: Data Source Generator emits 4 instances of the same tuple

2016-06-09 Thread Biplob Biswas
Yes Thanks a lot, also the fact that I was using ParallelSourceFunction was problematic. So as suggested by Fabian and Robert, I used Source Function and then in the flink job, i set the output of map with a parallelism of 4 to get the desired result. Thanks again. -- View this message in conte

Re: Data Source Generator emits 4 instances of the same tuple

2016-06-09 Thread Fabian Hueske
We solved this problem yesterday at the Flink Hackathon. The issue was that the source function was started with parallelism 4 and each function read the whole file. Cheers, Fabian 2016-06-06 16:53 GMT+02:00 Biplob Biswas : > Hi, > > I tried streaming the source data 2 ways > > 1. Is a simple st

Re: Data Source Generator emits 4 instances of the same tuple

2016-06-06 Thread Biplob Biswas
Hi, I tried streaming the source data 2 ways 1. Is a simple straight forward way of sending data without using the serving speed concept http://pastebin.com/cTv0Pk5U 2. The one where I use the TaxiRide source which is exactly similar except loading the data in the proper data structures. http:

Re: Data Source Generator emits 4 instances of the same tuple

2016-06-06 Thread Ufuk Celebi
Can you please post your code as well? The duplication might happen in your part of the code and not the TaxiRideSource. – Ufuk On Mon, Jun 6, 2016 at 10:30 AM, Biplob Biswas wrote: > Hi, > > I am using a Data Source Generator, which very closely resembles the one > here on the dataartisans git