Re: Self-join with filter

2015-08-18 Thread Ashwin Jayaprakash
at 6:30 AM, Ashwin Jayaprakash < ashwin.jayaprak...@gmail.com> wrote: > Hi, I'm trying to evaluate Flink to see if it can do efficient semi-joins > or self-joins with filter. > > Problem description: > I have 1 stream that can contain "near duplicates" records

Self-join with filter

2015-08-17 Thread Ashwin Jayaprakash
Hi, I'm trying to evaluate Flink to see if it can do efficient semi-joins or self-joins with filter. Problem description: I have 1 stream that can contain "near duplicates" records. The records share a "family name" and so, many records can have the same family name. But each record has a unique i