at 6:30 AM, Ashwin Jayaprakash <ashwin.jayaprak...@gmail.com> wrote:
Hi, I'm trying to evaluate Flink to see if it can do efficient semi-joins
or self-joins with a filter.

Problem description:
I have one stream that can contain "near-duplicate" records. The records
share a "family name", so many records can have the same family name.
But each record has a unique i