Hey Iñaki, you can use the KeySelector as described here: https://ci.apache.org/projects/flink/flink-docs-release-1.0/apis/common/index.html#specifying-keys
But you only a local view for the current element, e.g. the library you use to determine the similarity has to know the similarities upfront. – Ufuk On Mon, Jun 6, 2016 at 9:31 AM, iñaki williams <juanramall...@gmail.com> wrote: > Hi guys, > > I am using Flink on my project and I have a question. (I am using Java) > > Is it possible to modify the keyby method in order to key by similarities > and not by the exact name? > > Example: I recieve 2 DataStreams, in the first one , the name of the field > that I want to KeyBy is "John Locke", while in the Datastream the field > value is "John L". Can I use some java library to find for similarities > between strings and if the similitude is high, then key those elements > together.