Re: Bloom filter in Flink

2016-12-13 Thread Fabian Hueske
Hi Gennady, this bloom filter is actually not distributed and only used internally as an optimization to reduce the amount of data spilled by a hash join. So, it is not meant to be user facing and not integrated in any API. You could of course use the code, but there might be better implementation

Bloom filter in Flink

2016-12-13 Thread Gennady Gilin
Hi Everyone, Noticed that Flink sources are contain distributed Bloom filter implementation, so wandering is somebody tried to use it in production for large scale it