Choice of Bloom Filter implementation in Hadoop application

Ivan Balashov Fri, 05 Jun 2009 05:31:56 -0700

Dear all,

As part of optimization process in our Hadoop application we're trying
to use Bloom filter in order not to pass needless records through to
the reduce stage.


We've noticed, that Hadoop dev team recently introduced the
implementation of BloomMapFile
(https://issues.apache.org/jira/browse/HADOOP-3063), primarily
intended for internal Hadoop use.

Our question now is could we use Hadoop Bloom filter implementation
for the filtering purposes of our application, or it is intended
solely for internal usage and it would be better for us to consider
other implementation?

Please let me know if I should provide more detail on this matter.

Thanks,


--
Kind regards,
Ivan

Choice of Bloom Filter implementation in Hadoop application

Reply via email to