As part of optimization process in our Hadoop application we're trying
to use Bloom filter in order not to pass needless records through to
the reduce stage.
We've noticed, that Hadoop dev team recently introduced the
implementation of BloomMapFile
intended for internal Hadoop use.
Our question now is could we use Hadoop Bloom filter implementation
for the filtering purposes of our application, or it is intended
solely for internal usage and it would be better for us to consider
Please let me know if I should provide more detail on this matter.