Tim Luo created HADOOP-11327:
--------------------------------

             Summary: BloomFilter#not() omits the last bit, resulting in an incorrect filter
                 Key: HADOOP-11327
                 URL: https://issues.apache.org/jira/browse/HADOOP-11327
             Project: Hadoop Common
          Issue Type: Bug
          Components: util
    Affects Versions: 2.5.1
            Reporter: Tim Luo
            Assignee: Tim Luo
            Priority: Minor


There's an off-by-one error in {{BloomFilter#not()}}:

{{BloomFilter#not}} calls {{BitSet#flip(0, vectorSize - 1)}}, but according to 
the javadoc for that method, {{toIndex}} is end-_exclusive_:
{noformat}
* @param  toIndex index after the last bit to flip
{noformat}

This means that the last bit in the bit array is not flipped.
Specifically, this was discovered in the following scenario:
1. A new, empty {{BloomFilter}} was created with vectorSize=7.
2. {{bloomFilter.not()}} was invoked, with the expectation that all 7 bits 
(0 through 6) would be flipped to 1 and membershipTest(...) would always return true.
3. However, membershipTest(...) often returned false, and upon 
inspection, the BitSet only had bits 0 through 5 flipped.

The fix should be simple: remove the "- 1" from the call to {{BitSet#flip}}.
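
A minimal sketch of the behaviour, using only {{java.util.BitSet}} rather than the Hadoop class itself. The class and method names ({{BloomFilterNotSketch}}, {{notAsReported}}, {{notAsProposed}}) are hypothetical and exist only for illustration; the two {{flip}} calls mirror the current call and the proposed one for vectorSize=7:

```java
import java.util.BitSet;

// Hypothetical stand-in for the bit vector inside BloomFilter; not Hadoop code.
class BloomFilterNotSketch {

    // Mirrors the reported call: toIndex is exclusive, so bit (vectorSize - 1) is skipped.
    public static BitSet notAsReported(int vectorSize) {
        BitSet bits = new BitSet(vectorSize);
        bits.flip(0, vectorSize - 1);
        return bits;
    }

    // Mirrors the proposed fix: passing vectorSize flips bits 0 through vectorSize - 1.
    public static BitSet notAsProposed(int vectorSize) {
        BitSet bits = new BitSet(vectorSize);
        bits.flip(0, vectorSize);
        return bits;
    }

    public static void main(String[] args) {
        int vectorSize = 7;
        System.out.println(notAsReported(vectorSize)); // {0, 1, 2, 3, 4, 5}
        System.out.println(notAsProposed(vectorSize)); // {0, 1, 2, 3, 4, 5, 6}
    }
}
```

With the current call, any key that hashes to bit 6 would fail the membership test, which is consistent with the intermittent failures described above.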



