Allan Yan created HIVE-11033: -------------------------------- Summary: BloomFilter index is not honored by ORC reader Key: HIVE-11033 URL: https://issues.apache.org/jira/browse/HIVE-11033 Project: Hive Issue Type: Bug Affects Versions: 1.2.0 Reporter: Allan Yan
There is a bug in the org.apache.hadoop.hive.ql.io.orc.ReaderImpl class which caused the bloom filter index saved in the ORC file not being used. The reason is because the bloomFilterIndices variable defined in the SargApplier class superseded from its parent class. Here is one way to fix it {noformat} 18:46 $ diff src/java/org/apache/hadoop/hive/ql/io/orc/RecordReaderImpl.java src/java/org/apache/hadoop/hive/ql/io/orc/RecordReaderImpl.java.original 174d173 < bloomFilterIndices = new OrcProto.BloomFilterIndex[types.size()]; 178c177 < sarg, options.getColumnNames(), strideRate, types, included.length, bloomFilterIndices); --- > sarg, options.getColumnNames(), strideRate, types, included.length); 204a204 > bloomFilterIndices = new OrcProto.BloomFilterIndex[types.size()]; 673c673 < List<OrcProto.Type> types, int includedCount, OrcProto.BloomFilterIndex[] bloomFilterIndices) { --- > List<OrcProto.Type> types, int includedCount) { 677c677 < this.bloomFilterIndices = bloomFilterIndices; --- > bloomFilterIndices = new OrcProto.BloomFilterIndex[types.size()]; {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)