Mapjoin fails on multiple partitions ------------------------------------ Key: HIVE-1814 URL: https://issues.apache.org/jira/browse/HIVE-1814 Project: Hive Issue Type: Bug Reporter: Romain Thibaux
This query works: set hive.optimize.bucketmapjoin = true; set hive.optimize.bucketmapjoin.sortedmerge = true; set hive.input.format = org.apache.hadoop.hive.ql.io.BucketizedHiveInputFormat; SELECT /*+ MAPJOIN(b) */ a.field_a, b.field_b FROM table_a a JOIN table_b b ON a.ds = '2010-08-30' AND b.ds = '2010-08-30' AND a.user = b.user; This query fails with a Null Pointer Exception: set hive.optimize.bucketmapjoin = true; set hive.optimize.bucketmapjoin.sortedmerge = true; set hive.input.format = org.apache.hadoop.hive.ql.io.BucketizedHiveInputFormat; SELECT /*+ MAPJOIN(b) */ a.field_a, b.field_b FROM table_a a JOIN table_b b ON a.ds >= '2010-08-30' AND b.ds <= '2010-09-30' AND b.ds >= '2010-08-30' AND b.ds <= '2010-09-30' AND a.ds = b.ds AND a.user = b.user; java.lang.NullPointerException at org.apache.hadoop.hive.ql.exec.ExecDriver.execute(ExecDriver.java:622) at org.apache.hadoop.hive.ql.exec.MapRedTask.execute(MapRedTask.java:121) at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:118) at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:55) at org.apache.hadoop.hive.ql.exec.TaskRunner.run(TaskRunner.java:47) -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.