Teddy Choi created HIVE-5277:
--------------------------------

             Summary: HBase handler skips rows with null valued first cells 
when only row key is selected
                 Key: HIVE-5277
                 URL: https://issues.apache.org/jira/browse/HIVE-5277
             Project: Hive
          Issue Type: Bug
          Components: HBase Handler
            Reporter: Teddy Choi
            Assignee: Teddy Choi


HBaseStorageHandler skips rows whose first cell is null when only the row
key is selected. In the example below, COUNT(key) returns 1 although the
table contains two rows.

{noformat}
SELECT key, col1, col2 FROM hbase_table;
key1    cell1   cell2 
key2    NULL    cell3

SELECT COUNT(key) FROM hbase_table;
1
{noformat}

HiveHBaseTableInputFormat.getRecordReader adds the first cell to the scan so
that rows are not skipped when only the row key is requested. But when that
first cell is null, HBase does not return the row, so it is skipped.

http://hbase.apache.org/book/perf.reading.html 12.9.6. Optimal Loading of Row 
Keys describes how to deal with this problem.
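
To make the failure mode concrete, here is a toy model in plain Java. It does not use the HBase or Hive APIs; the class RowKeyScanModel and its methods are hypothetical names made up for illustration, and a missing map entry stands in for a null cell. It only sketches why restricting the scan to one explicit column loses rows, while taking the first cell that exists in each row does not.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Toy model only (assumption: not the real HBase scan logic).
// A "table" maps row key -> (column name -> value).
// A missing map entry stands in for a null (absent) HBase cell.
final class RowKeyScanModel {

    // Mimics a scan restricted to one explicit column:
    // rows that have no cell in that column are not returned.
    static int countWithExplicitColumn(Map<String, Map<String, String>> table,
                                       String column) {
        int count = 0;
        for (Map<String, String> cells : table.values()) {
            if (cells.containsKey(column)) {
                count++;
            }
        }
        return count;
    }

    // Mimics a scan that returns the first existing cell of each row:
    // any row with at least one cell is returned.
    static int countWithFirstExistingCell(Map<String, Map<String, String>> table) {
        int count = 0;
        for (Map<String, String> cells : table.values()) {
            if (!cells.isEmpty()) {
                count++;
            }
        }
        return count;
    }

    // Builds the two-row table from the example in this report.
    static Map<String, Map<String, String>> sampleTable() {
        Map<String, Map<String, String>> table = new LinkedHashMap<>();

        Map<String, String> row1 = new LinkedHashMap<>();
        row1.put("col1", "cell1");
        row1.put("col2", "cell2");
        table.put("key1", row1);

        Map<String, String> row2 = new LinkedHashMap<>();
        row2.put("col2", "cell3"); // col1 is null, so it has no entry
        table.put("key2", row2);

        return table;
    }

    public static void main(String[] args) {
        Map<String, Map<String, String>> table = sampleTable();
        // Selecting only col1 misses key2.
        System.out.println(countWithExplicitColumn(table, "col1"));
        // Taking the first existing cell sees both rows.
        System.out.println(countWithFirstExistingCell(table));
    }
}
```

With the sample table, the explicit-column count is 1 (matching the wrong COUNT(key) result above) and the first-existing-cell count is 2.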

I looked for an existing issue but could not find one. If you know of one
that covers the same problem, please mark this issue as a duplicate.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
