Hello,

I am trying to import data from SQL into a Hive RCFile table. I use RCFile.Writer to generate the RCFiles and upload them to the Hive table directory. The files have different numbers of columns:

- c0_1.rc has 2 columns
- c1_1.rc has 3 columns
- c2_1.rc has 4 columns
The output was strange.

With all three RC files loaded:

    hive> select * from simple;
    OK
    1 foo NULL null
    2 bar NULL null
    3 foobar NULL null
    3 haliluya NULL null
    4 holy shit NULL null
    4  NULL null
    7  NULL null

With c1_1.rc and c2_1.rc loaded:

    hive> select * from simple;
    OK
    3 haliluya 2013-05-17 14:19:02 null
    4 holy shit 2013-05-17 14:19:20 null
    4  2013-05-17 14:19:45 null
    7  2013-05-17 14:20:02 null

With only c2_1.rc loaded:

    hive> select * from simple;
    OK
    4  2013-05-17 14:19:45 4Vx�
    7  2013-05-17 14:20:02 �P"�:mo�d�)�2��

I created the table with this command:

```
create table foo (k int, v text, t timestamp, v1 binary) stored as rcfile;
```

I am using hive-0.8.1, and the RC files are attached. Can anyone help me solve this? Is it a known bug, or should I use a later version of Hive?

Thanks.
Attachments: c0_1.rc, c1_1.rc, c2_1.rc (binary data)