[ https://issues.apache.org/jira/browse/HIVE-10720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14565782#comment-14565782 ]
Aihua Xu commented on HIVE-10720:
---------------------------------

Thanks for your understanding. I have a followup jira to rework on HIVE-5193. Yeah. I have verified that revert will work. Of course, upstream has an additional issue that I also have that jira as the subtask. Will try to fix it after the revert. Thanks again.

> Pig using HCatLoader to access RCFile and perform join but get incorrect result.
> --------------------------------------------------------------------------------
>
>                 Key: HIVE-10720
>                 URL: https://issues.apache.org/jira/browse/HIVE-10720
>             Project: Hive
>          Issue Type: Bug
>          Components: HCatalog
>    Affects Versions: 1.3.0
>            Reporter: Aihua Xu
>            Assignee: Aihua Xu
>         Attachments: HIVE-10720.patch
>
>
> {noformat}
> Create table tbl1 (c1 string, c2 string, key string, value string) stored as rcfile;
> Create table tbl2 (key string, value string);
> insert into tbl1 values('c1', 'c2', '1', 'value1');
> insert into tbl2 values('1', 'value2');
> {noformat}
> Pig script:
> {noformat}
> tbl1 = LOAD 'tbl1' USING org.apache.hive.hcatalog.pig.HCatLoader();
> tbl2 = LOAD 'tbl2' USING org.apache.hive.hcatalog.pig.HCatLoader();
> src_tbl1 = FILTER tbl1 BY (key == '1');
> prj_tbl1 = FOREACH src_tbl1 GENERATE
>     c1 as c1,
>     c2 as c2,
>     key as tbl1_key;
>
> src_tbl2 = FILTER tbl2 BY (key == '1');
> prj_tbl2 = FOREACH src_tbl2 GENERATE
>     key as tbl2_key;
>
> result = JOIN prj_tbl1 BY (tbl1_key), prj_tbl2 BY (tbl2_key);
> dump result;
> {noformat}
> You will see result "(,,1,1)" and we are missing c1 and c2 values.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)