[ https://issues.apache.org/jira/browse/HIVE-10720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14567905#comment-14567905 ]
Aihua Xu commented on HIVE-10720: --------------------------------- Thanks [~viraj] I updated the pig script above which you can use to check against your local repository. I ran against Pig 0.12, but seems like the version shouldn't matter. > Pig using HCatLoader to access RCFile and perform join but get incorrect > result. > -------------------------------------------------------------------------------- > > Key: HIVE-10720 > URL: https://issues.apache.org/jira/browse/HIVE-10720 > Project: Hive > Issue Type: Bug > Components: HCatalog > Affects Versions: 1.3.0 > Reporter: Aihua Xu > Assignee: Aihua Xu > Attachments: HIVE-10720.patch > > > {noformat} > Create table tbl1 (c1 string, c2 string, key string, value string) stored as > rcfile; > Create table tbl2 (key string, value string); > insert into tbl1 values('c1', 'c2', '1', 'value1'); > insert into tbl2 values('1', 'value2'); > {noformat} > Pig script: > {noformat} > tbl1 = LOAD 'tbl1' USING org.apache.hive.hcatalog.pig.HCatLoader(); > tbl2 = LOAD 'tbl2' USING org.apache.hive.hcatalog.pig.HCatLoader(); > src_tbl1 = FILTER tbl1 BY (key == '1'); > prj_tbl1 = FOREACH src_tbl1 GENERATE > c1 as c1, > c2 as c2, > key as tbl1_key; > > src_tbl2 = FILTER tbl2 BY (key == '1'); > prj_tbl2 = FOREACH src_tbl2 GENERATE > key as tbl2_key; > > result = JOIN prj_tbl1 BY (tbl1_key), prj_tbl2 BY (tbl2_key); > dump result; > {noformat} > You will see result "(,,1,1)" and we are missing c1 and c2 values. -- This message was sent by Atlassian JIRA (v6.3.4#6332)