[ 
https://issues.apache.org/jira/browse/HIVE-10720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14567905#comment-14567905
 ] 

Aihua Xu commented on HIVE-10720:
---------------------------------

Thanks, [~viraj]. I updated the Pig script in the issue description, which you 
can use to check against your local repository. I ran it against Pig 0.12, but 
it seems the version shouldn't matter.

> Pig using HCatLoader to access RCFile and perform join but get incorrect 
> result.
> --------------------------------------------------------------------------------
>
>                 Key: HIVE-10720
>                 URL: https://issues.apache.org/jira/browse/HIVE-10720
>             Project: Hive
>          Issue Type: Bug
>          Components: HCatalog
>    Affects Versions: 1.3.0
>            Reporter: Aihua Xu
>            Assignee: Aihua Xu
>         Attachments: HIVE-10720.patch
>
>
> {noformat}
> Create table tbl1 (c1 string, c2 string, key string, value string) stored as 
> rcfile;
> Create table tbl2 (key string, value string);
> insert into tbl1 values('c1', 'c2', '1', 'value1');
> insert into tbl2 values('1', 'value2');
> {noformat}
> Pig script:
> {noformat}
> tbl1 = LOAD 'tbl1' USING org.apache.hive.hcatalog.pig.HCatLoader();
> tbl2 = LOAD 'tbl2' USING org.apache.hive.hcatalog.pig.HCatLoader();
> src_tbl1 = FILTER tbl1 BY (key == '1');
> prj_tbl1 = FOREACH src_tbl1 GENERATE
>            c1 as c1,
>            c2 as c2,
>            key as tbl1_key;
>            
> src_tbl2 = FILTER tbl2 BY (key == '1');
> prj_tbl2 = FOREACH src_tbl2 GENERATE
>            key as tbl2_key;
>            
> result = JOIN prj_tbl1 BY (tbl1_key), prj_tbl2 BY (tbl2_key);
> dump result;
> {noformat}
> The dumped result is "(,,1,1)": the c1 and c2 values are missing. The 
> expected result is "(c1,c2,1,1)".
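
For reference, the expected output of the script can be sketched with a plain-Python model of the same filter, projection, and join. This is only an illustration of the intended semantics, assuming the two single-row tables from the description. It does not touch Hive, HCatalog, or Pig, and the variable names merely mirror the Pig aliases.

```python
# Plain-Python model of the Pig script in the description.
# Assumption: each table holds exactly the one row inserted above.
tbl1 = [{"c1": "c1", "c2": "c2", "key": "1", "value": "value1"}]
tbl2 = [{"key": "1", "value": "value2"}]

# FILTER ... BY (key == '1')
src_tbl1 = [row for row in tbl1 if row["key"] == "1"]
src_tbl2 = [row for row in tbl2 if row["key"] == "1"]

# FOREACH ... GENERATE c1, c2, key AS tbl1_key / key AS tbl2_key
prj_tbl1 = [(row["c1"], row["c2"], row["key"]) for row in src_tbl1]
prj_tbl2 = [(row["key"],) for row in src_tbl2]

# JOIN prj_tbl1 BY tbl1_key, prj_tbl2 BY tbl2_key (inner join on the key)
result = [
    left + right
    for left in prj_tbl1
    for right in prj_tbl2
    if left[2] == right[0]
]

print(result)  # [('c1', 'c2', '1', '1')]
```

The joined tuple carries c1 and c2 through from tbl1, which is what the Pig dump should show instead of the empty fields.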



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
