I can replicate these results on Pig 0.14.
Did anyone open a Jira issue for this?
On Thursday, March 10, 2016 12:24 PM, Sarath Sasidharan
<[email protected]> wrote:
Hi All,
I have a script which stores 2 relations with different schema using
CSVExcelStorage.
The issue which i see is that the script picks up the last store function and
takes the schema in that and puts it for all store functions , overriding the
previous store schemas.Is this a known issue and is there a fix for this ?
My Sample Script Looks like this :--
=============================================================
masterInput = load 'hbase://xyz' using
org.apache.pig.backend.hadoop.hbase.HBaseStorage(
'f:a,f:b,f:c,f:d')
as (a,b,c,d);
input2 = foreach masterInput
generate
a,b;
input3 = foreach masterInput
generate
c,d;
store input2 into '/dir/ab'
using org.apache.pig.piggybank.storage.CSVExcelStorage('\t','YES_MULTILINE',
'UNIX', 'WRITE_OUTPUT_HEADER');
store input3 into '/dir/cd'
using org.apache.pig.piggybank.storage.CSVExcelStorage('\t','YES_MULTILINE',
'UNIX', 'WRITE_OUTPUT_HEADER');
=============================================================
Expected Output :
file 1 file 2
a,b c,d
10,20 30,40
Actual Output :
file 1 file 2
c,d c,d
10,20 30,40
Thanks and Regards,
Sarath Sasidharan