It's really more of a HiveQL trick than a SQL trick.  In SQL, one would express 
it as a subquery, which is more straightforward and readable, and has the 
additional advantage that you could eliminate the aggregation and assume P and 
N are not true simultaneously, which would nicely produce an error were that 
assumption ever violated.  HiveQL's lack of subqueries outside the FROM clause 
forces this trick to use aggregation to simulate a merge.
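
For comparison, here is a minimal sketch of one possible reading of the subquery form, run through Python's sqlite3 module on the sample rows from the original question. The function name pivot_with_subqueries and the use of the string '0' as the filler are assumptions made for illustration. SQLite silently takes the first row when a scalar subquery matches several, whereas engines such as PostgreSQL raise an error in that case, so the error-on-violation behaviour is not demonstrated here.

```python
import sqlite3

# Sample rows from the original question: (f1, f2, f3, f4, f5).
ROWS = [
    ("a1", "a2", "P", "x1", "x2"),
    ("a1", "a2", "N", "x3", "x4"),
    ("a1", "a3", "N", "x5", "x6"),
    ("a4", "a6", "P", "x7", "x8"),
]

# One correlated scalar subquery per output column, no aggregation.
# coalesce() supplies the filler when no P (or N) row exists for the key.
SUBQUERY_SQL = """
select k.f1,
       k.f2,
       coalesce((select p.f4 from mytable p
                 where p.f1 = k.f1 and p.f2 = k.f2 and p.f3 = 'P'), '0') as pf4,
       coalesce((select p.f5 from mytable p
                 where p.f1 = k.f1 and p.f2 = k.f2 and p.f3 = 'P'), '0') as pf5,
       coalesce((select n.f4 from mytable n
                 where n.f1 = k.f1 and n.f2 = k.f2 and n.f3 = 'N'), '0') as nf4,
       coalesce((select n.f5 from mytable n
                 where n.f1 = k.f1 and n.f2 = k.f2 and n.f3 = 'N'), '0') as nf5
from   (select distinct f1, f2 from mytable) k
order by k.f1, k.f2
"""


def pivot_with_subqueries(rows):
    """Load rows into an in-memory table and return the pivoted rows."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute(
            "create table mytable (f1 text, f2 text, f3 text, f4 text, f5 text)"
        )
        conn.executemany("insert into mytable values (?, ?, ?, ?, ?)", rows)
        return conn.execute(SUBQUERY_SQL).fetchall()
    finally:
        conn.close()


for row in pivot_with_subqueries(ROWS):
    print(" ".join(row))
```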
 

________________________________
 From: Stephen Sprague <sprag...@gmail.com>
To: user@hive.apache.org 
Sent: Sunday, June 23, 2013 1:20 PM
Subject: Re: how to combine some rows into 1 row in hive
  


So it's not a Hive issue at all then; instead it's clever SQL tricks.  Good luck 
maintaining that, and hopefully you've got some comments in your code for the next 
sap who comes along and has the privilege of reading it. :)





On Sat, Jun 22, 2013 at 5:38 PM, zuohua zhang <zuo...@gmail.com> wrote:

Thanks Michael! That worked without modification!
>
>
>
>On Sat, Jun 22, 2013 at 5:05 PM, Michael Malak <michaelma...@yahoo.com> wrote:
>
>Or, the single-language (HiveQL) alternative might be the following (note: I 
>haven't tested it): 
>> 
>>select f1,
>>       f2, 
>>       if(max(if(f3='P',f4,null)) is null,0,max(if(f3='P',f4,null))) pf4,
>>       if(max(if(f3='P',f5,null)) is null,0,max(if(f3='P',f5,null))) pf5,
>>       if(max(if(f3='N',f4,null)) is null,0,max(if(f3='N',f4,null))) nf4,
>>       if(max(if(f3='N',f5,null)) is null,0,max(if(f3='N',f5,null))) nf5 
>>from   mytable
>>group by f1, f2;
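
As a quick check of the idea, the same conditional aggregation can be tried on the sample rows outside Hive. This is only a sketch: it uses Python's sqlite3 module rather than Hive, rewrites if(...) as CASE with coalesce(), and uses the string '0' as the filler so every column stays text. The function name pivot is hypothetical.

```python
import sqlite3

# Sample rows from the original question: (f1, f2, f3, f4, f5).
ROWS = [
    ("a1", "a2", "P", "x1", "x2"),
    ("a1", "a2", "N", "x3", "x4"),
    ("a1", "a3", "N", "x5", "x6"),
    ("a4", "a6", "P", "x7", "x8"),
]

# max() ignores NULLs, so each CASE picks out the P (or N) value for the
# group, and coalesce() turns an all-NULL group into the '0' filler.
PIVOT_SQL = """
select f1,
       f2,
       coalesce(max(case when f3 = 'P' then f4 end), '0') as pf4,
       coalesce(max(case when f3 = 'P' then f5 end), '0') as pf5,
       coalesce(max(case when f3 = 'N' then f4 end), '0') as nf4,
       coalesce(max(case when f3 = 'N' then f5 end), '0') as nf5
from   mytable
group by f1, f2
order by f1, f2
"""


def pivot(rows):
    """Load rows into an in-memory table and return the pivoted rows."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute(
            "create table mytable (f1 text, f2 text, f3 text, f4 text, f5 text)"
        )
        conn.executemany("insert into mytable values (?, ?, ?, ?, ?)", rows)
        return conn.execute(PIVOT_SQL).fetchall()
    finally:
        conn.close()


for row in pivot(ROWS):
    print(" ".join(row))
```

Hive also provides coalesce(), so each if(... is null, 0, ...) pair in the query above could likely be shortened the same way.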
>> 
>>
>> 
>> From: Edward Capriolo <edlinuxg...@gmail.com>
>>To: "user@hive.apache.org" <user@hive.apache.org> 
>>Sent: Saturday, June 22, 2013 7:48 PM
>>Subject: Re: how to combine some rows into 1 row in hive
>> 
>>
>>
>>Using Hive's streaming feature is a nice option for this, as it is a fairly 
>>natural way to work with entire rows and return multiple columns:
>>
>>select transform(a, b, c, d) using '/bin/pipeprogram' as (a, b, c, d) from mytable;
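
A minimal sketch of what such a pipe program could look like for this particular problem, written in Python. It assumes rows arrive as tab-separated f1, f2, f3, f4, f5 and that rows sharing (f1, f2) are adjacent; the function name pivot_stream is hypothetical, and the demo reads from an in-memory sample rather than standard input.

```python
import io
from itertools import groupby


def pivot_stream(lines):
    """Yield one tab-separated output row per (f1, f2) group.

    Assumes each input line holds f1, f2, f3, f4, f5 separated by tabs,
    and that rows with the same (f1, f2) arrive next to each other.
    """
    parsed = (line.rstrip("\n").split("\t") for line in lines if line.strip())
    for (f1, f2), group in groupby(parsed, key=lambda r: (r[0], r[1])):
        # Start every output slot at "0", then fill from the P and N rows.
        slots = {"P": ["0", "0"], "N": ["0", "0"]}
        for _, _, f3, f4, f5 in group:
            if f3 in slots:
                slots[f3] = [f4, f5]
        yield "\t".join([f1, f2] + slots["P"] + slots["N"])


# Demo on the sample table from the original question.
sample = io.StringIO(
    "a1\ta2\tP\tx1\tx2\n"
    "a1\ta2\tN\tx3\tx4\n"
    "a1\ta3\tN\tx5\tx6\n"
    "a4\ta6\tP\tx7\tx8\n"
)
for out in pivot_stream(sample):
    print(out)
```

In an actual streaming script the function would be fed sys.stdin and its output written to standard output. The Hive query would also need to route and sort rows by key (for example with cluster by f1, f2 in a subquery) so that each group reaches a single script instance contiguously.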
>>
>>
>>
>>You can also write a UDTF (user-defined table function), because it can 
>>return more than one column.
>>
>>
>>
>>On Sat, Jun 22, 2013 at 6:36 PM, zuohua zhang <zuo...@gmail.com> wrote:
>>
>>I have the following table:
>>>f1 f2 f3 f4 f5
>>>a1 a2 P x1 x2 
>>>a1 a2 N x3 x4
>>>a1 a3 N x5 x6
>>>a4 a6 P x7 x8
>>>
>>>
>>>i want to convert to below:
>>>f1 f2 pf4 pf5 nf4 nf5
>>>a1 a2 x1 x2 x3 x4
>>>a1 a3 0 0 x5 x6
>>>a4 a6 x7 x8 0 0
>>>
>>>
>>>Basically, when f3="P", I want f4 and f5 to be moved to pf4 and pf5; 
>>>when f3="N", I want f4 and f5 to be moved to nf4 and nf5.
>>>When there is no "P" or "N" row for the record, I want to fill in zeros in 
>>>the corresponding fields.
>>>
>>>
>>>how to do it?
>>>
>>>
>>>Thanks,
>>>Zuohua
