Actually, it's collect or collect_set plus the concat_ws UDF.
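
To make the semantics concrete, here is a rough Python sketch of what that combination computes per group. It is only an emulation, not Hive code: the (key, value) row shape, the sample data, and the function name group_concat are made up for illustration. The Hive query in the docstring uses a hypothetical table visits(user_id, page) and assumes a Hive version whose concat_ws accepts an array argument.

```python
from collections import defaultdict


def group_concat(rows, sep=",", distinct=True):
    """Emulate, in memory, roughly what this Hive query would return:

        SELECT user_id, concat_ws(',', collect_set(page))
        FROM visits            -- hypothetical table
        GROUP BY user_id;

    rows: iterable of (key, value) pairs (assumed input shape).
    distinct=True mimics collect_set (duplicates dropped);
    distinct=False mimics a collect-style UDAF that keeps duplicates.
    """
    groups = defaultdict(list)  # every value of every group stays in memory
    for key, value in rows:
        if distinct and value in groups[key]:
            continue  # collect_set semantics: skip values already seen
        groups[key].append(value)
    # concat_ws step: join each collected list with the delimiter
    return {key: sep.join(values) for key, values in groups.items()}


rows = [("u1", "home"), ("u1", "cart"), ("u1", "home"), ("u2", "home")]
print(group_concat(rows))
print(group_concat(rows, sep=" ", distinct=False))
```

The sketch keeps values in arrival order, but Hive does not promise any particular element order from collect_set. The per-group lists are also the reason this only works when each group is small enough to hold in memory.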

Edward

On Sun, May 13, 2012 at 9:04 PM, Edward Capriolo <edlinuxg...@gmail.com> wrote:
> I just threw together a derivative of collect_set called collect, but we
> still need some type of string join that takes an array and a delimiter.
>
> https://github.com/edwardcapriolo/hive-collect
>
> On Fri, May 11, 2012 at 4:05 PM, Edward Capriolo <edlinuxg...@gmail.com> wrote:
>> The main issue with group_concat is that the aggregate has to keep every
>> value of the column for a group in memory, and that is a big problem. If
>> the user knows the list will be small, you could write a UDAF like
>> collect_set or collect, which puts each value into a list, and then you
>> can lateral view that list.
>>
>> Edward
>>
>> On Fri, May 11, 2012 at 4:02 PM, Saurabh S <saurab...@live.com> wrote:
>>>
>>> As far as I understand, there is no equivalent of MySQL's group_concat() in
>>> Hive. This Stack Overflow question is from Sept 2010:
>>> http://stackoverflow.com/questions/3703740/combine-multiple-rows-into-one-space-separated-string
>>>
>>> Does anyone know of any other method to create a delimited list from
>>> table rows? There are a few methods listed in Anthony Molinaro's SQL
>>> Cookbook, but all of them use the over(partition by) method, which is
>>> also not available in Hive.
>>>
