Le 2014-07-28 à 10:58, David G Johnston <david.g.johns...@gmail.com> a écrit :

> François Beausoleil wrote
>> Hi all,
>> 
>> NOTE: Also posted to StackOverflow:
>> http://stackoverflow.com/questions/24997131/pairwise-array-sum-aggregate-function
>> 
>> I have a table with arrays as one column, and I want to sum the array
>> elements together:
>> 
>>> create table regres(a int[] not null);
>>> insert into regres values ('{1,2,3}'), ('{9, 12, 13}');
>>> select * from regres;
>>     a
>> -----------
>> {1,2,3}
>> {9,12,13}
>> 
>> I want the result to be:
>> 
>> {10, 14, 16}
>> 
>> that is: {1 + 9, 2 + 12, 3 + 13}.
>> 
>> Does such a function already exist somewhere? The intagg extension looked
>> like a good candidate, but such a function does not already exist.
>> 
>> The arrays are expected to be between 24 and 31 elements in length, all
>> elements are NOT NULL, and the arrays themselves will also always be NOT
>> NULL. All elements are basic int. There will be more than two rows per
>> aggregate.
>> 
>> My implementation target is:
>> 
>> PostgreSQL 9.1.13 on x86_64-unknown-linux-gnu, compiled by gcc
>> (Ubuntu/Linaro 4.6.3-1ubuntu5) 4.6.3, 64-bit
> 
> This should be doable with pl/pgsql, CREATE FUNCTION, and CREATE AGGREGATE
> 
> You basically want to write a "sum" function that accepts an array input. 
> You will need to explode both the stored state and the incoming data, add
> them together, then reconstruct a new array to put back into the state.  The
> final function would just return the current contents of state.
> 
> With a different language you may be able to optimize by using a native
> array capabilities of the language but with pl/pgsql I think you will have
> to do the explode/rebuild.
> 
> Another implementation possibility is to keep track of the number of input
> rows but simply concatenate the new data onto the end of the old data.  In
> the final function you would break apart (unnest) the array into a single
> long set and then use division/modulo to identify which items belong
> together and add them.  You then convert the result back into an array and
> return.  This should save considerable array exploding time.

Erwin Brandstetter posted a great answer on StackOverflow 
http://stackoverflow.com/a/24997565/7355

SELECT ARRAY(
  SELECT sum(arr[rn])
  FROM (
    SELECT arr, generate_subscripts(arr, 1) rn
    FROM tbl t) sub
  GROUP BY rn
  ORDER BY rn
);

I like the simplicity of the solution, vs creating a new aggregate function.

Thanks everyone!
François Beausoleil



-- 
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

Reply via email to