Re: [HACKERS] MD5 aggregate

Benedikt Grundmann Fri, 14 Jun 2013 06:23:45 -0700

On Fri, Jun 14, 2013 at 2:14 PM, Tom Lane <t...@sss.pgh.pa.us> wrote:


> Marko Kreen <mark...@gmail.com> writes:
> > On Thu, Jun 13, 2013 at 12:35 PM, Dean Rasheed <dean.a.rash...@gmail.com>
> wrote:
> >> Attached is a patch implementing a new aggregate function md5_agg() to
> >> compute the aggregate MD5 sum across a number of rows.
>
> > It's more efficient to calculate per-row md5, and then sum() them.
> > This avoids the need for ORDER BY.
>
> Good point.  The aggregate md5 function also fails to distinguish the
> case where we have 'xyzzy' followed by 'xyz' in two adjacent rows
> from the case where they contain 'xyz' followed by 'zyxyz'.
>
> Now, as against that, you lose any sensitivity to the ordering of the
> values.
>
> Personally I'd be a bit inclined to xor the per-row md5's rather than
> sum them, but that's a small matter.
>
>                         regards, tom lane
>
>
xor works but only if each row is different (e.g. at the very least all
columns together make a unique key).




>
> --
> Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
> To make changes to your subscription:
> http://www.postgresql.org/mailpref/pgsql-hackers
>

Re: [HACKERS] MD5 aggregate

Reply via email to