Re: get most common number in a list with tolerance

Gerard Flanagan Fri, 20 Feb 2009 02:16:41 -0800

Astan Chee wrote:

Hi,
I have a list that has a bunch of numbers in it and I want to get themost common number that appears in the list. This is trivial because ican do a max on each unique number. What I want to do is to have atolerance to say that each number is not quite unique and if thedifference from other numbers is small, than it can be counted together.This is what I have to get the common number (taken from the internetsomewhere):
l = [10,30,20,20,11,12]
d = {}
tolerance = 5
for elm in l:
   d[elm] = d.get(elm, 0) + 1
counts = [(j,i) for i,j in d.items()]



This of course returns a list where each of them is unique

[(1, 12), (1, 10), (1, 11), (2, 20), (1, 30)]

but I am expecting a list that looks like this:

[(3, 10), (2, 20), (1, 30)]


Maybe check for this number: tolerance * ( X / tolerance)

or a variation if tolerance is non-integer?

Eg. (rough):


from itertools import groupby


def tmax(seq, alpha):
    longest = []
    for k, g in groupby(sorted(seq), lambda x: alpha * (x / alpha)):
        g = list(g)
        if len(g) > len(longest):
            longest = g
    return longest[0]

a = [10, 30, 20, 20, 11, 12]


assert tmax(a, 5) == 10
assert tmax(a, 1) == 20


--
http://mail.python.org/mailman/listinfo/python-list

Re: get most common number in a list with tolerance

Reply via email to