Counting things fast - was Re: Summing a 2D list

Gerhard Häring Thu, 12 Jun 2008 08:16:53 -0700

Aidan wrote:

does this work for you?


users = [1,1,1,2,2,3,4,4,4]
score = [0,1,5,3,1,2,3,3,2]

d = dict()

for u, s in zip(users, score):
  if u in d:
    d[u] += s
  else:
    d[u] = s

for key in d:
  print('user: %d\nscore: %d\n' % (key, d[key]))

I've recently had the very same problem and needed to optimize for the best solution. I've tried quite a few, including:


1) using a dictionary with a default value

import collections
d = collections.defaultdict(lambda: 0)
d[key] += value
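
A minimal sketch of option 1 applied to the users/score lists quoted above. The function name sum_scores is only for illustration, and defaultdict(int) is used here as an equivalent of defaultdict(lambda: 0), since int() returns 0:

```python
import collections

def sum_scores(users, scores):
    # Missing keys start at int() == 0, so no membership test is needed.
    totals = collections.defaultdict(int)
    for user, score in zip(users, scores):
        totals[user] += score
    # Convert back to a plain dict so later lookups of unknown keys
    # raise KeyError instead of silently creating entries.
    return dict(totals)

users = [1, 1, 1, 2, 2, 3, 4, 4, 4]
score = [0, 1, 5, 3, 1, 2, 3, 3, 2]
totals = sum_scores(users, score)
```

For the sample data this gives 6 for user 1, 4 for user 2, 2 for user 3 and 8 for user 4.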

2) Trying out if avoiding object allocation is worth the effort. Using Cython:


cdef class Counter:
    cdef int _counter
    def __init__(self):
        self._counter = 0

    def inc(self):
        self._counter += 1

    def __int__(self):
        return self._counter

    def __iadd__(self, operand):
        # Add the operand itself, so that "c += n" works for any n, not just 1.
        self._counter += operand
        return self

And no, this was *not* faster than the final solution. This counter class, which is basically a mutable int, is exactly as fast as just using this one (final solution) - tada!


counter = {}
try:
    counter[key] += 1
except KeyError:
    counter[key] = 1

Using psyco makes this a bit faster still. psyco can't optimize defaultdict or my custom Counter class, though.
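
For anyone who wants to repeat the comparison between the defaultdict variant and the try/except variant, here is a rough timeit sketch. The function names and the sample data are made up for illustration, and the timings will depend on the interpreter version and on how often keys repeat, so treat the numbers as indicative only:

```python
import collections
import timeit

def count_defaultdict(keys):
    # Option 1: missing keys are created with a default of 0.
    counter = collections.defaultdict(int)
    for key in keys:
        counter[key] += 1
    return dict(counter)

def count_try_except(keys):
    # Final solution: plain dict, KeyError handled on first sight of a key.
    counter = {}
    for key in keys:
        try:
            counter[key] += 1
        except KeyError:
            counter[key] = 1
    return counter

# Made-up sample data: 100,000 keys drawn from 100 distinct values.
keys = [i % 100 for i in range(100000)]

for func in (count_defaultdict, count_try_except):
    seconds = timeit.timeit(lambda: func(keys), number=10)
    print("%s: %.3f s" % (func.__name__, seconds))
```

The try/except form only pays for the exception the first time each key is seen, so it tends to do well when there are few distinct keys relative to the number of items counted.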


-- Gerhard

--
http://mail.python.org/mailman/listinfo/python-list
