Stefan Behnel added the comment: I updated the pull request with a split version of _PyType_Lookup() that bypasses the method cache during slot updates. I also ran the benchmarks with PGO enabled now to get realistic results. The overall gain is around 15%.
Original: $ ./python -m timeit 'class Test: pass' 20000 loops, best of 5: 7.29 usec per loop Patched: $ ./python -m timeit 'class Test: pass' 50000 loops, best of 5: 6.15 usec per loop Patched with non-trivial bases: $ ./python -m timeit 'class Test(object): pass' 50000 loops, best of 5: 6.05 usec per loop $ ./python -m timeit 'class Test(type): pass' 50000 loops, best of 5: 6.08 usec per loop $ ./python -m timeit 'class Test(int): pass' 50000 loops, best of 5: 9.08 usec per loop I do not consider the optimisations a code degredation. There is one semantic change: the new function _PyType_LookupUncached() returns on the first error and might set an exception. I considered that better behaviour than the old function, but it means that if one base class name lookup fails and the next one previously succeeded, it will no longer succeed now. I don't have a strong feeling about this and would change it back if compatibility is considered more valuable. It generally feels wrong to have errors pass silently here, however unlikely they may be in practice. ---------- _______________________________________ Python tracker <rep...@bugs.python.org> <http://bugs.python.org/issue31336> _______________________________________ _______________________________________________ Python-bugs-list mailing list Unsubscribe: https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com