Antoine Pitrou added the comment:

I'm not sure what "inconsistent" means. If the results are unstable between runs, it may mean the operations being measured themselves are unstable (for example because of hashing differences or cache aliasing effects from run to run).
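As a rough illustration of the hashing point, the sketch below shows that string hashes change between interpreter processes unless the hash seed is pinned. The helper name string_hash_in_fresh_interpreter is hypothetical and the seed values 1 and 2 are arbitrary; this is one possible check, not something taken from the benchmark suite.

```python
import os
import subprocess
import sys


def string_hash_in_fresh_interpreter(text, seed):
    """Return hash(text) computed by a new Python process.

    Hypothetical helper for illustration: the child process runs with
    PYTHONHASHSEED set to `seed`, which fixes its str hash randomization.
    """
    env = dict(os.environ, PYTHONHASHSEED=str(seed))
    out = subprocess.check_output(
        [sys.executable, "-c", f"print(hash({text!r}))"],
        env=env,
    )
    return int(out)


# Same seed twice, then a different seed (seed values chosen arbitrarily).
same_a = string_hash_in_fresh_interpreter("benchmark", 1)
same_b = string_hash_in_fresh_interpreter("benchmark", 1)
other = string_hash_in_fresh_interpreter("benchmark", 2)

print("same seed, same hash:", same_a == same_b)
print("different seed, same hash:", same_a == other)
```

If a benchmark's timings settle down once PYTHONHASHSEED is fixed, that would suggest dict or set layout differences contribute to the run-to-run spread. It would not rule out the other causes mentioned above, such as cache aliasing.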
I would rather benchmarks be judged on their usefulness:

- spectral_norm really looks pointless, as nobody would write scientific code in Python like that
- telco, AFAIU, is a widely-used benchmark for decimals (but perhaps Stefan can shed some light)
- regex_v8 claims to be drawn from real-world use of regular expressions by popular Web pages, so it sounds useful as well

(Note that telco apparently loads a file in the main loop; perhaps that can be pulled out of the loop and into the init phase.)

----------
nosy: +haypo, yselivanov

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue26416>
_______________________________________