Antoine Pitrou added the comment:

I'm not sure what "inconsistent" means. If the results are unstable between 
runs, it may mean the operations being measured themselves are unstable (for 
example because of hashing differences or cache aliasing effects from run to 
run).
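
A minimal sketch of one such source of run-to-run variation, assuming a
CPython version where string hash randomization is enabled by default (3.3 and
later). The helper name string_hash_with_seed is hypothetical; it starts a
fresh interpreter with a chosen PYTHONHASHSEED and reports the hash of a fixed
string:

```python
import os
import subprocess
import sys


def string_hash_with_seed(seed):
    """Return hash('telco') as computed by a fresh interpreter.

    Hypothetical helper: the child process runs with PYTHONHASHSEED set
    to the given seed, so its string hashes depend only on that seed.
    """
    env = dict(os.environ, PYTHONHASHSEED=str(seed))
    completed = subprocess.run(
        [sys.executable, "-c", "print(hash('telco'))"],
        env=env,
        capture_output=True,
        text=True,
        check=True,
    )
    return int(completed.stdout)


if __name__ == "__main__":
    # Same seed: same hash. Different seeds: (almost certainly) different.
    for seed in (1, 1, 2):
        print(seed, string_hash_with_seed(seed))
```

Pinning PYTHONHASHSEED for a benchmark run removes this particular source of
variation; it does nothing about cache aliasing or other effects.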

I'd rather benchmarks be judged on their usefulness:
- spectral_norm really looks pointless as nobody would write scientific code in 
Python like that
- telco, AFAIU, is a widely-used benchmark for decimals (but perhaps Stefan can 
shed some light)
- regex_v8 claims to be drawn from real-world use of regular expressions by 
popular Web pages, so it sounds useful as well

(Note that telco apparently loads a file in the main loop; perhaps that can be 
pulled out of the loop and into the init phase.)
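
A rough sketch of that change, not the actual telco code: load_values,
compute and the two bench_* functions are hypothetical stand-ins, and the
input is assumed to be a text file with one decimal number per line.

```python
import time
from decimal import Decimal


def load_values(path):
    # Hypothetical loader: one decimal number per non-empty line.
    with open(path) as f:
        return [Decimal(line.strip()) for line in f if line.strip()]


def compute(values):
    # Stand-in for the decimal arithmetic the benchmark wants to measure.
    return sum(values, Decimal(0))


def bench_load_in_loop(path, loops):
    # File I/O and parsing are timed together with the arithmetic.
    # Assumes loops >= 1.
    start = time.perf_counter()
    for _ in range(loops):
        result = compute(load_values(path))
    return result, time.perf_counter() - start


def bench_load_in_init(path, loops):
    # The file is read once in the init phase, outside the timed region.
    # Assumes loops >= 1.
    values = load_values(path)
    start = time.perf_counter()
    for _ in range(loops):
        result = compute(values)
    return result, time.perf_counter() - start
```

Both variants return the same result; only the second keeps file access out
of the measured loop.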

----------
nosy: +haypo, yselivanov

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue26416>
_______________________________________
_______________________________________________
Python-bugs-list mailing list