On 2013-04-01 23:34, Alexander Graf wrote:
> Is this faster than a load/store with std/ldbrx?
Hmm. Almost certainly not. And since we've got stack space allocated for function calls, we've got scratch space to do it in. Probably similar for bswap32 too, eh?

I'll do a tiny bit of benchmarking for power7.

r~
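
For reference, a minimal portable C sketch of the two approaches being compared. This is only an illustration, not the code from the patch: the function names bswap64_shift and bswap64_mem are made up here, and the memory variant merely models what a std followed by ldbrx through a stack scratch slot would compute, without saying anything about which one is faster.

```c
#include <stdint.h>
#include <string.h>

/* Register-only variant: swap bytes, then halfwords, then words,
 * using shifts and masks. No memory traffic. */
static uint64_t bswap64_shift(uint64_t x)
{
    x = ((x & 0x00ff00ff00ff00ffULL) << 8)
      | ((x >> 8) & 0x00ff00ff00ff00ffULL);
    x = ((x & 0x0000ffff0000ffffULL) << 16)
      | ((x >> 16) & 0x0000ffff0000ffffULL);
    return (x << 32) | (x >> 32);
}

/* Scratch-memory variant (hypothetical model of std + ldbrx):
 * store the value to a scratch slot, then load it back with the
 * byte order reversed. Reversing the in-memory bytes is a byte
 * swap regardless of host endianness. */
static uint64_t bswap64_mem(uint64_t x)
{
    unsigned char scratch[8], rev[8];
    uint64_t r;
    int i;

    memcpy(scratch, &x, sizeof(scratch));   /* the "store" */
    for (i = 0; i < 8; i++) {
        rev[i] = scratch[7 - i];            /* byte-reversed view */
    }
    memcpy(&r, rev, sizeof(r));             /* the "load" */
    return r;
}
```

Both functions should return the same value for any input, so the only open question is the cost of the round trip through memory versus the longer register sequence.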