I doesn't seem too fast based on my test, but perhaps there are ways to optimize it.

I would like to mention that your generator is 64 bit and the other generator was 32 bits but I'm running on a 64 bit core 2 duo OS and do not know how much of a factor this is or isn't.

If RDTSC actually is too slow, a way around it is to design the RDTSC to be called only occasionally. But this involves a branching instruction which is probably going to hurt it too. I wonder if you could have a "burst" version that generates several at a time and queues them up? Then you grab them off the queue until you run out, then you insert some entropy with RDTSC and generate 16 more numbers or something like that. This still requires a branch instruction. Of course you are inserting far less entropy into the system if you only do it occasionally, but I would bet that you have several bits of entropy in this case, because a lot is going on before you get to the next call.
- Don



A van Kessel wrote:
The question I have is how much does RDTSC slow this down? I may do
I don't know.
If I understand correctly, rdtsc can be executed in parallen with *some*
other instructions (and places no "barrier" in the code), in which case
it could be executed at no additional cost (except the instruction fetching,
of course)
I have not yet been able to instruct GCC to use the mmx or sse3 instructions/
registers to do the shifting. Code could be reshuffeled to allow the
compiler more freedom.

AvK
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to