Re: [computer-go] Random

Don Dailey Fri, 16 May 2008 11:54:34 -0700


Zach Wegner wrote:

What you could do is XOR the RDTSC into the seed (or array or whatever
your RNG uses) at the beginning of each playout. That adds to the
chaos but doesn't slow it down much at all.
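
A minimal sketch of that suggestion, under stated assumptions: Python has no direct RDTSC access, so time.perf_counter_ns() stands in for the cycle counter, and the XorShift32 class, its stir() method and the playout() function are hypothetical names made up for this illustration, not code from any program discussed here.

```python
import time

MASK32 = 0xFFFFFFFF

class XorShift32:
    """Small, fast, low-quality RNG (Marsaglia's xorshift32)."""

    def __init__(self, seed=2463534242):
        # The state must never be zero, or the generator gets stuck at zero.
        self.state = (seed & MASK32) or 1

    def next_u32(self):
        x = self.state
        x ^= (x << 13) & MASK32
        x ^= x >> 17
        x ^= (x << 5) & MASK32
        self.state = x
        return x

    def stir(self, entropy):
        """XOR outside entropy into the state, keeping it non-zero."""
        self.state = ((self.state ^ entropy) & MASK32) or 1

def playout(rng, n_moves=10, board_points=81):
    """Hypothetical playout: stir once at the start, then draw move indices."""
    # Assumption: perf_counter_ns() is a stand-in for reading RDTSC.
    rng.stir(time.perf_counter_ns())
    return [rng.next_u32() % board_points for _ in range(n_moves)]
```

The stir happens once per playout rather than once per move, so the cost of reading the counter is spread over the whole playout.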

I like your idea. Yes, you could probably use a really fast, simple RNG and do this.

AnchorMan had a very trivial RNG, and I noticed that if you played enough games, you started getting the same games over and over. So I could only play a few hundred games before this started happening (for instance, playing anchor vs anchor.) It seems like with 4 billion possible seeds this would not happen (each game started with a seed based on the time() call.) But apparently something was going on that I didn't understand, because there seemed to be only a few hundred games possible (at any fixed level.)

I switched over to Mersenne Twister and this problem went away. MT did not improve the playing quality in any way I could notice, so I don't believe MC in general requires a very good RNG.

I once built a card playing program, and while developing a playing strategy I tested different versions against each other. But then, as a sanity check, I tried testing the same version against itself, and I noticed a systematic bias: one particular side was winning something like 53 or 54 percent of the games even though the conditions were unbiased. I discovered the problem was with the RNG in the C library (this was a long time ago.)

I solved the problem by changing the way I shuffled the cards. Originally I created a deck from scratch, which always had the same ordering, then I would apply the standard Fisher-Yates shuffle. The fix was to reuse the deck from the last game and shuffle THAT deck. Basically I grabbed up all the cards in the players' hands and on the "table" and put them back in the deck, just like us humans do when playing cards, then I would shuffle them. The effect was that I implicitly created a more sophisticated RNG with lots of state and more than likely a very long cycle time compared to the simple one I started with.
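
The difference between the two shuffling schemes can be sketched like this. It is an illustration only, not the original card program: the function names are made up, and reseeding random.Random with the same value for every game is an assumed, exaggerated stand-in for a weak RNG that keeps repeating itself.

```python
import random

def fisher_yates(deck, rng):
    """Standard Fisher-Yates shuffle, in place."""
    for i in range(len(deck) - 1, 0, -1):
        j = rng.randrange(i + 1)  # 0 <= j <= i
        deck[i], deck[j] = deck[j], deck[i]

def deal_fresh(rng):
    """Original scheme: build the deck in a fixed order, then shuffle."""
    deck = list(range(52))
    fisher_yates(deck, rng)
    return deck

def deal_reused(deck, rng):
    """The fix: shuffle whatever order the last game left behind."""
    fisher_yates(deck, rng)
    return deck

# Assumption: the "RNG" produces the same stream for every game.
fresh_a = deal_fresh(random.Random(1))
fresh_b = deal_fresh(random.Random(1))      # same deal as fresh_a

deck = list(range(52))
reused_a = list(deal_reused(deck, random.Random(1)))
reused_b = list(deal_reused(deck, random.Random(1)))  # differs from reused_a
```

With a fresh deck the deal depends only on the RNG stream, so a repeated stream repeats the deal. With a reused deck the deal depends on the stream and on the previous ordering, so the deck itself acts as extra state.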

This could be done with your go program too. If you have some sort of list of all the legal points at the beginning of the game that you work with and manipulate, just leave it alone instead of reinitializing it. Or let the move sequences of the previous game impact the initial ordering or how the game is played. You would be getting a more sophisticated RNG for free. Of course you have to save state between program invocations and that's probably too ugly.
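
One way to see the effect on a point list is to count distinct orderings with a deliberately tiny generator. Everything here is an assumption made for the illustration: the 8-bit LCG, the 81-point (9x9) list, the per-playout seeding and all function names are hypothetical, not taken from any go program.

```python
def tiny_lcg(seed):
    """Deliberately weak generator: 8-bit state, so only 256 seeds exist."""
    state = seed % 256

    def randrange(n):
        nonlocal state
        state = (5 * state + 3) % 256  # full-period LCG modulo 256
        return state % n

    return randrange

def shuffle_points(points, randrange):
    """Fisher-Yates shuffle of the point list, in place."""
    for i in range(len(points) - 1, 0, -1):
        j = randrange(i + 1)
        points[i], points[j] = points[j], points[i]

def distinct_orderings(n_playouts, persistent, board_points=81):
    """Count the different point orderings seen over n_playouts."""
    points = list(range(board_points))
    seen = set()
    for game in range(n_playouts):
        if not persistent:
            points = list(range(board_points))  # reinitialize every playout
        shuffle_points(points, tiny_lcg(game))  # hypothetical per-playout seed
        seen.add(tuple(points))
    return len(seen)
```

With reinitialization, distinct_orderings(1000, persistent=False) can never exceed 256, because only 256 seeds exist. With persistent=True the ordering also depends on what earlier playouts left behind, so the count is no longer capped by the seed space, although how much larger it gets depends on the generator.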

- Don

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/
