Re: [computer-go] 10k UCT bots

Don Dailey Tue, 13 May 2008 12:00:44 -0700


Don Dailey wrote:

Jason House wrote:
On May 13, 2008, at 1:51 PM, Mark Boon <[EMAIL PROTECTED]> wrote:
On 13-mei-08, at 14:10, Álvaro Begué wrote:
What others do is the right thing to do. Your method will introduce
some biases.
Could you elaborate what bias it could lead to? I also do the sameas Jason. I did consider the possibility of a bias but couldn'timmediately think of one.
And I was thinking "let's not repeat this topic"...
The probability of picking a point is proportional to 1 + number ofillegal points before it.
In practice, illegal moves are pretty rare until endgame. At thatpoint, it's a trade off between bias and speed. Random numbergeneration is not cheap. I have yet to see empirical proof that thepick and scan is bad. I've tried both methods in the past and saw nomeasurable difference in strength.
I may perhaps take a look at this myself. I think there is a way todeal with this issue and get the best of both worlds or at leastapproach it. If this method does not degrade the quality, it's amoot point. Otherwise, here is an idea:
1. For each move list size, pre-compute N tables of move listtraversal orderings.
  2. At move selection time alternative between them.
So you would have something in the C language like:

    int *order[S][C]

I forget to mention that S is the move list size (number of empty pointsyou are interested in) and C is the number of list orderings you want toalternate between. And a list is returns that is of size S-1 (sinceyour first move is selected by calling the random number generater.) Anumber like 8 or 16 is good (a power of two) because each time you usingthe list to select a move, you can increment a counter so that you use adifferent move list ordering the next time you are confronted with amove list of this same exact size.

Your first move is selected randomly, then from then on you use thearray returned to form an offset from this.This would pay off if you really wanted something very close touniform random and your random number generator was clearly expensive.I think there is a great deal of chaos that would make this "almost"perfectly uniform. The first move would be selected as you alreadydo, using the random number generator but after this you traverse thelist in random order as specified by this pre-computed table. Eventhough the bias is still there for any given move, it's basicallynon-predictable. On the very next move you will using a differenttable which you choose from in round robin fashion.Don't know how flawed this idea is. I believe it is would come closeto uniformly random but wouldn't be perfect. There would be a minorbias when measured over a lot of games, it would probably occur thata certain move was being chosen slightly more often, but nothing likethe bias in the current method.Is it worth it? Probably not, but just a crazy idea. Likely thereare simpler ways to eliminate most of the bias while still minimizingthe number of random move generations.
- Don
What good does moving it to the end of the list do? Next time aroundit's just as likely to be picked as when left in place. Or do youonly process the 'end' after the 'front' moves are all tried?
The range of the random number is reduced by one after each failedlookup. Shuffled data has no impact on future use of the array ofempty points.
Mark

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/
------------------------------------------------------------------------

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] 10k UCT bots

Reply via email to