I think that you are essentially correct. However, this is only going to affect a small number of games where two different moves are exactly tied for the best winning percentage, after many playouts. Even if the underlying probabilities are exactly the same, you can't really expect this to happen much.
Weston On 2/11/07, Heikki Levanto <[EMAIL PROTECTED]> wrote:
But there is a way. If we do N play-outs, the effect of any single of them is 1/N. If we make sure to scale the score to be less than half of this, it can not disturb anything in cases where the number of wins is different. Only in cases with exactly the same number of wins in the play-outs, would the score break the tie. In other words my "large" constant of 1000 was far too small. It would have to be something like 2NM, where M is the maximum score (say 361). Round it up to 1000N, and we should be safe. I still believe it would make endgames look more reasonable, and possibly even better, in case the winning program has overlooked a detail somewhere, having a large margin of points on the board should act as an insurance against small blunders.
_______________________________________________ computer-go mailing list computer-go@computer-go.org http://www.computer-go.org/mailman/listinfo/computer-go/