In my non-go MCTS games, I usually score playouts on a continuous 
-1:1 scale rather than as +1 or -1.   I use the same arithmetic to
update UCT values, and it seems to work at least as well as strict
win/loss scoring.   

The motivation for this is to allow the playouts to be stoped at any
chosen point, not just at the end of game; and for actually ended
games, to allow the score to give feedback about "better" wins.

_______________________________________________
Computer-go mailing list
[email protected]
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go

Reply via email to