Where are heuristics applied in UCT Go? The following is a review of the literature regarding application of heuristics to UCT Go. It explicitly does not address questions of how the heuristics are generated nor of other algorithmic improvements.
Numbers are section numbers within papers. I eagerly look forward to any comments, corrections, or expansions. Peter Drake http://www.lclark.edu/~drake/ Chaslot G.M.J.B., Winands M.H.M. Winands, Uiterwijk J.W.H.M., van den Herik H.J., and Bouzy B. Progressive strategies for Monte-Carlo tree search. 1.3.1: Add heuristic value (divided by # of playouts) to UCT value. 1.3.2: Progressive widening, similar ot Coulom below. Rémi Coulom. Computing Elo ratings of move patterns in the game of Go. 4.1: Heavy playouts, probability distribution. 4.2: Progressive widening in tree, add nth best heuristic move after a certain number of playouts. Peter Drake and Steve Uurtamo. Heuristics in Monte Carlo Go. 4: Heavy playouts, roulette wheel. Sylvain Gelly and David Silver. Combining online and offline knowledge in UCT. 5: Heavy playouts, probability distribution, with noise introduced in a variety of ways. 7: Value of new leaf is initialized by heuristics. Sylvain Gelly, Yizao Wang, Rémi Munos, and Olivier Teytaud. Modification of UCT with patterns in Monte-Carlo Go. 3.2: Heavy playouts, play any move suggested by heuristics, otherwise randomly. 3.4: Ordering of child creation in tree.
_______________________________________________ computer-go mailing list computer-go@computer-go.org http://www.computer-go.org/mailman/listinfo/computer-go/