Wesley Turner and me are analysing "pure Monte Carlo" (without tree search component) in (artificial) simple games. Let MC(k) be the version where each candidate move gets exactly k simulations. We found examples where for instance MC(8) performs better than MC(16), but for k to infinity pure Monte Carlo plays perfectly.
I guess that what is needed for that is something like: * a best move * several bad moves that look good to MC * even more intermediate moves that look slightly less good to MC. So that for MC(8), we typically get a move of the third category, for MC(16), unless lucky, of the secont category, and of course the best move for MC(\infty).
Question to the MCTS programmers (in Go): Do you have example positions where MCTS makes good move proposals for short run times and bad proposals for medium long run times? (Of course for time to infinity it should converge to good moves.)
I guess it would be easier still: make a tactical position, where the opponent has only one road against a bad move. Until the tree finds it, it will look better than a good move that gives a few points. And if there are many slightly less good moves that give more points, they will be preferred at first. Of course, since we really have time when we have a tree, we could also probably create a position with "refutations" at different depths, the move with shorter-depth refutations having less dire consequences. Now, that's theory-crafting, I have no example to give. Jonas _______________________________________________ Computer-go mailing list [email protected] http://dvandva.org/cgi-bin/mailman/listinfo/computer-go
