QUESTION: Who has ideas how such a k-best mode for MCTS might best be organized? (Here k is the number of candidates who get large numbers of playouts - and thus "precise" evaluations.)
Just make the first round of bandits be: * choose K arms instead of one. Same rules otherwise. We choose the K arms with biggest UCB, or whatever criterion is used to choose one arm usually. Jonas _______________________________________________ Computer-go mailing list [email protected] http://dvandva.org/cgi-bin/mailman/listinfo/computer-go
