Hi! On Tue, Apr 07, 2015 at 12:03:12PM +0200, Urban Hafner wrote: > Now that I have a bit of time again, what would be a good starting point to > improve upon UCT and light playouts? RAVE definitely comes to mind, as well > as enhancing the playouts with heuristics like the MoGo 3x3 patterns.
Well, my suggestions would be in the form of Michi (and its git history). ;-) > there are any good papers on adding priors to the search tree (and where > the underlying data is coming from)? I'm sure there must be, but I think I > just don't know how to search for it. Basically, you can either do progressive widening / unpruning / bias. The terminology is rather confusing. In case of progressive bias, you can either initialize the winrate with N wins (positive bias) and M losses (negative bias) with N and M determined by various heuristics (Fuego, Pachi, Michi use this), or have (1-alpha)*winrate + alpha*bias with bias being a "hypothetical winrate" determined by the heuristics and alpha: 1 -> 0 as #simulations: 0 -> infty (e.g. alpha=sqrt(c/n) or some other random formula like that). I know about no good survey papers personally. On Tue, Apr 07, 2015 at 07:20:37PM +0900, Hideki Kato wrote: > For prior values in the tree, almost(?) all strong programs use Remi's > method these days. > http://remi.coulom.free.fr/Amsterdam2007/MMGoPatterns.pdf Do you mean all the strong programs do progressive widening? -- Petr Baudis If you do not work on an important problem, it's unlikely you'll do important work. -- R. Hamming http://www.cs.virginia.edu/~robins/YouAndYourResearch.html _______________________________________________ Computer-go mailing list Computer-go@computer-go.org http://computer-go.org/mailman/listinfo/computer-go