Hi everybody - especially Sylvain =) I'm wondering whether the formula to determine the balance between RAVE and UCT, beta = sqrt(c / 3 * parentVisits + c), has any mathematical background - or is it just a best guess for something that starts at 1 and is 1/2 after a certain number of visits?
Another question is about the prior integration. Apparently the prior, RAVE and UCT values are three different estimators for the winning probability. So why not use the above formula for prior vs. RAVE balancing, too, instead of initializing RAVE with it? Regards, Benjamin __________________________________________________________________________ Erweitern Sie FreeMail zu einem noch leistungsstärkeren E-Mail-Postfach! Mehr Infos unter http://produkte.web.de/club/?mc=021131 _______________________________________________ computer-go mailing list computer-go@computer-go.org http://www.computer-go.org/mailman/listinfo/computer-go/