Hi,
2007/10/8, Benjamin Teuber <[EMAIL PROTECTED]>: > > Hi everybody - especially Sylvain =) > > I'm wondering whether the formula to determine the balance between RAVE > and UCT, > beta = sqrt(c / 3 * parentVisits + c), > has any mathematical background - or is it just a best guess for something > that starts at 1 and is 1/2 after a certain number of visits? No it is just a tuning.... :) Another question is about the prior integration. Apparently the prior, RAVE > and UCT values are three different estimators for the winning probability. > So why not use the above formula for prior vs. RAVE balancing, too, instead > of initializing RAVE with it? > Our prior is actually classical and equivalent to a Dirichlet prior for the RAVE value. Of course we could put the prior in other ways, put I strongly believe that at this point the relevance of the prior is more important that the way you use it. Cheers, Sylvain
_______________________________________________ computer-go mailing list computer-go@computer-go.org http://www.computer-go.org/mailman/listinfo/computer-go/