Hi everybody - especially Sylvain =) 

I'm wondering whether the formula to determine the balance between RAVE and UCT,
beta = sqrt(c / 3 * parentVisits + c),
has any mathematical background - or is it just a best guess for something that 
starts at 1 and is 1/2 after a certain number of visits?

Another question is about the prior integration. Apparently the prior, RAVE and 
UCT values are three different estimators for the winning probability. So why 
not use the above formula for prior vs. RAVE balancing, too, instead of 
initializing RAVE with it?

Regards,
Benjamin
__________________________________________________________________________
Erweitern Sie FreeMail zu einem noch leistungsstärkeren E-Mail-Postfach!        
        
Mehr Infos unter http://produkte.web.de/club/?mc=021131

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to