On Sat, 2008-02-09 at 11:50 +0100, Olivier Teytaud wrote:
> > I think it is time to share this idea with the world :-)
> > The idea is to estimate bias and variance to calculate the best combination
> > of UCT and RAVE values.
> > I have attached a pdf explaining the new formula.
>
> It is written in the pdf file that the formula is the one in MoGo;
> but in MoGo there's no sqrt(log...), i.e. c_r=c_u=0.
If I read that correctly, MoGo no longer uses upper confidence bounds and selects on means alone. It would also mean that MoGo will _never_ explore improbable children (after a few sims) unless RAVE yields an unusually high estimate for them. Is all of that correct?
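
To make the question concrete, here is a minimal sketch of what I have in mind. It is not the formula from the pdf and not MoGo's code: the fixed weight `beta`, the function names, and the example numbers are all made up for illustration. Each child gets a weighted mix of its UCT mean and its RAVE mean, each with an optional sqrt(log...) bonus scaled by c_u and c_r.

```python
import math

def blended_value(mean_uct, n_uct, mean_rave, n_rave,
                  n_parent, beta, c_u=0.0, c_r=0.0):
    """Mix a UCT estimate and a RAVE estimate for one child.

    beta is a hypothetical fixed weight on the RAVE term; a real program
    would compute it from the counts. Assumes n_uct >= 1 and n_rave >= 1.
    """
    bonus_u = c_u * math.sqrt(math.log(n_parent) / n_uct)
    bonus_r = c_r * math.sqrt(math.log(n_parent) / n_rave)
    return beta * (mean_rave + bonus_r) + (1.0 - beta) * (mean_uct + bonus_u)

def pick(children, n_parent, beta, c_u=0.0, c_r=0.0):
    """Return the name of the child with the highest blended value."""
    return max(
        children,
        key=lambda name: blended_value(*children[name], n_parent, beta, c_u, c_r),
    )

# Made-up statistics: name -> (mean_uct, n_uct, mean_rave, n_rave)
children = {
    "a": (0.55, 900, 0.56, 2000),  # well-explored, good means
    "b": (0.30, 5, 0.35, 40),      # barely explored, poor means so far
}

greedy = pick(children, n_parent=1000, beta=0.5)                      # c_u = c_r = 0
with_bonus = pick(children, n_parent=1000, beta=0.5, c_u=1.0, c_r=1.0)
print(greedy, with_bonus)
```

With c_r=c_u=0 the selection is purely greedy on the means, so "b" is picked again only if its RAVE mean rises enough to lift the mix above "a". With nonzero constants, the bonus on the rarely visited child is large enough to get it revisited.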