On Sat, 2008-02-09 at 11:50 +0100, Olivier Teytaud wrote:
> > I think it is time to share this idea with the world :-)
> > The idea is to estimate bias and variance to calculate the best combination 
> > of UCT and RAVE values.
> > I have attached a pdf explaining the new formula.
> 
> It is written in the pdf file that the formula is the one in MoGo;
> but in MoGo there's no sqrt(log...), i.e. c_r=c_u=0.

That translates to mean that MoGo no longer uses upper confidence
bounds, and only uses means.  It also means that MoGo will _never_
explore improbable children (after a few sims) unless the RAVE value
yields an unusually high estimate for it.  Is all of that correct?  

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to