Hi,

2007/10/8, Benjamin Teuber <[EMAIL PROTECTED]>:
>
> Hi everybody - especially Sylvain =)
>
> I'm wondering whether the formula to determine the balance between RAVE
> and UCT,
> beta = sqrt(c / 3 * parentVisits + c),
> has any mathematical background - or is it just a best guess for something
> that starts at 1 and is 1/2 after a certain number of visits?


No it is just a tuning.... :)


Another question is about the prior integration. Apparently the prior, RAVE
> and UCT values are three different estimators for the winning probability.
> So why not use the above formula for prior vs. RAVE balancing, too, instead
> of initializing RAVE with it?
>

Our prior is actually classical and equivalent to a Dirichlet prior for the
RAVE value. Of course we could put the prior in other ways, put I strongly
believe that at this point the relevance of the prior is more important that
the way you use it.

Cheers,
Sylvain
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to