> "The approach of this paper is to treat all win rate estimations as
> independent estimators with additive white Gaussian noise."
Have you tried whether that works? (As Łukasz Lew wrote, "experimental
setup would be useful.") I suspect there may be a flaw in your idea,
though I am not a specialist. I will try to explain it.
If it weren't for the fact that the tree is learning, the probability
that a playout through a node wins would be constant each time the node
is visited. This is, of course, a simplification, because the tree does
learn, but it holds at least between playouts that are not far apart in
time, so my argument applies to some (I would guess, large) extent. The
same applies to the RAVE estimator, which is also the result of counting
wins (assuming P(win | that move) is constant) and dividing by the
appropriate sample size.
Therefore, these estimators follow a binomial distribution. That does
converge to the normal, but with a fundamental caveat: unlike the
normal, in which mean and variance are independent parameters, here the
variance is a function of p. The variance of the binomial, n·p·(1-p),
is a _function of p_.
Therefore, the variance of the normal that best approximates the
distribution of both RAVE and wins/(wins + losses) is the corresponding
p·(1-p)/n (it is n·p·(1-p) for the raw win count).
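A quick simulation makes this concrete (my illustration, not from the
original post; plain Bernoulli playouts with a fixed, hypothetical
P(win) = p, which is the static-tree simplification above): the spread
of the win-rate estimator across repeated runs matches p·(1-p)/n.

```python
import random

# Sketch under the assumptions above: estimate the win rate from n
# independent playouts with fixed P(win) = p, repeat many times, and
# compare the empirical variance of the estimator with p*(1-p)/n,
# the variance of the binomial proportion.
random.seed(0)
p, n, trials = 0.6, 100, 20000

estimates = []
for _ in range(trials):
    wins = sum(1 for _ in range(n) if random.random() < p)
    estimates.append(wins / n)  # wins / (wins + losses)

mean = sum(estimates) / trials
emp_var = sum((x - mean) ** 2 for x in estimates) / trials
theory_var = p * (1 - p) / n

print(mean, emp_var, theory_var)  # empirical variance tracks p*(1-p)/n
```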
If this is true, the variance you measure from the samples does not
contain any information about the precision of the estimators beyond
what the estimated p itself already gives. If someone understands this
better, please explain it to the list.
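The closing point can be checked directly (again my illustration, with
an arbitrary win probability): for 0/1 win/loss outcomes the sample
variance is exactly p̂·(1-p̂), a deterministic function of the sample
mean, so measuring it tells you nothing the mean did not already.

```python
import random

# For x in {0, 1} we have x**2 == x, so the (biased) sample variance
#   mean(x**2) - mean(x)**2  ==  p_hat - p_hat**2  ==  p_hat*(1 - p_hat)
# is fully determined by the sample mean p_hat.
random.seed(1)
outcomes = [1 if random.random() < 0.45 else 0 for _ in range(500)]

p_hat = sum(outcomes) / len(outcomes)
sample_var = sum((x - p_hat) ** 2 for x in outcomes) / len(outcomes)

print(sample_var, p_hat * (1 - p_hat))  # identical up to rounding
```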
Jacques.
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/