Re: [computer-go] JCIS extended abstract

Jason House Thu, 17 May 2007 20:13:53 -0700

Chaslot G (MICC) wrote:

p_hat = (w_i + n_h*H_B)/(n_i+n_h)


Interesting... But then how do you compute n_h in practice

The mathematical derivation is based on estimating an a-prioriprobability distribution. In theory, one simply needs to run MCsimulations for a wide variety of heuristically identical situations,and then fit the best beta distribution to the measured data. A betadistribution has two parameters - alpha and beta. n_h = alpha+betaand n_h*H_B = alpha.

In practice... I don't have an MC bot yet. I'm slowly redoing my botin D (an up and coming programming language http://www.tiobe.com/tpci.htm).

For the full version of my paper I will compare different ways to modify the probability distribution according to knowledge.I believe there is no optimal way to do that :(

Well, if beta distributions are a good fit, then the above would be theoptimal probability distribution... Of course, my analysis doesn'ttake tree searches into... Maybe I'll get lucky and it'll work well likea multi-armed bandit. Actually, even if it doesn't, being optimalbefore it's time to build a subtree may be enough. I think I've seenstuff like waiting until doing 100 simulations. If n_h is relativelysmall, the effect is probably sufficiently washed out by then andoptimality probably doesn't matter.

I guess if empirical evidence shows beta distributions are a good fitand a high n_h is appropriate, then I'll have to revisit the shortcomings...

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] JCIS extended abstract

Reply via email to