Jacques Basaldúa wrote:
Very good analysis and I would like to contribute a 4th reason:
As Luke Gustafson pointed out, we have to contemplate the simulation
as a _stochastic process_. We want to determine the conditional
probability of a win given a particular move is made. And that depends
on the _length of the simulation_. Dramatically! This argues against
the scalability of global-search MC/UCT: if the simulation is
500 moves long (Chinese rules with recaptures, etc.), the observed
variance at an early move blurs everything out.
Just a simple stochastic process: count a dollar each time you
correctly predict a fair coin (p = 1/2) thrown n = 500 times. The
expected total is (obviously) 250, but the variance of that count is
n·p·(1-p) = 125, proportional to n.
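The coin-game numbers are easy to check empirically. This is just an illustrative sketch (not from the original post): it plays the prediction game many times and estimates the mean and variance of the score, which should land near n·p = 250 and n·p·(1-p) = 125.

```python
import random

def coin_game(n=500, p=0.5, trials=5000, seed=1):
    """Play the prediction game `trials` times. Each game counts the
    number of correct guesses over n coin flips, so each score is a
    Binomial(n, p) draw. Returns the empirical (mean, variance)."""
    rng = random.Random(seed)
    scores = []
    for _ in range(trials):
        # Each flip is predicted correctly with probability p.
        scores.append(sum(rng.random() < p for _ in range(n)))
    mean = sum(scores) / trials
    var = sum((s - mean) ** 2 for s in scores) / trials
    return mean, var

mean, var = coin_game()
# Theory: mean = n*p = 250, variance = n*p*(1-p) = 125 (grows linearly in n)
```

Doubling n to 1000 roughly doubles the observed variance, which is the scaling argument in the text: the noise contributed by long playouts grows with their length.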
Good point. This leads to another thought I have been wondering
about: I question whether spending more time to search more
simulations in the opening is the best approach. For the opening,
selecting reasonably robust moves that tend to lead to more favorable
options is probably a good objective; the simulations are perhaps too
long to expect anything better. Later, from the pre-middle game into
the middle game, it is critical to play so that the position's
tactical potential is exploited to secure connections, eye space,
etc. It would seem to me that focusing the highest concentration of
time and simulations on this part of the game might be most
advantageous.
It would be interesting for someone with a decent MC player to run an
experiment like this: one version concentrating the highest number of
simulations in the opening and one concentrating them in the middle
game, otherwise equal, and see which version wins more often.
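One way to set up such an experiment is to hold the total simulation budget fixed and only change how it is distributed over the game. The sketch below is hypothetical (the triangular weighting, peak positions, and budget numbers are all my own assumptions, not anything proposed in the thread); it just shows two schedules that spend the same total but peak at different stages.

```python
def weight(move, game_length, peak):
    """Triangular profile: largest at `peak`, falling off linearly
    toward both ends of the game (floored at 1 so no move gets zero)."""
    return max(game_length - abs(move - peak), 1)

def schedule(total_sims, game_length, peak):
    """Per-move simulation budgets, normalized so every schedule
    spends exactly `total_sims` over the whole game."""
    ws = [weight(m, game_length, peak) for m in range(game_length)]
    total_w = sum(ws)
    return [total_sims * w / total_w for w in ws]

# Two versions, equal total budget, differing only in where it peaks.
# peak=20 and peak=120 are arbitrary stand-ins for "opening" and "middle game".
opening_heavy = schedule(2_000_000, 250, peak=20)
middle_heavy = schedule(2_000_000, 250, peak=120)
```

Playing the two versions against each other over many games, with everything else identical, would give the comparison suggested above.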
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/