Re: [computer-go] Goal-directedness of Monte-Carlo

Michael Williams Mon, 08 Sep 2008 23:10:30 -0700

I don't think this specific test had been done. But I'm assuming the result will be the same as previous tests: deviating from <the pursuit of the the highestwinning percentage> leads to a degradation in strength.


Brett Koonce wrote:

Greetings from a lurker,
Forgive me if I am talking out of my hat. It has been a long time sinceI have done any real coding.
It seems most of the gains in MC/UCT come fairly quickly (or rather youcan get within 50% of a good move guess with a few iterations). Itwould be interesting to perhaps do a progressive stepping down/widening,i.e. 1k playouts with komi + 3 as the cutoff, then feed this tree into2k playouts with komi + 2, then 4k playouts with komi + 1, and thenfinally do the usual full blown regular analysis, say 50k playouts(numbers can be tweaked of course). You would lose the initialsimulations from your final one, so you would be sacrificing say 10% ofthe possible simulations, but on the other hand it would seem to biasthe tree toward making moves that have a greater chance of winning by agreedy amount without explicitly telling the computer it has to win by acertain number, which would seem dangerous if the simulations are nearthe threshold.
I apologize if this is an obvious idea, was just wondering if there wasa flaw with it/someone had done experiments in this direction already.
-Brett
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] Goal-directedness of Monte-Carlo

Reply via email to