Re: [computer-go] Monte-Carlo Tree Search reference bot

Mark Boon Wed, 03 Dec 2008 03:55:43 -0800


On 3-dec-08, at 06:09, Sylvain Gelly wrote:

What I did (was a "long" time ago, I don't know if it is still used in
Mogo), is to compute the m best moves every so often and most of the
time just do the max over those m moves.
m was on the order of 5, and "every so often" was an increasing
function like sqrt or log (I don't remember).
That speeds up quite a lot (when the max becomes the bottleneck), and
if you tune the parameters well you almost don't loose any strength at
a fixed number of playouts.

I thought about that, but I was afraid the code would become tooobscure. After all, this is supposed to be a referenceimplementation. But maybe I should actually give it a try to see whatit would look like.

I also played with precomputing a table of the log() function butfound little speedup. Instead I decided to (p)recompute the log(nr-parent-visits) only once in 8 times. According to my profiler thatreduced the log() from using 8% to 1% of computation time. Anotherthing I did was replace log(nr-virtual-parent-visits) in the RAVEcalculation by the same log(nr-parent-visits) used in the UCTcalculation, cutting both the number of times I need to call log inhalf and reducing the amount of storage in the node a little. Afterthese modifications I didn't notice any degradation in play.

In terms of memory use, I did a few obvious things to keep storage ofthe nodes to a minimum. I had seen people write about memory usage ofthe tree, but never understood the concerns. But that was because Ialways expanded the tree one node at a time. Of course, if you createall the children in one go like this reference implementation does,it's a little bit a different story. But still, I have to push mycomputer hard to run out of space before I run out of time so Ididn't see much need to reduce memory too aggressively. Using just amoderately small number of playouts before expansion is alreadyenough to never have memory problems. But if memory is a concern toanyone I see two obvious ways to make make significant reductions.One is to flatten the data-structure, reducing the object overheadJava imposes. That could save up to a third. The other is to usesmaller placeholders for nodes that have been created, but not yetvisited. Once a node gets visited it can be replaced by a full-blownnode, but until then all you need is the move-coordinate and the AMAFstatistics. That should save up to 75% or so.


Mark

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] Monte-Carlo Tree Search reference bot

Reply via email to