I have done a bit more testing and some things start to become a little more clear to me. But I still have some open questions.

What I did was to rewrite my search such that I can set a constant that determines after how many simulations the tree gets expanded. And I have a switch to turn AMAF on or off. When the tree is expanded always with no AMAF this should behave the same as my original search algorithm. There's one caveat though. In the UCT formula sqrt( 2 * (log(parent-visits) / (10* visits)) the number of visits can not be zero of course. In my original implementation this was never the case, as it always did a simulation and update of the visit count upon expansion of a node. In the new implementation when expansion occurs it creates many nodes with a visit-count of zero. So I have to have a provision: if (visits==0) return MAX_VALUE; That will guarantee it to behave the same as my original allgorithm. I have tested enough games now that I'm reasonably confident that they are indeed the same.

This provision overwhelmes any value in visited nodes. That is actually by design of course, forcing all the moves at the current depth to be visited at least once before expanding a level deeper. But when using AMAF you want to postpone visiting nodes as long as possible as long as you seem to have a very good candidate. I have a strong suspicion that in my current tests UCT+AMAF shows a playing level very close to just plain UCT simply because MAX_VALUE overwhelmes any AMAF statistics gathered.

So I think if we're going to make AMAF part of the reference definition of a UCT search bot, then we need some consensus about what is a good formula for the 'virtual-value'. Especially for the case where vists==0. I call it 'virtual-value' where a node-value is virtual-value + win/visits. Part of virtual-value will be the UCT value, part of it will be based on the virtual-win/virtual-visits ratio. I can probably invent such a formula myself but I'd be interested to hear what people that have had a UCT-AMAF bot a bit longer are using that has been tried and tested.

Mark

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to