On 5/11/07, Brian Slesinsky <[EMAIL PROTECTED]> wrote:

> Going along with this, the numbers won't add up (although I don't know
> if that is important.)   In other words, if you do 10,000 simulations at
> the root, all grandchildren will add up to more (due to transpositions.)
> If you propogate this up the tree you might come up with many more than
> 10,000 simulations at the root.

Maybe this is obvious, but it seems like the numbers do add up if you
think about it differently?   When the program does a simulation that
has multiple ancestors, it is effectively working on multiple
simulations at once (counting each transposition separately.)  The
count at each node would then be the number of virtual simulations
done.  You could count the number of actual simulations separately and
see what the ratio is between them at the root to see how effective
the tables are.




Consider a node who's had one child extensively evaluated through
transpositions.  When UCT does come to visit it, the sqrt(sum(child
simulations)) will be very high.  That will artificially favor exploration
of children with less simulations.  That will have a continuing negative
impact on the exploitation.

... but then once the parent gets updated with the sum(child simulations),
UCT may simply stop searching that branch for a long time... Because it
likely had a low average win rate, and now looks to be extensively
explored.  When UCT eventually does come back to it, the problem in the
previous paragraph will still apply.
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to