Hi,
 
While recently implementing UCT, I've come across two cases where one may want 
to increase the number of visits to a node by something greater than 1.
 
A) Leaf nodes - Artifically set the number of visits to some very high number.  
The rationale for this is to accelerate propagation of the leaf condition "up 
the tree" so as not to waste future time descending on parts of the tree which 
have a known outcome.  Does that make sense?
 
B) In the section "UCT with Prior Knowledge" within the paper "Combining Online 
and Offline Knowledge in UCT' Gelly/Silver mention seeding the visits with the 
"equivalent experience" contained in the prior value function, eg.
 
n(s,a) <- n-prior(s,a)
 
Therefoe, I assume that in Gelly/Silver's "updateValue(node,value), the line:
 
node[i].nb = node[i].nb + 1;
 
is replaced with something like:
 
node[i].nb = node[i].nb + k;
 
where k >= 1
 
This raises two questions:
 
1) What is an effective way of dealing with leaf nodes?  Is what I describe 
above in A) typical?
 
2) Is it really OK to increase the number of visits and propagate them "up the 
tree" this way?  Does it break any of the statistical assumptions inherent in 
the algorithm.  I suppose it must be "OK" in that it is mentioned in the paper.
 
Sorry if these questions seem basic, but I think it's important to get the 
details right.
 
Thanks, I appreciate any further insights.
-- Greg


      
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to