UCT is based on a theory of a multi-armed bandit, with uncertain knowledge 
about which "arms" would be most productive. Is it possible to graft various 
sources of knowledge into a sort of meta-bandit algorithm?

As to fusing top-level knowledge with random playouts, I love the idea, and am 
trying to imagine  how to implement it. One idea is this: certain moves in 
certain situations might trigger a forced reply. Playing one of a pair of miai, 
for instance, should result in a high probability of the matching move played 
as a response - especially if failure to do so would result in killing a group. 
Analysis of a group could conclude that life depends on certain external 
liberties, or the ability to play one of two alternate moves, yadda yadda; 
those threats then trigger appropriate automatic responses with high 
probability.
 
Regarding the scalability study, the results are tricking in more slowly now, I 
think. Is the number of machines in use the same as before? I'm very curious 
about that flat spot for Mogo-16, 17, and 18. ( 
http://cgos.boardspace.net/study/index.html )


Terry McIntyre <[EMAIL PROTECTED]> 





      
____________________________________________________________________________________
Looking for last minute shopping deals?  
Find them fast with Yahoo! Search.  
http://tools.search.yahoo.com/newsearch/category.php?category=shopping
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to