On Fri, Oct 28, 2011 at 02:02:28PM -0700, Dave Dyer wrote:
> 
> >You have missed a key point. UCT is an ideal framework for integrating many
> >potentially incomplete, contradictory, or even potentially wrong, knowledge
> >sources. There are a zillion ways to add knowledge, and UCT guarantees that
> >you won't pay an asymptotic price.
> 
> 
> With random playouts, there is no place to add knowledge.  

Huh, what about node priors?

> With heavier playouts, the "signal" from the win rate can still
> be very weak, or even wrong. 

Of course, if it couldn't be wrong, we would have perfect programs. :)

My experience is that RAVE biases moves by knowledge much more strongly
than UCT, since frequent heavy playout choices are greatly magnified.

> UCT is particularly weak when a timely move is required - making the
> correct move 2 moves later looks 99% as good as making it immediately.

I can't remember hitting anything like that much.

It ain't always easy to integrate knowledge in UCT, but while I don't
have much practical experience with other game tree search algorithms,
I'd imagine it's even trickier with the classic evaluation function
approach.

-- 
                                Petr "Pasky" Baudis
We live on an island surrounded by a sea of ignorance. As our island
of knowledge grows, so does the shore of our ignorace. -- J. A. Wheeler
_______________________________________________
Computer-go mailing list
[email protected]
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go

Reply via email to