I've been looking at RAVE (Rapid Action Value Estimate), which MoGo uses.  The
score of states during simulation is stored in state-action pairs, which are
all updated with the playouts, rather than just those states visited in the
tree.  How would you store these scores?  The number of potential states
visited seems prohibitively large.

Jason Galbraith
Orego research group


_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to