I've been looking at RAVE (Rapid Action Value Estimate), which MoGo uses. The score of states during simulation is stored in state-action pairs, which are all updated with the playouts, rather than just those states visited in the tree. How would you store these scores? The number of potential states visited seems prohibitively large.
Jason Galbraith Orego research group _______________________________________________ computer-go mailing list computer-go@computer-go.org http://www.computer-go.org/mailman/listinfo/computer-go/