Re: [computer-go] Representation of state-action pairs

Jason House Wed, 02 Jul 2008 11:11:05 -0700


Sent from my iPhone

On Jul 2, 2008, at 12:37 PM, Jason Galbraith <[EMAIL PROTECTED]> wrote:

I've been looking at RAVE (Rapid Action Value Estimate), which MoGouses. Thescore of states during simulation is stored in state-action pairs,which areall updated with the playouts, rather than just those states visitedin thetree. How would you store these scores? The number of potentialstates
visited seems prohibitively large.

Jason Galbraith
Orego research group

I think it's more like "for each state (in the UCT tree), scores aretracked for every action"

When I asked in the past, RAVE values for actions taken in a game areapplied to all parent nodes. I keep separate counters for RAVE and non-RAVE simulations.

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] Representation of state-action pairs

Reply via email to