Re: [computer-go] RAVE formula of David Silver (reposted)

Magnus Persson Fri, 28 Nov 2008 05:59:27 -0800

This document is confusing, but here is my interpretation of it. Andit works well for Valkyria. I would really want to see a pseudocodeversion of it. I might post the code I use for Valkyria, but it isprobably not the same thing so I would probably just increase theconfusion if I did...


Quoting Mark Boon <[EMAIL PROTECTED]>:


What is also not clear to me from the article is how this UCT_RAVE
value is used after it's calculated. In plain UCT search you select the
node with the highest win/loss+UCT value. How does the virtual win/loss
ratio get used in combination with the UCT-RAVE value resulting from
formula (14)? Is this explained in the original by Gelly and Silver?

The virtual win-visits (which I think you meant and not 'win/loss')ratios *are* what is computed in Equation 12. Equation 13 is "standardUCT". You use equation 14 instead of equation 13 to select the move tosearch. For moves that are searched a lot Eq14 will finally approachEq13, since Beta should go towards 0.

I think the term RAVE is often used in a confusing manner. Sometimesit just means AMAF or as I prefer virtual win-visit ratios, andsometimes RAVE seems to be that the algorithm that mixes the AMAFvalues with normal UCT-values as described in the PDF.


-Magnus
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] RAVE formula of David Silver (reposted)

Reply via email to