I would be very interested to see the RAVE code?from Valkyria. I'm sure others 
would be too.

- Dave Hillis


-----Original Message-----
From: Magnus Persson <[EMAIL PROTECTED]>
To: computer-go@computer-go.org
Sent: Fri, 28 Nov 2008 8:58 am
Subject: Re: [computer-go] RAVE formula of David Silver (reposted)


This document is confusing, but here is my interpretation of it. And it works 
well for Valkyria. I would really want to see a pseudocode version of it. I 
might post the code I use for Valkyria, but it is probably not the same thing 
so I would probably just increase the confusion if I did...?
?
Quoting Mark Boon <[EMAIL PROTECTED]>:?
?
>?
> What is also not clear to me from the article is how this UCT_RAVE?
> value is used after it's calculated. In plain UCT search you select the?
> node with the highest win/loss+UCT value. How does the virtual win/loss?
> ratio get used in combination with the UCT-RAVE value resulting from?
> formula (14)? Is this explained in the original by Gelly and Silver??
?
The virtual win-visits (which I think you meant and not 'win/loss') ratios 
*are* what is computed in Equation 12. Equation 13 is "standard UCT". You use 
equation 14 instead of equation 13 to select the move to search. For moves that 
are searched a lot Eq14 will finally approach Eq13, since Beta should go 
towards 0.?
?
I think the term RAVE is often used in a confusing manner. Sometimes it just 
means AMAF or as I prefer virtual win-visit ratios, and sometimes RAVE seems to 
be that the algorithm that mixes the AMAF values with normal UCT-values as 
described in the PDF.?
?
-Magnus?
_______________________________________________?
computer-go mailing list?
[EMAIL PROTECTED]
http://www.computer-go.org/mailman/listinfo/computer-go/?

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to