Re: [computer-go] How to "properly" implement RAVE?

Mark Boon Sat, 17 Jan 2009 15:29:48 -0800


On Jan 17, 2009, at 5:41 PM, Sylvain Gelly wrote:

For the first difference you mention, as far as I remember it makesa small but significant difference and is one of the main reason Iwas talking about "tricky details".

OK, I ran a test and after 1,000 games with 2K semi-light playouts Iget a winning percentage of 50.6% for your methods vs. mine. Of courseit's possible I made some mistake, but my first impression is it makesno difference which way you do this particular detail.

Your ChooseNode is also quite different from mine, mostly because Ialso still have a UCT component in there. I'll give your method a goone day, just to see if it changes anything.

I've come to understand what you mean by "tricky details", sometimes Isee a big difference in playing strength that I find hard to explaingiven the change(s) I made. Conversely I've been in quite a few caseswhere I thought something would make a difference, only to find out itall didn't matter one bit.

It's also possible that some deficiencies that would be apparent inone implementation, get compensated for in another.

Some examples: David Fotland wrote he does light playouts with just afew patterns but no tactics. I find that using a moderate amount oftactics actually is the biggest contributor to playing strength (saveone or more stones if can't be caught in ladder). However, augmentingpatterns with tactical information I found doesn't help at all, evenwhen disregarding the performance cost. Maybe David uses some patternsto compensate for part of the tactics and relies on the fasterplayouts to compensate for poorer playouts. I'm guessing here, butotherwise I can't imagine why he would forego what otherwise seems tobe a big gain in strength.

I also tried to use ownership maps to modify the RAVE value. RemiCoulom wrote in a paper he used ownership information of up to 63playouts. When I tried something similar it always makes play weaker.Maybe I should use it in a different way, but I haven't stumbled onthe solution yet. When I think of it, AMAF information is alreadysomething very similar to ownership information. So maybe combiningthe two doesn't make much sense.

Lastly, in an earlier UCT bot that I made I gained a lot by initiallyreducing the number of moves and slowly expanding it. After using AMAFit turns out this method hardly gains anything at all anymore.

So the devil is not only in the details, it's also in the combinationof the details.


Mark


_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] How to "properly" implement RAVE?

Reply via email to