Re: [computer-go] Michael Williams enhancement

Michael Williams Tue, 28 Oct 2008 08:17:18 -0700

My original chart of numbers was created before it occurred to me that maybe the check whether the move has been played before is not needed. It is of coursepossible that the check IS needed. I ran a new test after removing that check and the numbers don't look as good:


At    1 playouts per move, the modified version wins 47.8% of the time (±2.2%) 
after 500 games.
At    2 playouts per move, the modified version wins 52.2% of the time (±2.2%) 
after 500 games.
At    4 playouts per move, the modified version wins 56.3% of the time (±2.9%) 
after 300 games.
At    8 playouts per move, the modified version wins 60.3% of the time (±2.8%) 
after 300 games.
At   16 playouts per move, the modified version wins 58.3% of the time (±2.8%) 
after 300 games.
At   32 playouts per move, the modified version wins 57.3% of the time (±2.9%) 
after 300 games.
At   64 playouts per move, the modified version wins 54.3% of the time (±2.9%) 
after 300 games.
At  128 playouts per move, the modified version wins 51.3% of the time (±2.9%) 
after 300 games.
At  256 playouts per move, the modified version wins 57.0% of the time (±2.9%) 
after 300 games.
At  512 playouts per move, the modified version wins 52.7% of the time (±2.9%) 
after 300 games.
At 1024 playouts per move, the modified version wins 46.0% of the time (±2.9%) 
after 300 games.
At 2048 playouts per move, the modified version wins 54.3% of the time (±2.9%) 
after 300 games.
At 4096 playouts per move, the modified version wins 57.7% of the time (±2.9%) 
after 300 games.


Don, did you retain the check in your version that has the weighted AMAF?
What ever you did, could you also add a version to the study that does the 
opposite?


Mark Boon wrote:

What do those number mean exactly? Are those winning percentages againstyour ref-bot with 1,000 playouts?
I'm not getting the same results as Michael does. I see someimprovement, but not nearly as much.
I had to make a few changes however, as I was adding up the results in adifferent way than your ref-bot. So what I did was implement trackingthe AMAF statistics exactly the same way as in your ref-bot. Next I rana few hundred games, comparing the win/hit ratio of the newlyimplemented method with my old one to make sure I got the same result.When that checked out I made the modifications as described by Michael.It seems rather straightforward, but after 200 games I only get awin-rate of ~55% with 2,000 playouts instead of the 63% he has. That's aconsiderable difference.
Possibly I introduced a little bug somewhere... Or I need a largertest-sample.
I do agree with Michael that this idea is more logical and simpler. Ihad implemented a different way to check which side moved at whichlocation first. I believe it's more efficient but it doesn't lend itselffor Michael's method. However, your way of checking seems rather costlyas it's a loop of 1/4 N^2 iterations, where N is the number of moves(which we know is 111 in the beginning). With Michael's method this isreduced to 1/2N iterations.
My ref-bot on CGOS is still going. It seems to be always within a fewELO points of your Jref and Cref bots. So I'm going to stop it as Idon't see much point in populating CGOS with identical bots. It hasplayed over 500 games, which I think is enough empirical evidence thatthe implementation is 'good enough'.
    Mark


On 28-okt-08, at 10:54, Don Dailey wrote:
I have empirical evidence that mwRefbot level 4096 (the Michael Williams
enhanced reference bot at 4096 playouts) is stronger than the reference
bot at the same (and even higher levels.)


Rank Name              Elo    +    - games score oppo. draws
   1 mwRefbot-004096  2684   23   22  1529   78%  2140    0%
   2 refbot-008192    2670   21   21  1532   73%  2274    0%
   3 refbot-004096    2645   21   21  1528   71%  2251    0%
   4 refbot-002048    2555   21   21  1530   63%  2237    0%
   5 refbot-001024    2431   23   23  1529   54%  2178    0%
   6 refbot-000512    2253   27   28  1531   47%  2066    0%
   7 refbot-000256    2000   34   35  1529   42%  1932    0%
   8 refbot-000128    1646   42   42  1529   45%  1655    0%
   9 refbot-000064    1290   40   40  1529   50%  1321    0%
  10 refbot-000032     970   40   40  1531   52%  1041    0%
  11 refbot-000016     621   33   32  1529   55%   780    0%
  12 refbot-000008     370   25   25  1529   47%   649    0%
  13 refbot-000004     250   23   22  1529   43%   551    0%
  14 refbot-000002     145   22   22  1529   35%   500    0%
  15 refbot-000001       5   22   23  1528   22%   475    0%
  16 refbot-000000       0   22   23  1531   22%   484    0%

- Don


_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] Michael Williams enhancement

Reply via email to