My original chart of numbers was created before it occurred to me that maybe the check whether the move has been played before is not needed. It is of course possible that the check IS needed. I ran a new test after removing that check and the numbers don't look as good:

At    1 playouts per move, the modified version wins 47.8% of the time (±2.2%) 
after 500 games.
At    2 playouts per move, the modified version wins 52.2% of the time (±2.2%) 
after 500 games.
At    4 playouts per move, the modified version wins 56.3% of the time (±2.9%) 
after 300 games.
At    8 playouts per move, the modified version wins 60.3% of the time (±2.8%) 
after 300 games.
At   16 playouts per move, the modified version wins 58.3% of the time (±2.8%) 
after 300 games.
At   32 playouts per move, the modified version wins 57.3% of the time (±2.9%) 
after 300 games.
At   64 playouts per move, the modified version wins 54.3% of the time (±2.9%) 
after 300 games.
At  128 playouts per move, the modified version wins 51.3% of the time (±2.9%) 
after 300 games.
At  256 playouts per move, the modified version wins 57.0% of the time (±2.9%) 
after 300 games.
At  512 playouts per move, the modified version wins 52.7% of the time (±2.9%) 
after 300 games.
At 1024 playouts per move, the modified version wins 46.0% of the time (±2.9%) 
after 300 games.
At 2048 playouts per move, the modified version wins 54.3% of the time (±2.9%) 
after 300 games.
At 4096 playouts per move, the modified version wins 57.7% of the time (±2.9%) 
after 300 games.

Don, did you retain the check in your version that has the weighted AMAF?
What ever you did, could you also add a version to the study that does the 
opposite?


Mark Boon wrote:
What do those number mean exactly? Are those winning percentages against your ref-bot with 1,000 playouts?

I'm not getting the same results as Michael does. I see some improvement, but not nearly as much.

I had to make a few changes however, as I was adding up the results in a different way than your ref-bot. So what I did was implement tracking the AMAF statistics exactly the same way as in your ref-bot. Next I ran a few hundred games, comparing the win/hit ratio of the newly implemented method with my old one to make sure I got the same result. When that checked out I made the modifications as described by Michael. It seems rather straightforward, but after 200 games I only get a win-rate of ~55% with 2,000 playouts instead of the 63% he has. That's a considerable difference.

Possibly I introduced a little bug somewhere... Or I need a larger test-sample.

I do agree with Michael that this idea is more logical and simpler. I had implemented a different way to check which side moved at which location first. I believe it's more efficient but it doesn't lend itself for Michael's method. However, your way of checking seems rather costly as it's a loop of 1/4 N^2 iterations, where N is the number of moves (which we know is 111 in the beginning). With Michael's method this is reduced to 1/2N iterations.

My ref-bot on CGOS is still going. It seems to be always within a few ELO points of your Jref and Cref bots. So I'm going to stop it as I don't see much point in populating CGOS with identical bots. It has played over 500 games, which I think is enough empirical evidence that the implementation is 'good enough'.

    Mark


On 28-okt-08, at 10:54, Don Dailey wrote:

I have empirical evidence that mwRefbot level 4096 (the Michael Williams
enhanced reference bot at 4096 playouts) is stronger than the reference
bot at the same (and even higher levels.)


Rank Name              Elo    +    - games score oppo. draws
   1 mwRefbot-004096  2684   23   22  1529   78%  2140    0%
   2 refbot-008192    2670   21   21  1532   73%  2274    0%
   3 refbot-004096    2645   21   21  1528   71%  2251    0%
   4 refbot-002048    2555   21   21  1530   63%  2237    0%
   5 refbot-001024    2431   23   23  1529   54%  2178    0%
   6 refbot-000512    2253   27   28  1531   47%  2066    0%
   7 refbot-000256    2000   34   35  1529   42%  1932    0%
   8 refbot-000128    1646   42   42  1529   45%  1655    0%
   9 refbot-000064    1290   40   40  1529   50%  1321    0%
  10 refbot-000032     970   40   40  1531   52%  1041    0%
  11 refbot-000016     621   33   32  1529   55%   780    0%
  12 refbot-000008     370   25   25  1529   47%   649    0%
  13 refbot-000004     250   23   22  1529   43%   551    0%
  14 refbot-000002     145   22   22  1529   35%   500    0%
  15 refbot-000001       5   22   23  1528   22%   475    0%
  16 refbot-000000       0   22   23  1531   22%   484    0%

- Don


_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to