Re: [computer-go] FW: computer-go] Monte carlo play?

Magnus Persson Sun, 16 Nov 2008 05:45:34 -0800

Quoting Hideki Kato <[EMAIL PROTECTED]>:

Heikki Levanto: <[EMAIL PROTECTED]>:

The way I understand it, modern Monte Carlo programs do not even try to
emulate a human player with a random player - obviously that would not work.


I believe CrazyStone's use of patterns does so and it seems
successful.


With Valkyria I try to follow these principles in heavy playouts.

1) In contact fights there are a lot of shapes that are played most of the time. Thus, for each move played, Valkyria checks whether there is an obvious local response to it. If so, it plays it deterministically. In many situations there are two or more such candidates, and then it plays one of those moves.

2) In many positions the last move played does not trigger any obvious response, and then a random move is chosen uniformly.

3) There are moves that are inferior 100% of the time both locally and globally. These moves are pruned if they are selected and a new random move is chosen as long as there are moves left to try.
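The three rules above can be sketched roughly as follows. This is only an illustration, not Valkyria's actual code: `local_responses` and `is_always_bad` are hypothetical stand-ins for the hand-coded pattern matchers, and choosing uniformly among several local candidates is an assumption, since the post only says that one of them is played.

```python
import random


def choose_playout_move(legal_moves, last_move, local_responses, is_always_bad,
                        rng=random):
    """Pick one playout move, or None (pass) if nothing acceptable is left.

    local_responses(last_move) -> list of obvious local replies (hypothetical).
    is_always_bad(move) -> True if the move is always inferior (hypothetical).
    """
    # Rule 1: if the last move has obvious local answers, play one of them.
    # With a single candidate this is deterministic.
    if last_move is not None:
        candidates = [m for m in local_responses(last_move) if m in legal_moves]
        if candidates:
            return rng.choice(candidates)  # uniform choice is an assumption

    # Rules 2 and 3: otherwise draw moves uniformly at random, rejecting
    # pruned moves and redrawing for as long as there are moves left to try.
    remaining = list(legal_moves)
    rng.shuffle(remaining)
    for move in remaining:
        if not is_always_bad(move):
            return move
    return None  # every legal move was pruned


if __name__ == "__main__":
    legal = [(0, 0), (0, 1), (1, 1)]
    # The last move at (1, 0) has exactly one obvious answer at (1, 1).
    print(choose_playout_move(legal, (1, 0), lambda m: [(1, 1)], lambda m: False))
```

Shuffling once and taking the first move that is not pruned gives the same distribution as repeatedly drawing a random move and discarding it when it turns out to be pruned.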

I have hundreds of hand-coded patterns for both 1 and 3. It would be too time consuming to test these patterns, so I use my knowledge and intuition (European 2 dan) to simply decide which patterns to include.

So Valkyria has a lot of go knowledge, but mostly the kind of knowledge that all go players have up to some strength, perhaps 8-10 kyu. It has no knowledge about global matters. The beauty of MC evaluation is that globally strong moves are most of the time evaluated better than globally weak moves. Heavy playouts remove noise from MC evaluation and make it more sensitive to the true value of moves. Still, there are biases with all heavy playouts, but they are overcome with MC Tree Search (MCTS), which corrects mistakes in the evaluation recursively.


Here is my latest scaling experiment on 9x9 for Valkyria.

Valkyria plays 1150 random games per second on my 4 year old laptop.

This test is against gnugo 3.7.10, assumed to be Elo 1800. Most data points are based on 500 games. "N sims" means Valkyria plays N heavy playouts per move played. Win rates are in %.


N sims  WinRate Elo (rel Gnu)
47      7.4     1361
94      22      1580
188     37      1708
375     53      1821
750     69.9    1946
1500    81.2    2054
3000    88      2146
6000    92.6    2239
12000   94      2278
24000   97.2    2416
48000   97.4    2429
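The Elo column appears to follow the standard logistic Elo formula applied to the win rate, with gnugo fixed at 1800. The post does not state the formula, so treat this as an assumption; the sketch below reproduces the tabulated values after rounding. The function and constant names are made up for illustration.

```python
import math

GNUGO_ELO = 1800  # rating assumed for gnugo 3.7.10 in the post


def elo_from_winrate(winrate_percent, opponent_elo=GNUGO_ELO):
    """Convert a win rate in % against a fixed opponent into an Elo rating."""
    p = winrate_percent / 100.0
    # Standard logistic Elo model: 400 * log10 of the win/loss odds.
    return opponent_elo + 400.0 * math.log10(p / (1.0 - p))


if __name__ == "__main__":
    for n_sims, winrate in [(47, 7.4), (375, 53.0), (24000, 97.2)]:
        print(n_sims, round(elo_from_winrate(winrate)))
    # prints:
    # 47 1361
    # 375 1821
    # 24000 2416
```

Win rates close to 100% make the estimate very sensitive to a handful of games, so with about 500 games per data point the last rows are much less certain than the middle ones.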

Valkyria's heavy playouts need just 375 random games per move to match gnugo, using only 0.3 seconds per move. And even using only 47 simulations per move it can still win occasionally (7.4% of the games).
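The 0.3 seconds figure follows from the playout rate quoted earlier (1150 games per second). A small sketch of the arithmetic, assuming the rate stays constant and ignoring any tree search overhead:

```python
PLAYOUTS_PER_SECOND = 1150  # rate quoted in the post for a 4 year old laptop


def seconds_per_move(n_sims, rate=PLAYOUTS_PER_SECOND):
    """Approximate thinking time per move for a given number of playouts."""
    return n_sims / rate


if __name__ == "__main__":
    for n_sims in (47, 375, 48000):
        print(n_sims, round(seconds_per_move(n_sims), 2))
```

By the same arithmetic, the largest setting in the table (48000 playouts) comes to roughly 42 seconds per move on that laptop.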

So obviously the heavy playout code of Valkyria on its own is much weaker (< Elo 1361) than gnugo and most human opponents, but by comparison, a lot of programs on CGOS with no knowledge are at about the same level, although they use 2000 simulations or more.

By searching efficiently using MCTS with AMAF, it can apparently be made arbitrarily strong.

Hope this explains how both the nature of the playouts and the MCTS contribute to the playing strength of a program.

Should one go heavy or light? I do not know. I feel that Valkyria is a little bit too slow on equivalent hardware against most top programs. On the other hand, I think it could be tweaked and improved upon. Perhaps it can even be made faster by removing code that does not improve playing strength. And there is probably still room for adding code that improves strength without a noticeable slowdown.


I just know that it is a lot of hard work doing it the way I did it.

Best
Magnus
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/
