[computer-go] Greedy search vs UCT

Magnus Persson Thu, 24 Apr 2008 08:28:18 -0700

I have checked whether it makes a difference for Valkyria to use confidence bounds or to just greedily search the move with the highest winrate. This is Valkyria 3.2.0 using 512 simulations per move against GnuGo 3.7.10.
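
For readers unfamiliar with the parameter, here is a minimal sketch of how an exploration constant such as uct_k typically enters move selection. It assumes the textbook UCB1 form (winrate plus uct_k times sqrt(ln(parent visits) / move visits)); Valkyria's actual formula is not given in this post and may differ. The function name, the stats layout, and the rule of trying unvisited moves first are all hypothetical.

```python
import math

def select_move(stats, uct_k):
    """Pick the next move to simulate.

    stats maps move -> (wins, visits). Hypothetical layout, not Valkyria's.
    With uct_k == 0 the bonus vanishes and selection is purely greedy.
    """
    total = sum(visits for _, visits in stats.values())

    def score(move):
        wins, visits = stats[move]
        if visits == 0:
            return math.inf  # assumption: unvisited moves are tried first
        winrate = wins / visits
        # Assumed UCB1-style confidence bonus, scaled by uct_k.
        bonus = uct_k * math.sqrt(math.log(total) / visits)
        return winrate + bonus

    return max(stats, key=score)

# Made-up counts, for illustration only.
stats = {"A": (60, 100), "B": (5, 10), "C": (1, 4)}
greedy_choice = select_move(stats, 0.0)   # highest winrate wins
explore_choice = select_move(stats, 1.0)  # rarely visited move gets a large bonus
```

With these made-up counts, uct_k = 0 and uct_k = 0.1 both pick the move with the best winrate, while uct_k = 1 pulls the search toward the least-visited move.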


UCT_K   Winrate SERR
0       58.8    2.2 (greedy)
0.01    56.8    2.2
0.1     60.9    2.2
0.5     54.2    2.2
1       50.6    2.2

As you can see, up to uct_k = 0.1 the winrate against GnuGo is more or less constant (500 games were played for each value of uct_k), and then it declines.

So although 0.1 was best, I cannot claim that it is better than a plain greedy search.
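
The reasoning behind that caution can be sketched with a rough two-sample check. This assumes the SERR column is the usual binomial standard error over 500 games and that the test series are independent; the function names and the 1.96 threshold (the conventional two-sided 5% level) are choices made for this illustration, not something from the experiment itself.

```python
import math

def winrate_serr(winrate_pct, games):
    """Binomial standard error of a winrate, in percentage points."""
    p = winrate_pct / 100.0
    return 100.0 * math.sqrt(p * (1.0 - p) / games)

def z_score(a_pct, b_pct, games):
    """Difference of two winrates in units of its standard error.

    Assumes both series have the same number of games and are independent.
    """
    se_diff = math.hypot(winrate_serr(a_pct, games), winrate_serr(b_pct, games))
    return (a_pct - b_pct) / se_diff

GAMES = 500
z_best_vs_greedy = z_score(60.9, 58.8, GAMES)  # uct_k = 0.1 vs uct_k = 0
z_best_vs_one = z_score(60.9, 50.6, GAMES)     # uct_k = 0.1 vs uct_k = 1
```

Under these assumptions, the gap between uct_k = 0.1 and greedy is well under one standard error of the difference, while the gap between uct_k = 0.1 and uct_k = 1 is more than three.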

I will repeat this using 4 times as many simulations per move. The search sensitivity to uct_k may depend on how deep the tree is searched.


-Magnus


_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/
