Hi, On Sat, Mar 08, 2008 at 10:18:34AM +0100, Petr Baudis wrote: > (By the way, pachi1-*-light are UCT bots with completely light > playouts with various UCB1 c values, if anyone wants to use that as > reference. Surprisingly, it seems that my heavy playouts do not make big > difference so far, though the rating is still very unstable.)
after two days of play, it seems the ratings are fairly settled now. For clarity, here is the UCB1 formula I use: UCB1 = X_i + sqrt(log(N) * c / n) Specifically, the c is withing the sqrt(); some of the papers put it in front of the sqrt. Also, I expand UCT leaves at the second hit. This retains conservative memory usage but it is important for strength - I saw huge strength increase when I lowered this to 2 from the original value of 5. With 110k playouts per move and no domain knowledge in the playouts, the ratings are now: c=0.2 (pachi1-p0.2-light) ELO 1627 (285 games) c=1.0 (pachi1-p1.0-light) ELO 1590 (120 games) c=0.05 (pachi1-p0.05-light) ELO 1531 (286 games) c=2.0 (pachi1-p2.0-light) ELO 1511 (118 games) The main two messages of this post are: If you are developing own UCT bot, with this number of playouts you should be aiming at least at 1600 ELO on CGOS. And choosing the right c can easily make a 100 ELO difference! In particular, the "default" UCB1 c=2.0 appears to be very unsuitable choice. I'm pretty sure my code is fairly well debugged now, but of course there may be still bugs lurking; when I have put my bots on CGOS for the first time it was awfully bug-ridden (and about 800 ELO worse ;-). What ELO rating did pure UCT bots get historically with how many playouts? P.S.: Looks like the heavy playouts I described in my other mail bring no improvement to the bot strength at all, and mostly make it few ELO weaker. :-( I'm rethinking my approaches now. -- Petr "Pasky" Baudis Whatever you can do, or dream you can, begin it. Boldness has genius, power, and magic in it. -- J. W. von Goethe _______________________________________________ computer-go mailing list computer-go@computer-go.org http://www.computer-go.org/mailman/listinfo/computer-go/