Hi,

On Sat, Mar 08, 2008 at 10:18:34AM +0100, Petr Baudis wrote:
>   (By the way, pachi1-*-light are UCT bots with completely light
> playouts with various UCB1 c values, if anyone wants to use that as
> reference. Surprisingly, it seems that my heavy playouts do not make big
> difference so far, though the rating is still very unstable.)

  after two days of play, it seems the ratings are fairly settled now.
For clarity, here is the UCB1 formula I use:

        UCB1 = X_i + sqrt(log(N) * c / n)

Specifically, the c is withing the sqrt(); some of the papers put it in
front of the sqrt.

  Also, I expand UCT leaves at the second hit. This retains conservative
memory usage but it is important for strength - I saw huge strength
increase when I lowered this to 2 from the original value of 5.

  With 110k playouts per move and no domain knowledge in the playouts,
the ratings are now:

        c=0.2  (pachi1-p0.2-light)      ELO 1627 (285 games)
        c=1.0  (pachi1-p1.0-light)      ELO 1590 (120 games)
        c=0.05 (pachi1-p0.05-light)     ELO 1531 (286 games)
        c=2.0  (pachi1-p2.0-light)      ELO 1511 (118 games)

  The main two messages of this post are: If you are developing own UCT
bot, with this number of playouts you should be aiming at least at 1600
ELO on CGOS. And choosing the right c can easily make a 100 ELO
difference! In particular, the "default" UCB1 c=2.0 appears to be very
unsuitable choice.

  I'm pretty sure my code is fairly well debugged now, but of course
there may be still bugs lurking; when I have put my bots on CGOS for the
first time it was awfully bug-ridden (and about 800 ELO worse ;-). What
ELO rating did pure UCT bots get historically with how many playouts?


P.S.: Looks like the heavy playouts I described in my other mail bring
no improvement to the bot strength at all, and mostly make it few ELO
weaker. :-( I'm rethinking my approaches now.

-- 
                                Petr "Pasky" Baudis
Whatever you can do, or dream you can, begin it.
Boldness has genius, power, and magic in it.    -- J. W. von Goethe
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to