On Fri, 2007-04-13 at 19:52 +0900, Darren Cook wrote:
> >   http://greencheeks.homelinux.org:8015/~drd/public/study.jpg
> > ...
> > I'm actually testing 2 programs - both of them UCT style go
> > programs, but one of those programs does uniformly random
> > play-outs and the other much stronger one is similar to
> > Mogo, as documented in one of their papers.
> 
> Hi Don,
> Can you describe the implementation of heavy and lite in more detail;
> especially lite?
> 
> I've been trying the libego program "out of the box", and am up to
> 200,000 UCT playouts, but still gnugo 3.6 on level 6 is winning 10 out
> of 10. According to your chart, lite with 128K playouts should be about
> 50 ELO points higher than gnugo 3.7 (on level 10 I assume?). (256K
> playouts should be 130 points higher.)
> 
> I'm wondering if there is a standard set of UCT enhancements that
> everyone is doing?
> 
> TIA,
> 
> Darren

If 200,000 play-outs is being beat, something is broken.  Even 
a bad implementation should do better than that.   

Does libego provide a full UCT implementation out of the box?  

The lite version I am testing has one enhacement that makes it not
uniformly random.   There is code to make it prefer captures in a
probabilistic way.   I think I just test N moves, and if one of them
is a capture I play it, otherwise I choose a move randomly which doesn't
fill an eye.   But this enhancement isn't needed to get it up to the
level of gnugo 3.6

There are issues with things like suicide, eye-filling and so on that
can effect the strength - different people claim different things
about this.   I never allow suicide, I test only for simple ko in
the play-outs.  I don't know how much impact variations of these
things have.   

One thing that I think most people are doing is related to how you
select a move once you have finished the search.   Do you pick the
highest scoring move?   Sometimes the highest scoring move has very
few simulations in that part of the tree and the simplest and safest
thing to do is pick the node that has the highest number of 
simulation under it.   This move is usually the highest scoring move,
or has been the highest scoring and probably will be again.   Another
thing you can do is to delay the search to see if the new move sticks.

That's the only thing I would consider an enhancement - I don't know
how much strength it contributes but it does help.

- Don








 




> _______________________________________________
> computer-go mailing list
> [EMAIL PROTECTED]
> http://www.computer-go.org/mailman/listinfo/computer-go/

_______________________________________________
computer-go mailing list
[EMAIL PROTECTED]
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to