Re: [computer-go] Re: computer-go Digest, Vol 43, Issue 8

Jason House Sun, 10 Feb 2008 10:14:57 -0800

On Sun, 2008-02-10 at 18:35 +0100, Olivier Teytaud wrote:
> > That translates to mean that MoGo no longer uses upper confidence
> > bounds, and only uses means.  It also means that MoGo will _never_
> > explore improbable children (after a few sims) unless the RAVE value
> > yields an unusually high estimate for it.  Is all of that correct?
> >
> 
> Precisely: I don't see why you would be wrong, but empirically for 9x9,
> we have played games against high-level humans and for the (few :-) )
> games that mogo lost, we tried to see which moves were erroneously chosen
> by mogo; if we restart mogo at the same position with a huge 
> computation time (30 minutes of a fast octocore) mogo always changed his
> mind and moves to a better move.


I'm just surprised to hear that the program that introduced UCT (and got
so many others to use it), isn't using UCT any more.  Combining RAVE and
UCT as described in the PDF still sounds like UCT to me, but with no
sqrt(log) term, it no longer is.  I'll certainly have to think about the
trades being made and what I'd expect the outcome to be.

4 hours of CPU time seems like a really long time.  Have there been any
trades done to measure how long MoGo takes to change its mind under
different configurations?


> 
> So:
> - theoretically, I don't see any reason for mogo to be asymptotically
>    consistent

Asymptotically approaching perfect play is no longer a goal?
(Rhetorical question to show some concern)

> - there are long computation times during which mogo focuses on a bad
>    move
> - however, we have not seen a case of bad move for which mogo keeps
>    this move in case of _very_ long computation times
> 
> ==> if someone beats the release MoGoR3 with
>       very large computation times (time x nbcores = 4h, 1 to 4 cores)
>       I'm interested in the sgf file and the analysis

I bet someone will take up the challenge.  You should probably also give
a limit to how long the human can think about the game.  Can they also
sit there for 4 hours contemplating a move and working out all the
variations?  Of course, this challenge by itself shows the ultimate
effect of increased computational power...

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] Re: computer-go Digest, Vol 43, Issue 8

Reply via email to