Olivier Teytaud: <[EMAIL PROTECTED]>:
>> That translates to mean that MoGo no longer uses upper confidence
>> bounds, and only uses means.  It also means that MoGo will _never_
>> explore improbable children (after a few sims) unless the RAVE value
>> yields an unusually high estimate for it.  Is all of that correct?
>>
>
>Precisely: I don't see why you would be wrong, but empirically for 9x9,
>we have played games against high-level humans and for the (few :-) )
>games that mogo lost, we tried to see which moves were erroneously chosen
>by mogo; if we restart mogo at the same position with a huge 
>computation time (30 minutes of a fast octocore) mogo always changed his
>mind and moves to a better move.

Could we look at some of the records of the games?

-Hideki

>So:
>- theoretically, I don't see any reason for mogo to be asymptotically
>   consistent
>- there are long computation times during which mogo focuses on a bad
>   move
>- however, we have not seen a case of bad move for which mogo keeps
>   this move in case of _very_ long computation times
>
>==> if someone beats the release MoGoR3 with
>       very large computation times (time x nbcores = 4h, 1 to 4 cores)
>       I'm interested in the sgf file and the analysis
>_______________________________________________
>computer-go mailing list
>computer-go@computer-go.org
>http://www.computer-go.org/mailman/listinfo/computer-go/
--
[EMAIL PROTECTED] (Kato)
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to