Re: [computer-go] MC approach

Hideki Kato Wed, 07 Feb 2007 21:29:57 -0800

Matt Gokey: <[EMAIL PROTECTED]>:
>Weston Markham wrote:
>
>> But of course, it's not the size of the win that counts, it is rather
>> the confidence that it really is a win.
>Yes, and my reasoning was that a larger average win implied a higher 
>confidence since there is more room for error.  That intuition may not 
>hold though.
> > In random playouts that
>> continue from a position from a close game, the ones that result in a
>> large victory are generally only ones where the opponent made a severe
>> blunder.  (Put another way, the score of the game is affected more by
>> how bad the bad moves are, rather than how good the good ones are, or
>> even how good most of the moves are.  Others have commented on this
>> effect in this list, in other contexts.)  Since you can't count on
>> that happening in the real game, these simulations have a lower value
>> in the context of ensuring a win.
>That is the first reasonable argument I've heard that makes some sense 
>as to why this effect may be true.  The opposite of course may be true 
>as well and close games may really not be close due to the same blunder 
>effect.  Perhaps it is just another symptom of the fact that most 
>playouts are nonsense games.


We can test this effect by using, for example,
    v = 0.5 * (1.0 + tanh (k * score)); // v is in [0...1].
with a little penalty of simulation speed. As k being lager, this 
function closes to commonly used threshold function, and vice versa.

I guess the best value of k depends on the sensefulness of the games, 
ie., current heuristics for pruning moves are not so effective that 
larger k is the best.

- gg
--
[EMAIL PROTECTED] (Kato)
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] MC approach

Reply via email to