Re[4]: [computer-go] UCT vs MC

Dmitry Kamenetsky Wed, 21 Feb 2007 15:40:42 -0800

Thank you Don and Sylvain. I now understand this issue completely.

One more question. Line 23 states: for i:=node.size()-2 to 0 do. The leaf node 
should be stored in node[node.size()-1], so why do we start at node.size()-2? 
Is it not necessary to update the value of the leaf node?


-----Original Message-----
From: Don Dailey <[EMAIL PROTECTED]>
To: Dmitry Kamenetsky <[EMAIL PROTECTED]>, computer-go 
<computer-go@computer-go.org>
Date: Wed, 21 Feb 2007 12:54:43 -0500
Subject: Re: Re[2]: [computer-go] UCT vs MC

> 
> On Wed, 2007-02-21 at 16:56 +0300, Dmitry Kamenetsky wrote:
> > Thank you for your answer. However, I am even more confused now. I
> > understand that "-" is for negamax, but I don't understand why it
> > became "1-". I am trying to implement your algorithm and I just want
> > to know what lines 7, 16 and 26 should be?
> 
> I'm not sure this is what you are looking for, but in negamax,  scores
> can be negative or positive.   The scores are always adjusted so that
> you can view positive numbers as "good" and negative as "bad" from the
> point of view you are referencing.   So to get the score from the
> "other" point of view you simply negate it.
> 
> But in UCT, we don't deal with negative numbers.  A score is between
> 0 and 1,  so 0.001 is almost losing and 0.999 is almost winning for
> example.
> 
> To change 0.99 to the other player's point of view in this system, where
> scores must be between 0 and 1,  you must negate it and add 1.   So 0.99
> becomes:   1 - 0.99 = 0.01  
> 
> I hope that is what you are asking about and  that this explains it.
> 
> - Don
>  
> 
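
The viewpoint flip Don describes above can be sketched in a few lines of
Python. This is only an illustration of the arithmetic, not code from the
pseudocode under discussion; the function names flip_negamax and flip_uct
are hypothetical, and the range check is an added assumption.

```python
import math


def flip_negamax(score: float) -> float:
    """Negamax convention: scores are signed, so the other side's view is -score."""
    return -score


def flip_uct(value: float) -> float:
    """UCT convention: values lie in [0, 1], so the other side's view is 1 - value."""
    if not 0.0 <= value <= 1.0:
        raise ValueError("value must lie in [0, 1]")
    return 1.0 - value


# Don's example: 0.99 for one player is (about) 0.01 for the other.
# The comparison uses a tolerance because 1.0 - 0.99 is not exactly 0.01
# in binary floating point.
assert math.isclose(flip_uct(0.99), 0.01, abs_tol=1e-12)

# Flipping twice returns to the original point of view.
assert flip_uct(flip_uct(0.25)) == 0.25
assert flip_negamax(flip_negamax(0.5)) == 0.5
```

In both conventions the flip is its own inverse, which is why applying it
once per ply while walking back up the tree keeps each node's value in the
point of view of the player to move there.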
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/
