I honestly think there are better ways to handle this, if you must,
other than changing the goal to a losing goal.     At least give the
computer the right goal (winning) and adjust from there.  

If I were trying to solve this "problem",   the solution I would look at
first  would be to pre-process the moves in advance of the search,   and
just impose a very slight bias on the root move list.     The idea is
that all other things being equal,  it will play a patterned or "normal"
move instead of a random move when the game is virtually over.  

If you do it right,  you can probably make it play more like like it has
an ego without making it weaker.

- Don



Petr Baudis wrote:
> On Fri, Mar 07, 2008 at 08:04:37AM -0500, Thomas Wolf wrote:
>   
>> I assume that when you change komi dynamically, all that was learned
>> by MC so far under the different komi value is useless/wrong.
>>     
>
> But what are actually your reuse rates? With the standard UCB1 formula,
> I find reusing branches from earlier trees give relatively insignificant
> number of extra playouts, at least with p=0.2 (I find that using lower p
> values leads to very deep UCT trees, but the choice of moves being read
> out is _very_ noisy). Maybe this is much better with RAVE or such?
>
>   
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to