2009/6/6 <dhillism...@netscape.net>

> > -----Original Message-----
> > From: Don Dailey <dailey....@gmail.com>
> > To: computer-go <computer-go@computer-go.org>
> > Sent: Sat, 6 Jun 2009 5:59 pm
> > Subject: Re: [computer-go] Tweak to MCTS selection criterion
>
> > 2009/6/6 <dhillism...@netscape.net>
>
>> > I think this is one of those design decisions that nobody takes on
>> faith. We all wind up testing it both ways and in > various combinations.
>> >
>> > An additional advantage of using the number of visits is that branches
>> at the root become mathematically
>> > eliminated and can be pruned away. It often also allows the search to be
>> stopped early. It can save a lot of time
>> > for forced moves.
>
>
> > Let me see if I understand what you are saying here.
>
> > If you have set a goal time for thinking of 10 seconds,  and let's say 6
> seconds have progressed,   then it might be
> > possible to stop the search early if you do the math and see that it's
> not possible for any other move to have more
> > visits given an additional 4 seconds?
>
> > So when there is a position that has only a single clearly best move,
> perhaps a capture that cannot wait or a
> > necessary atari defense,  then you can probably save as much (or close
> to) as half the thinking time on that move.
> >    Is this correct?
>
> Yes. (Although, in practice, it's lucky to shave a third off the time.)
> There is an additional, hard to qualify, benefit. Long before the winning
> move has it locked up, some moves may be mathematically eliminated, so any
> playouts through them would be totally wasted (assuming we can ignore RAVE).
>
> > So if this understanding is correct, then it still makes sense to do the
> "pebbles" test at this point and check to see
> > if another move has a higher score before stopping the search.    If the
> move really is forced and necessary,  then
>  > the answer will be no and you can stop early.  If there is a move that
> currently appears better but with a "too small"
> > sample,  then it seems foolish to stop early.    If the move is a result
> of just a few lucky playouts, then it will quickly
> > be revealed and you can still stop early.
>
> Sounds plausible... But I've tested this. You've tested this. Any benefit
> seems to be pretty hard to measure.
>

I had the early stop rule and didn't know if anyone else had thought of
it.   But I never considered the pebbles rule, which somewhat conflicts with
the early stop rule.   But as I layed out above I think you could do both.

This is probably one of those things that adds a little bit but is difficult
to measure.

- Don




>
>
> - Dave Hillis
>
>
>
>
>
> ------------------------------
> Wanna slim down for summer? Go to America Takes it 
> Off<http://www.aolhealth.com/diet/weight-loss-program/?ncid=emlcntusheal00000001>to
>  learn how.
>
> _______________________________________________
> computer-go mailing list
> computer-go@computer-go.org
> http://www.computer-go.org/mailman/listinfo/computer-go/
>
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to