Re: [computer-go] Tweak to MCTS selection criterion

2009-06-08 Thread Mark Boon
I did this, stopping search if other moves mathematically couldn't catch up. I found that the savings in percentage of total nodes depended on how many playouts the program did. The larger the number of playouts, the larger the savings. I also made a rule where after a certain percentage of the tot

Re: [computer-go] Tweak to MCTS selection criterion

2009-06-07 Thread dhillismail
Subject: Re: [computer-go] Tweak to MCTS selection criterion The policy is only risk free on a per-move basis. For the game as a whole, using more time on one move means there will be less time available for later moves. Back when I tested this rule in my own program, I didn't see any signif

Re: [computer-go] Tweak to MCTS selection criterion

2009-06-07 Thread dhillismail
illis -Original Message- From: Brian Sheppard To: computer-go@computer-go.org Sent: Sun, 7 Jun 2009 6:05 pm Subject: [computer-go] Tweak to MCTS selection criterion > check if overtaking the leading move is mathematically impossible? Yes. Pebbles does this. Please note that the discussion ha

Re: [computer-go] Tweak to MCTS selection criterion

2009-06-07 Thread Darren Cook
> It's interesting with UCT because of the interplay between the time > management algorithm and the exploration parameter. Suppose you are > early in the game, and your time management algorithm says you should be > spending 10 seconds on a move. After six seconds, because your parameter > is skew

Re: [computer-go] Tweak to MCTS selection criterion

2009-06-07 Thread Don Dailey
On Sun, Jun 7, 2009 at 6:05 PM, Brian Sheppard wrote: > > check if overtaking the leading move is mathematically impossible? > > Yes. Pebbles does this. > > Please note that the discussion has veered into time control policy, > which is not the subject of the original post. The original post di

[computer-go] Tweak to MCTS selection criterion

2009-06-07 Thread Brian Sheppard
> check if overtaking the leading move is mathematically impossible? Yes. Pebbles does this. Please note that the discussion has veered into time control policy, which is not the subject of the original post. The original post discusses move selection policy: when your program stops, for whateve

RE: [computer-go] Tweak to MCTS selection criterion

2009-06-07 Thread David Fotland
If the > > move is a result of just a few lucky playouts, then it will quickly be > > revealed and you can still stop early. > > > > - Don > > > > > > > > > > > > - Dave Hillis > > > > > > > > -Origin

Re: [computer-go] Tweak to MCTS selection criterion

2009-06-07 Thread Don Dailey
2009/6/6 > > I had the early stop rule and didn't know if anyone else had thought of > it. But I never considered the pebbles rule, > > which somewhat conflicts with the early stop rule. But as I layed out > above I think you could do both. > > > This is probably one of those things that adds

Re: [computer-go] Tweak to MCTS selection criterion

2009-06-07 Thread Magnus Persson
I think all principles I use in the time management for Valkyria came up in this thread more or less. 1) Valkyria selects move that has been searched the most. 2) It is given a base time for example 20 seconds early on on 9x9 CGOS 3) The base time is adjusted down if the program is winning big.

Re: [computer-go] Tweak to MCTS selection criterion

2009-06-07 Thread Christian Nentwich
it will quickly be revealed and you can still stop early. - Don - Dave Hillis -Original Message- From: Michael Williams mailto:michaelwilliam...@gmail.com>> To: computer-go mailto:computer-go@computer-go.org>> Sent: Sat, 6 Jun 2009 5:07 pm

Re: [computer-go] Tweak to MCTS selection criterion

2009-06-06 Thread dhillismail
> I had the early stop rule and didn't know if anyone else had thought of it.?? > But I never considered the pebbles rule, > which somewhat conflicts with the early stop rule.?? But as I layed out above > I think you could do both. > This is probably one of those things that adds a little bit b

Re: [computer-go] Tweak to MCTS selection criterion

2009-06-06 Thread Don Dailey
2009/6/6 > > -Original Message- > > From: Don Dailey > > To: computer-go > > Sent: Sat, 6 Jun 2009 5:59 pm > > Subject: Re: [computer-go] Tweak to MCTS selection criterion > > > 2009/6/6 > >> > I think this is one of those design deci

Re: [computer-go] Tweak to MCTS selection criterion

2009-06-06 Thread dhillismail
> -Original Message- > From: Don Dailey > To: computer-go > Sent: Sat, 6 Jun 2009 5:59 pm > Subject: Re: [computer-go] Tweak to MCTS selection criterion > 2009/6/6 > I think this is?one of those?design decisions that nobody takes on faith. We > all wind up tes

Re: [computer-go] Tweak to MCTS selection criterion

2009-06-06 Thread Don Dailey
ll quickly be revealed and you can still stop early. - Don > - Dave Hillis > > > > -Original Message- > From: Michael Williams > To: computer-go > Sent: Sat, 6 Jun 2009 5:07 pm > Subject: Re: [computer-go] Tweak to MCTS selection criterion > > Another str

Re: [computer-go] Tweak to MCTS selection criterion

2009-06-06 Thread dhillismail
allows the search to be stopped early. It can save a lot of time for forced moves. - Dave Hillis -Original Message- From: Michael Williams To: computer-go Sent: Sat, 6 Jun 2009 5:07 pm Subject: Re: [computer-go] Tweak to MCTS selection criterion Another strategy to be considered is to

Re: [computer-go] Tweak to MCTS selection criterion

2009-06-06 Thread Don Dailey
On Sat, Jun 6, 2009 at 5:07 PM, Michael Williams < michaelwilliam...@gmail.com> wrote: > Another strategy to be considered is to not allow the thinking to cease > until the maximum win rate and the maximum visit count agree on the same > move. Obviously this requires some extra code to make sure y

Re: [computer-go] Tweak to MCTS selection criterion

2009-06-06 Thread Michael Williams
Another strategy to be considered is to not allow the thinking to cease until the maximum win rate and the maximum visit count agree on the same move. Obviously this requires some extra code to make sure you don't lose on time, etc. Brian Sheppard wrote: When a UCT search is completed, the usua

[computer-go] Tweak to MCTS selection criterion

2009-06-06 Thread Brian Sheppard
When a UCT search is completed, the usual selection criterion is "choose the move that has the most trials." This is more stable than choosing the move that has the highest percentage of wins, since it is possible to have an unreliably high percentage if the number of trials is small. I have a sma