Re: [computer-go] Time weighting in opening

Jason House Sat, 23 May 2009 14:57:49 -0700

How have you tested your time management code? CGOS is very bad for testing time management because it gives a gift of time on every move (to compensate for assumed network lag).

I think you might be missing a factor of two in your computations. Only half the moves in a game count against your time.


Sent from my iPhone

On May 23, 2009, at 4:26 PM, Christian Nentwich <christ...@modeltwozero.com> wrote:

This time management business is quite interesting. I looked into this in some detail a while ago and came up with something I think is reasonable for 9x9. I'd love to hear what you all think about it.
My algorithm relies on two key parameters: the time left (which is either reported by a server periodically, or maintained by the engine), and an estimate of how many moves are left. The estimate of moves left is set to 1.6 * board area (i.e. 9 x 9 x 1.6) initially, based on the average length of playouts in experiments. Towards the end of the game, especially with Tromp-Taylor rules, the algorithm instead counts the number of empty intersections left, plus a haircut for captures. This is usually quite accurate.
So, given the time left, T, and the number of estimated moves left, M, the task is to find out how much time to spend on the current move. We know we want to spend (a lot) more on early moves, and less later.
Now assume you have moves numbered along the x axis, from 1 to M, and the y axis shows how much time to spend on a move. I used a downward sloping curve with the following form: 1 / x ^ (1 / n), where 'n' controls the steepness of the curve. We know the total area under the curve *must* be equal to T, so that you provably never run out of time given your estimated number of moves.
Integrating over dx and some algebra gives (remember n is a steepness constant, M is the number of moves left, T is time left):
time(current move) = T * (n - 1) / (n * (M ^ ((n - 1) / n) - 1))
Add a haircut of 5-10%, just in case of network funnies. Works very nicely for me, at least as far as time management is concerned; my code is not strong yet but it never loses on time :-) Plus, it gets to spend super-linear time in the beginning. If you plot the initial curve equation, you can see how it works.
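A minimal Python sketch of the formula above may make the behaviour easier to see. The function name, the default n = 2 and the 5% haircut are illustrative assumptions, not values from the original engine. The clamp to the remaining time is also an addition here: for very small M the raw formula can return more than T.

```python
def time_for_move(time_left, moves_left, n=2.0, haircut=0.05):
    """Seconds to spend on the current move under a 1 / x^(1/n) curve.

    time_left  -- T, seconds remaining on the clock
    moves_left -- M, estimated number of moves still to be played
    n          -- steepness constant (assumed default; must be > 1)
    haircut    -- safety margin for network lag (assumed default of 5%)
    """
    if n <= 1.0:
        raise ValueError("n must be greater than 1")
    if moves_left <= 1:
        # Last estimated move: spend what is left, minus the safety margin.
        return time_left * (1.0 - haircut)
    exponent = (n - 1.0) / n
    share = time_left * (n - 1.0) / (n * (moves_left ** exponent - 1.0))
    # Added safeguard: never allocate more than the clock holds.
    share = min(share, time_left)
    return share * (1.0 - haircut)


if __name__ == "__main__":
    # Opening move of a 9x9 game with 300 seconds and M = 1.6 * 81.
    first = time_for_move(300.0, 1.6 * 81)
    uniform = 300.0 / (1.6 * 81)
    print(f"curve: {first:.2f}s, uniform split: {uniform:.2f}s")
```

Calling the function again on every move with the updated T and M means any error in the move estimate is corrected as the game goes on.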
Christian



On 23/05/2009 18:38, Don Dailey wrote:
On Sat, May 23, 2009 at 12:34 PM, Brian Sheppard <sheppar...@aol.com> wrote:
>My general impression (also based on experiences from chess):
>Distributing time rather balanced over the moves is a stable
>strategy.
I have found in Chess that you also want to spend more time up front. Part, but not all, of the reason for this is that you don't know how long a game will last and you do not want to be on the losing end of a short game where you have a lot of time left. This by itself makes early moves more important. Also, early decisions shape the game more than later decisions.
In 9x9 GO I have found that it's VERY beneficial to spend more time on early moves. This seems to be more true than in chess. I think it is because the early game is much harder to play than the ending and you don't want to have a lot of time built up playing easy moves.
Like everything else, the trick is to find the right balance. With 19x19, time allocation is probably more difficult.
With sudden death time controls, a reasonable algorithm is to set some percentage of the remaining time on the clock as your goal time. For chess I have used numbers like 1/30th of the remaining time. In my opinion the number should be a low estimate of how many moves you expect to have to make. Although games can be really short or really long, in general you expect that most games will take at least about 30 moves and not exceed 60 or 70 moves.
This does not allocate time evenly, which is good. Each move will be played slightly faster than the previous. But it will NEVER run out of time either, at least mathematically, since there is always some time left over. This fraction can be tuned, of course, to your comfort level. I remember one older program used 1/60 but a couple of years later the author reported to me that it was way too high. This was a program that dominated computer chess for a few years.
You can get a feel for this by just doing the math to see how much time you would have for an unusually short game or an unusually long game. If your program supports multiple board sizes you pick a divisor that is some function of the board size, such as 1 / (N*N) (which is probably far too conservative). So perhaps 1 / ((N*N) * 0.6), where you tune the 0.6 constant.
So I'm saying that this is good in Chess and I believe, based on Brian's comments and my own experience, that it is ESPECIALLY good in GO.
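The fraction-of-remaining-time rule can be tried out with a few lines of Python. This is only a sketch: the function names and the simulation loop are assumptions for illustration, N is taken to be the board size (9 for 9x9), and 0.6 is the tunable constant mentioned above.

```python
def sudden_death_time(time_left, board_size=9, scale=0.6):
    """Goal time for this move: a fixed fraction of the remaining clock."""
    divisor = board_size * board_size * scale  # about 48.6 for 9x9
    return time_left / divisor


def simulate(total_time, moves, board_size=9):
    """Spend the goal time on every move; return (per-move times, time left)."""
    spent = []
    remaining = total_time
    for _ in range(moves):
        t = sudden_death_time(remaining, board_size)
        spent.append(t)
        remaining -= t
    return spent, remaining


if __name__ == "__main__":
    # Compare an unusually short game with an unusually long one.
    for own_moves in (20, 60):
        spent, remaining = simulate(300.0, own_moves)
        print(f"{own_moves} moves: first {spent[0]:.2f}s, "
              f"last {spent[-1]:.2f}s, unused {remaining:.2f}s")
```

Because each move takes a fixed fraction of whatever is left, the per-move time shrinks geometrically and the clock never reaches zero, which is the "at least mathematically" guarantee described above. Real overhead such as network lag is not modelled here.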
- Don





Reasoning on the basis of experience in chess is OK, but you must
account for the differences between the domains.

Chess is more or less uniformly difficult across the whole game.
Go is not. It is definitely more difficult in the opening, especially
for MCTS programs. Trials take longer in the opening, and the
variance is larger, and the differences between moves are smaller
(usually) which means that fewer moves are obviously forced. You have
to spend more time on early moves in MCTS Go programs.
Pebbles calculates the time required to uniformly spread the remaining
time over the game. It then *doubles* that amount, and allocates that
much time for the current play. This policy is not as extreme as you
might think; it results in more-or-less uniform numbers of trials
across the whole game. I have some experimental evidence that suggests
that doubling is not enough. Perhaps the optimum multiplier is 2.5
or 3.
Now, this usually does not result in having to play blitz moves later
in the game. (It can happen, if the opponent drags out a losing effort
into 100+ turns, but that doesn't matter.)
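The policy described above can be sketched in Python as follows. This is not Pebbles' actual code: the function name is hypothetical, and how the engine estimates its own remaining moves is not stated here, so that value is simply passed in. The clamp to the remaining time is an added safeguard.

```python
def multiplied_uniform_time(time_left, own_moves_left, multiplier=2.0):
    """Uniform share of the remaining time, scaled by a fixed multiplier.

    own_moves_left -- estimate of this engine's remaining moves (assumption:
                      the caller supplies it)
    multiplier     -- 2.0 as described above; 2.5 or 3 are the values
                      suggested as possibly better
    """
    uniform = time_left / max(own_moves_left, 1)
    # Added safeguard: never allocate more than the clock holds.
    return min(uniform * multiplier, time_left)


if __name__ == "__main__":
    # 300 seconds left and an estimated 50 own moves to go.
    for m in (2.0, 2.5, 3.0):
        print(m, multiplied_uniform_time(300.0, 50, multiplier=m))
```

Since the uniform share is recomputed from the remaining clock on every move, spending a multiple of it early automatically leaves smaller allocations for later moves.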

Mogo might have gone too far, but maybe not. There are a lot of ways
to lose games.

Best,
Brian

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


