Re: [computer-go] KO in Hashtable-UCT?

2007-05-17 Thread Chrilly
> I have serious problems with KO. UCT-Suzie plays generally strong, but makes terrible blunders in KO-positions. So far I do not even understand the problem. Could you describe it more detailed? I had also some serious SuperKO problems. UCT-Suzie was very "clever" to find SuperKOs. We do not ch

Re: [computer-go] JCIS extended abstract

2007-05-17 Thread Jason House
Chaslot G (MICC) wrote: p_hat = (w_i + n_h*H_B)/(n_i+n_h) Interesting... But then how do you compute n_h in practice The mathematical derivation is based on estimating an a-priori probability distribution. In theory, one simply needs to run MC simulations for a wide variety of heur

RE: [computer-go] JCIS extended abstract

2007-05-17 Thread Chaslot G (MICC)
> How do you handle n_i = 0 in equation 1.2? I don't compute it when n_i is not equal to 1. At that point, I give every node a high upper bound in order to visit every node once. Remember that I am using "progressive unpruning" in the same time, so only the best nodes according to the heuristi

Re: [computer-go] Re: Amsterdam 2007 paper

2007-05-17 Thread Jason House
Rémi Coulom wrote: to Magnus: If you consider the example of section 2.2: 1,2,3 wins against 4,2 and 1,5,6,7. The probability is P=c1c2c3/(c1c2c3+c4c2+c1c5c6c7). For this example: N1j=c2c3,B1j=0,M1j=c2c3+c5c6c7,A1j=c4c2 N2j=c1c3,B2j=0 N3j=c1c2,B3j=0 N4j=0,B4j=c1c2c3 I will add this example to t

Re: [computer-go] KO in Hashtable-UCT?

2007-05-17 Thread Chris Fant
On 5/17/07, Chrilly <[EMAIL PROTECTED]> wrote: I have now also finished a first version of UCT-Suzie (in parallel the Peter Woitke works on the Alpha-Beta Version). UCT-Suzie uses a hashtable, mainly because I found the programming of the tree too complicated. The Monte-Carlo part uses some simpl

Re: [computer-go] Re: Amsterdam 2007 paper

2007-05-17 Thread Sylvain Gelly
Hi Rémi, 2007/5/17, Rémi Coulom <[EMAIL PROTECTED]>: to Sylvain: Here are tests of Crazy Stone at 90s/game 1CPU against GNU 3.6 level 10, measured over about 200 games [...] Thank you for your answer. These numbers are interesting. The improvement in the tree search is really huge. It is what

Re: [computer-go] Amsterdam 2007 paper

2007-05-17 Thread Rémi Coulom
Chris Fant wrote: I first thought I would keep my ideas secret until the Asmterdam tournament, but now that I have submitted my paper, I cannot wait to share it. So, here it is: http://remi.coulom.free.fr/Amsterdam2007/ Comments and questions are very welcome. I'd like to propose a potential

Re: [computer-go] KO in Hashtable-UCT?

2007-05-17 Thread Chrilly
I have now also finished a first version of UCT-Suzie (in parallel the Peter Woitke works on the Alpha-Beta Version). UCT-Suzie uses a hashtable, mainly because I found the programming of the tree too complicated. The Monte-Carlo part uses some simple patterns according the MoGo article. Progres

Re: [computer-go] Re: Amsterdam 2007 paper

2007-05-17 Thread Erik van der Werf
On 5/17/07, Rémi Coulom <[EMAIL PROTECTED]> wrote: Álvaro Begué wrote: > There are many things in the paper that we had never thought of, like > considering the distance to the penultimate move. That feature improved the effectiveness of progressive widening a lot. When I had only the distance to

Re: [computer-go] JCIS extended abstract

2007-05-17 Thread Jason House
How do you handle n_i = 0 in equation 1.2? I'd assume that if a heuristic says a move is really bad (H_B = 0), then you'd want to avoid simulations for a while. Also, has mango experimented with other related strategies for a soft transition? In my own mathematical recreation, I came up with a

Re: [computer-go] Amsterdam 2007 paper

2007-05-17 Thread dhillismail
Very interesting paper: many ideas for me to study. And thanks for giving us an early look at it. I'll make one suggestion: in the final version, it deserves some better salesmanship in the results section. I've read through too many papers, only to reach the results section at the end and se

Re: [computer-go] Amsterdam 2007 paper

2007-05-17 Thread Chris Fant
I first thought I would keep my ideas secret until the Asmterdam tournament, but now that I have submitted my paper, I cannot wait to share it. So, here it is: http://remi.coulom.free.fr/Amsterdam2007/ Comments and questions are very welcome. I'd like to propose a potential direction of furthe

Re: [computer-go] Re: Amsterdam 2007 paper

2007-05-17 Thread Magnus Persson
Yes, now I understand. I think what made it hard for me conceptually was that P(Rj) can be rewritten n different ways for each feature ci 1 <= i <= n. I had this problem with your example too. I first thought that the lines with the factors were arbitarary, but then I realized that each line goes

Re: [computer-go] producing a good probability distribution over legal moves

2007-05-17 Thread Don Dailey
On Thu, 2007-05-17 at 12:17 -0400, George Dahl wrote: > Imagine if you had a monte carlo program that took almost no time to > run. You would use it to do "heavy" playouts for another monte carlo > program to make it even stronger. I tried something like this as a test with simple monte carlo.

Re: [computer-go] producing a good probability distribution over legal moves

2007-05-17 Thread Don Dailey
On Thu, 2007-05-17 at 10:54 -0500, Zach Keatts wrote: > What you would have after your training/evaluator phase is a hueristic > knowlege of possibly better montecarlo trees to consider. This will > definitely cut down on the search space, but could also alienate a > strong search path. I have be

Re: [computer-go] Re: Amsterdam 2007 paper

2007-05-17 Thread Rémi Coulom
Álvaro Begué wrote: There are many things in the paper that we had never thought of, like considering the distance to the penultimate move. That feature improved the effectiveness of progressive widening a lot. When I had only the distance to the previous move, and the opponent made a dangerous

Re: [computer-go] Amsterdam 2007 paper

2007-05-17 Thread Chris Fant
Thanks for the great paper. And thanks for sharing it before it's published. Now I know what directions to take my engine in next. Time for Team MoGo so share some more secrets :) ___ computer-go mailing list computer-go@computer-go.org http://www.co

[computer-go] JCIS extended abstract

2007-05-17 Thread Chaslot G (MICC)
Dear all, Following the example of Rémi, I would like to share with you a paper that I wrote which describe some important elements of my Go program Mango. I submitted this paper recently to the JCIS workshop 2007. Due to the fact that it was an extended abstract, a lot of details are missing.

Re: [computer-go] Re: Amsterdam 2007 paper

2007-05-17 Thread Álvaro Begué
Rémi: John and I are planning a similar usage of patterns and features in the MC simulations, although we were thinking of a very different method to update "strengths". Although we derived our formulas from an extension of the logistic regression instead of Bradley-Terry models, we arrived at ve

Re: [computer-go] Re: Amsterdam 2007 paper

2007-05-17 Thread Chris Fant
> It seems that e-mail at my university does not work any more. I have > received none of the replies to my message of yesterday, but I could > read them on the web archives of the list. So I have registered from > another address, and will answer to the questions I have read on the web. In sect

Re: [computer-go] Re: Amsterdam 2007 paper

2007-05-17 Thread Chris Fant
It seems that e-mail at my university does not work any more. I have received none of the replies to my message of yesterday, but I could read them on the web archives of the list. So I have registered from another address, and will answer to the questions I have read on the web. In section 2.3

Re: [computer-go] producing a good probability distribution over legal moves

2007-05-17 Thread Daniel Burgos
But it is very difficult that a board position is repeated between games. I don't see how you will use the training pairs in the new games. 2007/5/17, George Dahl <[EMAIL PROTECTED]>: What I am actually proposing is collapsing the results of the playouts offline and then having a function that

Re: [computer-go] Amsterdam 2007 paper

2007-05-17 Thread Brian Slesinsky
This is very interesting. I've skimmed so far and don't understand everything yet, but I can make some comments on readability. Table 1 takes some study to understand. If I understand it correctly, the Feature column might be more clearly labeled "Feature of Candidate Move." (For example, "Pas

[computer-go] Re: Amsterdam 2007 paper

2007-05-17 Thread Rémi Coulom
Hi, It seems that e-mail at my university does not work any more. I have received none of the replies to my message of yesterday, but I could read them on the web archives of the list. So I have registered from another address, and will answer to the questions I have read on the web. to Matt

Re: [computer-go] producing a good probability distribution over legal moves

2007-05-17 Thread George Dahl
What I am actually proposing is collapsing the results of the playouts offline and then having a function that maps board positions to playout values without actually doing playouts. So I would use an MC player to generate a lot of training pairs of the form (position, score) where position would

[computer-go] Professional Go player at the Computer Olympiad

2007-05-17 Thread Chaslot G (MICC)
Dear all, We are proud to announce that Guo Juan, 5d professional from China, will be present at the Computer Olympiad in Amsterdam on the Sunday 17th of June. She will play against the winner of the Olympiad and comment the most interesting games. We will broadcast as many Go games and comments a

Re: [computer-go] producing a good probability distribution over legal moves

2007-05-17 Thread Zach Keatts
What you would have after your training/evaluator phase is a hueristic knowlege of possibly better montecarlo trees to consider. This will definitely cut down on the search space, but could also alienate a strong search path. I have been thinking along these same line for some time. The problem

Re: [computer-go] producing a good probability distribution over legal moves

2007-05-17 Thread David Doshay
On 17, May 2007, at 8:17 AM, Brian Slesinsky wrote: A weakness of this approach is that sometimes the best move depends on how you plan to follow it up; a program that plays the theoretically best move but doesn't know how to follow it up is weaker than a program that plays safer moves. I have

Re: [computer-go] producing a good probability distribution over legal moves

2007-05-17 Thread Brian Slesinsky
I think there is something to this; it seems like it should be possible to use a database of randomly selected positions from games along with the best known followup, and use that as a faster way of testing a program's strength than playing full games. Such a database would be valuable for all s

Re: [computer-go] Amsterdam 2007 paper

2007-05-17 Thread Magnus Persson
Thanks, for sharing this early! Everything in this paper makes perfect sense, and is well written. Reading it is like finding the last pieces of a difficult puzzle. I do however have some problem with the math of section "2.5 A Minorization-Maximization Algorithm". I have a feeling that a lot of

Re: [computer-go] Amsterdam 2007 paper

2007-05-17 Thread Sylvain Gelly
Hi Rémi, Thank you for this paper. I found the work very interesting, well written, and the paper is clear and pleasant to read. As two things are modified in the same time (simulation policy and tree search), I wonder what is the contribution of each part in the overall improvement. For example