On 31-mrt-08, at 12:36, Don Dailey wrote:

Are these fixed patterns or wildcard patterns?      I'm interested in
wildcard patterns too and how to automatically generate them.

A wildcard pattern is exactly the same as a decision tree (it can be
represented perfectly by a decision tree.) There are several methods in AI that have similar function and performance to decision trees - and
that's why these methods are interesting to me.   They are different
ways of looking at wildcard pattern generation.


These are wild-card patterns. And I do indeed represent them as a decision tree.


2) When I used these patterns during simulation the playouts suddenly
look surprisingly like normal Go compared to random playouts. However,
the program started to play worse. Much worse. Even when I let it
compute as many playouts as a program without patterns.
This of course is now a well known phenomenon.

Aha, well sometimes reinventing a few wheels is inevitable :-) It did surprise me.



The first observation made me wary to rely on harvested patterns. I
think it shows at least it needs to be used in combination with
hand-crafted patterns. Also it means that the fact you harvest a large
number of patterns isn't necessarily meaningful.
It isn't  necessarily the quality of the moves that is important it
seems.    All playouts, including uniformly random playouts, set up
certain type of bias.   Uniformly random playouts have the
characteristic that stupid errors are compensated by stupid errors by
the opponent and so tend to weakly balance out and return a "reasonable"
evaluation.

They balance, but tend to have a large deviation from the norm. 'Better' play during playouts should, in theory, reduce the size of the deviation.


Heavier playouts have been shown to be far superior,  but just placing
stronger moves in the playouts does not seem to be the right
formula. My guess is that if you place arbitrary knowledge into the
playouts, you create terrible imbalances.   Perhaps the secret (I'm
entering uncharted territory here) is that it's ok to create imbalance
if it's the type the tree is easily able to correct.    This is my
working theory,  let's call it "maximum information gain."      The
early Mogo paper showed that it's good to look at defending opponents
atari. Defending an atari tends to be either one of the better or one of the worst moves on the board. It's a horrible move if it cannot be defended. HOWEVER, what you can say is that Mogo's playout strategy
pushes the search right away in the direction of  "finding out for
sure"   so this appears to cause the playouts to create a hypothesis
that is either quickly refuted or quickly confirmed.    I really think
you want to push towards positions that "work out" what is really going
on, even if you have to insert bias into the the playouts to get this
(which you probably do.)


I don't know about this. I'm pretty sure MoGo checks if the stone can make at least two liberties (ladder problem) in which case it can still be horrible but very seldomly worse than random. It seems to me that in principle whenever you have a 'pattern' that has a higher success-rate than a random move over a large number of games, then it should always be preferred over the random choice. I think it's the lack of randomness that's the problem. The large number of playouts tend to converge to a mistake if it's deterministic. It doesn't even have to be totally deterministic I think.


The second observation I think may have been caused by not enough
randomness. But that means I first have to find an answer to how much
randomness I need to put into the patterns. I'm first looking at this
question with some hand-crafted patterns to get a better handle on
this issue.
Let us know.   The whole issue is pretty interesting.    I think
randomness is required only to counteract systematic biases because
obviously if you playouts played perfectly, there would be no need for randonmess. And yet better than random playouts can also lead to worse
play in general.


- Don



_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to