[computer-go] Some ideas how to make strong heavy playouts

Magnus Persson Tue, 01 Apr 2008 09:08:31 -0700

A recurrent concept popping up in discussions on how to improve playouts is "balance". So I would like to try to share my philosophy behind the playouts of Valkyria, how I define "balance", and how it relates to the evaluation of go positions.


*Background

In an old school program the evaluation function would try to see which stones are safely connected to other stones of the same colour. Connected stones are called groups, and the program would probably also try to evaluate the safety of the groups by looking at the eyespace at hand, weak neighbouring groups and so on. This quickly gets very complicated. My old program Viking had several thousand handmade patterns for evaluating connectivity alone. This worked like a dream as long as the position consisted of patterns in the database... but in each and every game there were new situations, and new patterns had to be added. A more robust method would be to use tactical search in the program to evaluate connectivity. The problem there is to ensure accuracy efficiently. Any tactical search tends to either become too time consuming, or resort to guessing.


*MC-evaluation

Programs using uniformly random MC-eval favor very solid but inefficient shape, often building blocks of adjacent stones in the center of the board. The reason is that if stones are played more loosely, they often get cut off and killed in the simulations.

What we want instead is a program that can play efficient moves where stones are safely connected but occupy as much territory/eyespace as possible.

The tree search (UCT) alone cannot solve this problem. Shapes created in a 19x19 game may stay untouched until the late endgame, and it is not possible to read out all shapes on the board. It is much better if secure shapes stay stable in the simulation.

The way I implemented that in Valkyria is that the playout part is basically reactive to random moves that attack shapes on the board. It does not in any sense attempt to play the best move on the board. If it does not need to defend a shape it plays uniformly random somewhere. [Note that Valkyria also prunes really ugly moves, thus it plays the first uniformly drawn move that is not pruned]
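As a rough illustration of the reactive idea above, here is a minimal Python sketch. It is not Valkyria's actual code: the function name choose_playout_move and the two hooks find_defence and is_ugly are hypothetical stand-ins for whatever shape knowledge and pruning rules a real program has.

```python
import random

def choose_playout_move(legal_moves, last_move, find_defence, is_ugly, rng=random):
    """Pick one playout move: react to an attack if there is one, else play randomly.

    find_defence and is_ugly are hypothetical hooks supplied by the caller:
      find_defence(last_move) -> a defending move, or None if no shape was attacked
      is_ugly(move)           -> True if the move should be pruned
    """
    # Reactive part: answer the previous move only if it attacked a shape.
    reply = find_defence(last_move) if last_move is not None else None
    if reply is not None and reply in legal_moves:
        return reply

    # Nothing to defend: visit the moves in uniformly random order and
    # play the first one that is not pruned.
    candidates = list(legal_moves)
    rng.shuffle(candidates)
    for move in candidates:
        if not is_ugly(move):
            return move
    return None  # every move was pruned; the caller may treat this as a pass
```

Note that the function never ranks moves. It either answers an attack or draws at random, which is the difference from a policy that tries to predict the best move.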

This is also how the pattern system works in Mogo as I understand it. If I remember correctly I would say that all Mogo patterns are very basic and common sense defenses against attacks on otherwise stable shapes.

But there also has to be balance. Valkyria also punishes bad shape. That is, if a weak shape is already on the board, or a random move attacked two shapes simultaneously in the simulation, then the program may attack the weakness (or, in a way, it also reacts to the situation by defending against "the weak shape becoming stronger"). Often the same move that would have been the proper defence is played.



*Eliminating noise rather than predicting the best move

Nothing I wrote above is original or new to the readers of this list. But I want to make a distinction between a system that tries to predict the best move and a system that only tries to eliminate noise from the otherwise very random simulations.

Noise is eliminated when strong stones live and weak stones die almost always in the simulations. This way the random evaluations will mostly react to moves that matter in urgent fighting with shapes that are not yet stable. An MC-program that does this should stop defending and attacking strong shapes and would require far fewer simulations to discriminate between good and bad moves. Valkyria2 and Valkyria3 have slightly different tree search algorithms but use the same playouts. Both versions need only 512 playouts per move to win 50% against Gnugo 3.7.10.
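To give a feeling for why less noise means fewer simulations, here is a small back-of-the-envelope sketch in Python. It is a toy model of my own choosing, not a measurement from any program: it assumes that only a fraction "signal" of the playouts reflects the move being evaluated and that the rest are coin flips, and it uses the usual standard error of a win rate to estimate how many playouts are needed to tell two moves apart. The names playouts_needed and diluted and the threshold z=2.0 are assumptions.

```python
import math

def playouts_needed(p_good, p_bad, z=2.0):
    """Rough number of playouts per move so that the two win rates
    differ by about z standard errors of their difference."""
    delta = p_good - p_bad
    variance = p_good * (1 - p_good) + p_bad * (1 - p_bad)
    return math.ceil(z * z * variance / (delta * delta))

def diluted(p, signal):
    """Win rate seen by the evaluation when only a fraction `signal` of the
    playouts depends on the move and the rest are decided by a coin flip."""
    return signal * p + (1 - signal) * 0.5

# Toy comparison: the same pair of moves, evaluated with clean and noisy playouts.
clean = playouts_needed(0.6, 0.4)
noisy = playouts_needed(diluted(0.6, 0.5), diluted(0.4, 0.5))
print(clean, noisy)
```

In this toy model, halving the signal halves the win rate difference between the two moves, so the number of playouts needed grows roughly fourfold.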

Still I think predicting the best moves is very important in the tree part, but this may be much less important in the playouts, and perhaps even detrimental as some people have experienced.


*19x19

The best programs on 19x19 seem to focus the UCT search on local fighting. This temporarily overcomes the biases of the simulations locally. But the information gained locally about good shape in the tree is forgotten when the fighting moves elsewhere, and this knowledge then has to be rediscovered later if the fighting comes back. Could a future improvement to 19x19 go be to use locally narrow searches that seed the playouts with strong patterns for the current shapes on the board? Maybe someone is already doing this? A really strong approach would be to eliminate the need for hard coded patterns or offline pattern harvesting and let the program learn during the game.
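One possible reading of this idea, sketched in Python: keep a small table of local results gathered during the game and let the playouts consult it. Everything here is hypothetical (the class LocalPatternMemory, the way a "pattern" is keyed, the min_visits threshold); it is only meant to show the shape of such a mechanism, not a tested design.

```python
from collections import defaultdict

class LocalPatternMemory:
    """Remembers, per local pattern, how often each reply won during search."""

    def __init__(self):
        # (pattern, move) -> [wins, visits]; `pattern` is any hashable
        # description of the local shape (an assumption of this sketch).
        self.stats = defaultdict(lambda: [0, 0])

    def record(self, pattern, move, won):
        """Store the outcome of one simulation in which `move` answered `pattern`."""
        entry = self.stats[(pattern, move)]
        if won:
            entry[0] += 1
        entry[1] += 1

    def suggest(self, pattern, moves, min_visits=10):
        """Return the best remembered reply, or None if nothing is trusted yet.

        A reply is only suggested if it has enough visits and wins more
        than half of the time; otherwise the playout stays random.
        """
        best, best_rate = None, 0.5
        for move in moves:
            wins, visits = self.stats.get((pattern, move), (0, 0))
            if visits >= min_visits and wins / visits > best_rate:
                best, best_rate = move, wins / visits
        return best
```

A playout would call suggest() for the shape around the last move and fall back to its normal policy when it returns None. Because the table survives between moves, what was learned in one local fight would still be there if the fighting comes back.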


-Magnus
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/
