I just want to make some comments about MC evaluation to clear up some common misunderstandings.

I have seen some complaints about misevaluation, such as a program reporting a 65% chance of winning in a game which is lost, and the other way around. For example, arguments have been proposed along the lines of "since evaluation is so bad there has to be a better way of doing it".

I just want to point out one thing: any winrate except 0% and 100% is wrong assuming perfect play. 1% and 99% (or anything in between) means that the program is not sophisticated enough to either a) always play a winning move in the simulations, or b) search deep enough to solve the game, proving a win/loss.

*BUT*

Having an incorrect absolute evaluation is unimportant, as long as the program plays the best move. What really matters is which move has the highest winrate relative to all other candidate moves.
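To make that concrete, here is a minimal sketch (in Python, not taken from any particular program) showing that the move actually played depends only on the relative ordering of the root winrates, not on their absolute values:

  # Hypothetical illustration: pick the root move with the highest
  # winrate; miscalibrated absolute values do no harm by themselves.
  def choose_move(root_stats):
      # root_stats: {move: (wins, visits)} collected from the simulations
      def winrate(stats):
          wins, visits = stats
          return wins / visits if visits > 0 else 0.0
      return max(root_stats, key=lambda move: winrate(root_stats[move]))

Whether the best move reads 52% or 97% makes no difference here, as long as it stays above the alternatives.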

MC programs with no knowledge except avoiding filling their own eyes often evaluate all positions as being very close to 50%. As soon as one adds appropriate knowledge there is a much larger range between the best and the worst move on the board. Normally this range is correlated with the strength of the program. (Be aware that buggy programs might have even larger ranges, though.)

Also, with UCT the winrate at the root has little to do with any objective probability of the program winning. If one looks at the principal variation at the end of the game, one will notice a large difference in winrate at each depth. The winrate at the root changes very slowly even when it is 0% or 100% at the leaves, but the relative ordering of the moves at the root is still often correct.
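The reason is visible in the bookkeeping. Below is a stripped-down sketch of UCT node statistics and backup (a toy illustration only; the result is taken from one player's point of view and the usual sign flip between plies is left out). The root averages over all simulations made so far, so it drifts slowly, while a nearly solved leaf with only a few visits jumps straight to 0% or 100%:

  import math

  class Node:
      def __init__(self):
          self.wins = 0.0
          self.visits = 0
          self.children = {}   # move -> Node

      def winrate(self):
          return self.wins / self.visits if self.visits else 0.5

      def uct_child(self, c=1.0):
          # standard UCT selection: exploitation term plus exploration bonus
          return max(self.children.items(),
                     key=lambda kv: kv[1].winrate()
                     + c * math.sqrt(math.log(self.visits) / (kv[1].visits + 1)))

  def backup(path, result):
      # the same 0-or-1 playout result is added to every node on the path;
      # a root with thousands of visits barely moves, a leaf with a handful
      # of visits swings quickly toward 0% or 100%
      for node in path:
          node.visits += 1
          node.wins += result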

*FINALLY*

The use of MC-eval and UCT is *completely orthogonal* to using Go knowledge in the program. You can add any kind of knowledge at all stages of the basic search algorithm and possibly benefit from it.
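As a sketch of what I mean by "all stages" (pattern_prior and playout_weight below are placeholder stubs standing for whatever knowledge one actually has, not any real program's API), knowledge can for example seed the statistics of newly expanded children, or bias the move choice inside the playouts, without touching the search itself:

  import random

  def pattern_prior(move):
      # placeholder stub: pretend every move gets a weak prior of 5 wins in 10 visits
      return 5.0, 10

  def playout_weight(board, move):
      # placeholder stub: uniform weights, i.e. no knowledge at all
      return 1.0

  def init_child_stats(legal_moves):
      # knowledge hook (a): seed each new child with prior wins/visits so
      # that moves the knowledge likes are explored first
      return {move: list(pattern_prior(move)) for move in legal_moves}

  def playout_move(board, legal_moves):
      # knowledge hook (b): bias the playout policy toward the moves the
      # knowledge prefers instead of picking uniformly at random
      weights = [playout_weight(board, move) for move in legal_moves]
      return random.choices(legal_moves, weights=weights, k=1)[0]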

The problem is engineering. If your implementation of the knowledge is too slow, the program gets weaker with fixed time limits. The knowledge you added might also make the program weaker for a number of other reasons.

Arguments such as "I saw program X misplay situation Y and therefore MC-eval is flawed" are just plain wrong. That just means the specific program X has a flaw, nothing else.

What one can argue is: "I wrote a program with a nice new method for evaluation and a new approach to searching that plays situation Y correctly, and it also happens to beat program X all the time using the same hardware." Until I see that argument, I will continue to believe that methods similar to MC-eval and UCT search are the future of computer Go.

-Magnus
