Re: [computer-go] Re: Explanation to MoGo paper wanted.

chrilly Thu, 05 Jul 2007 23:13:42 -0700

I think one of the problems is in testing. Currently we have almost
no way to judge whether a improvement is good or bad, other than
playing a lot of games against GNU Go. It takes very long time and
seems inefficient. Moreover, even it may not be a very good method.
GNU Go often cannot respond to an obvious bad move correctly, so
pruning such moves decrease the winning rate.

This is THE problem in game programming. To measure progress. Usually animprovement is worth 10 Elo. It takes about 1000 games to determine withstatistical significance such an improvement. Usually one does not make 1000games, 100 games are already quite a lot. One chooses often not the best butthe most lucky version. If one version has an especially good result I rerunthe test-matches under different conditions (time setting).

Only if the results are repeatable, the version is considered best.

If an improvement is worth 100 Elo, there is no need for extensive testing.One sees this immediatly. In fact also smaller improvements are in the endchosen by intuition/feeling.

In Go things are insofar worse as there is only one standard sparringpartner, Gnu-Go. This creates severe inbreeding effects. In chess there wasa similar problem. There were more strong opponents around, but over theyears they become very similar. Suddenly there was a new programm, Rybka,which plays different and all the inbreedings have a lot of difficulties.

I think there is no better way. One can do some pre-filtering with testpositions. If a version is especially bad in these tests, one can ignore it.But being good in test positions and in games are different things.


Erdstrahlen:

Jan Louwman was a fanatic tester. His small house was full ofboard-computers. He played by hand 20 games at once (we are in the pre-PCcomputer chess times).He always reported spectacular results for the programms of Ed Schroeder.But when the programms went to market, nobody could replicate Jans results.The programms were strong, but not spectacular. Thomas Mally of the Vienneschess magazine Module explained this with the different natural radiation(German "Erdstrahlen") in Rotterdam and elsewere. Eds programm wereoptimized for this "Erdstrahlen". The "Erdstrahlen-Theorie" become a runningjoke in the chess-community. Whenever 2 testers reported quite differentresult, it was "explained" by the different amout of "Erdstrahlen".

It is impossible to play by hand 1000 games for each version. Jan usuallyplayed with 30 sec. or 1 min/move. It would have taken forever. Hisspectacular version was just a very lucky one. If you play enough, youalways get one. But his testing was certainly a significant contribution tothe development of Rebel. And it was a very good medicine for Jan. He wouldhave died much earlier without this testing.


Chrilly

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] Re: Explanation to MoGo paper wanted.

Reply via email to