Re: [computer-go] Re: Dynamic komi in commercial programs

Don Dailey Sun, 12 Jul 2009 18:34:35 -0700

On Sun, Jul 12, 2009 at 8:10 PM, Darren Cook <dar...@dcook.org> wrote:


> >> I would like to know what exact experiments with "virtual komi"
> >> have been made and why thay failed. ...
>
> I'm only aware of Don's experiment [1], which he admits he doesn't have
> any details for and only remembers: "I did a bunch of experiments and
> ALWAYS got a reduced wins when I faked the komi".
>
> On the other side we have some experiments by Kato-san [2] (where he
> reports a 100 ELO improvement over GnuGo, but only from a few tens of
> game) and a subjective experiment by Okasaki-san where he "reported Mogo
> played clearly stronger" on KGS [3].
>
> My own experiments are even more subjective and small-scale, and in the
> context of 9x9 endgames, not 19x19 handicap openings. However they were
> enough to make me think the technique is viable, but that if you don't
> adjust the komi down so the winning rate is near 50% it is wasted effort
> (*), and so you need to replay the same move over and over with
> different komi until you zero in on that point.
> *: I.e. the program still plays weak moves if you've only adjusted komi
> to go from 80% to 65%, or from 25% to 35%.
>
> >> kill all - instead you just overplay a little in order to catch up
> >> slowly but steadily.
> >
> > You just hit the nail on the head.   Dynamic komi does not encourage
> > a program to overplay the position.   Since you are starting from a
> > losing position you HAVE to overplay a bit.   You have to attack when
> > it is futile.
>
> If the handicap is correct then you don't really need to overplay. As
> the stronger player you might guide the game towards more complex
> positions to encourage more mistakes, but mainly you are just sitting
> around waiting for those inevitable mistakes.
>
> But, the real point of adjusting komi is it is an easy to understand way
> to overcome MCTS's problem when seeing all moves as winning/losing, and
> choosing effectively randomly instead of falling back on an opponent
> model as a human would do.
>
> Ingo's suggestion (of two buttons to increment/decrement komi by one
> point) was to make it easy for strong humans to test out the idea for us.


There is no question that if you provide a button to push,  all kinds of
positions will appear where this idea works.  Providing a button is not
nearly the same as providing an actual working algorithm that you can prove
is superior.

So if you can do this in a verifiable way I'll be interested.

- Don






>
> Darren
>
>
>
> [1]:
> http://computer-go.org/pipermail/computer-go/2008-August/015870.html
> [2]:
> http://computer-go.org/pipermail/computer-go/2008-February/014283.html
> [3]:
> http://computer-go.org/pipermail/computer-go/2008-August/015877.html
> _______________________________________________
> computer-go mailing list
> computer-go@computer-go.org
> http://www.computer-go.org/mailman/listinfo/computer-go/
>

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] Re: Dynamic komi in commercial programs

Reply via email to