ofcourse you are correct, P = 1.0 is just the random player.  Obviously the
ELO as a function of P is going to be continuous.  So, being really close to
P=1.0 will make for a player that is only very slightly better than random.

I think it is also interesting to consider a player worse than random.  Take
your 1 trial MC program and instead of playing only moves than win, play
only moves that lose.  My guess is that the skill difference between this
program and random would be greater than between random and 1 trial MC, but
I would be interested to see a trial of this.

On 1/25/07, Stuart A. Yeates <[EMAIL PROTECTED]> wrote:



On 1/25/07, Don Dailey <[EMAIL PROTECTED]> wrote:
>
>
> I also had a difficult time producing a player that was less than
> 200 ELO stronger than a random player.   Even a single play-out,
> which seems hardly enough to discriminate between moves, is
> enormously stronger than a random player.    It was pretty much
> like this:
>
>    ASSUME computer is black


 0.  with probably P, play a random move (using the same selection
methodology as the random player)

   1. play 1 random game.
>
>    2. If black wins,  play one of the first N black moves in the
>       play-out  (all-as-first, for me it's some-as-first.)
>
>    3. If white wins, play one of the black move NOT in the play-out.
>
>    4. Crush a random player!
>


Surely by varying P, you can get a player arbitarily close to the random
player?

Or am I missing something?

cheers
stuart

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to