Re: [computer-go] How to "properly" implement RAVE?

Magnus Persson Wed, 21 Jan 2009 04:24:02 -0800

Quoting Thomas Lavergne <thomas.laver...@reveurs.org>:

  - the best play is good only if played immediately and very bad if
    played later in the game;
  - the first playout for this play resulted in a loss.
score and RAVE score will be very low and this play will never be
considered again until a very long time.



You raise an interesting concern.

The simple solution to your question is to add an exploration term using UCT for example. Then it becomes an empirical question what parameter for exploration gives the strongest play. My experience is that the best parameter is so small it can be set to zero.
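To make the role of the exploration parameter concrete, here is a minimal sketch of move selection with a UCB1-style exploration term. The function names, the `stats` layout, and the rule that unvisited moves are tried first are assumptions made for illustration, not taken from any particular program.

```python
import math

def uct_score(wins, visits, parent_visits, c):
    """Win rate plus a UCB1-style exploration bonus.

    c is the exploration parameter; c = 0 gives pure exploitation.
    """
    if visits == 0:
        # Assumption: unvisited moves are always tried first.
        return math.inf
    return wins / visits + c * math.sqrt(math.log(parent_visits) / visits)

def select_move(stats, c):
    """stats maps move -> (wins, visits); return the highest-scoring move."""
    parent_visits = sum(visits for _, visits in stats.values())
    return max(stats, key=lambda m: uct_score(*stats[m], parent_visits, c))

# Move "b" lost its only playout; move "a" has a 60% win rate.
stats = {"a": (6, 10), "b": (0, 1)}
greedy_choice = select_move(stats, c=0.0)     # "b" is not revisited
exploring_choice = select_move(stats, c=1.0)  # the bonus brings "b" back
```

With c = 0 the move that lost its first playout is only revisited if other statistics, such as its AMAF score, lift it again. Which value of c plays strongest has to be measured.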

I think the conditions you defined are very rarely completely fulfilled. What can often be true, however, is that a single bad move makes the best move very bad if played later in the game. If the bad move happens to be the second best move, it will be searched a lot, lowering the AMAF score (rw/rc) for the best move.

This is likely to happen when there are several local moves that more or less solve the same problem. That is, when one move is played, the effect of the other move played later will overlap with the first.
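The effect on the AMAF score (rw/rc) can be illustrated with a toy example. This is a sketch under assumed conditions: the playouts are invented, only our own side's moves are listed, and the helper names are hypothetical. Move "A" wins when played first but loses when it is played after "B".

```python
from collections import defaultdict

def amaf_scores(playouts):
    """AMAF score rw/rc: a move is credited wherever it occurs in a playout."""
    rw = defaultdict(int)  # AMAF wins
    rc = defaultdict(int)  # AMAF count
    for moves, won in playouts:
        for move in set(moves):
            rc[move] += 1
            rw[move] += int(won)
    return {m: rw[m] / rc[m] for m in rc}

def first_move_scores(playouts):
    """Ordinary win rate, credited only to the first move of each playout."""
    w = defaultdict(int)
    n = defaultdict(int)
    for moves, won in playouts:
        n[moves[0]] += 1
        w[moves[0]] += int(won)
    return {m: w[m] / n[m] for m in n}

# Invented data: "A" first wins once; "B" first (with "A" later) loses
# three times, because "B" is searched a lot.
playouts = [(["A", "B"], True)] + [(["B", "A"], False)] * 3

amaf = amaf_scores(playouts)          # "A" is dragged down to 1/4
direct = first_move_scores(playouts)  # "A" played first still scores 1/1
```

Here the three playouts that start with "B" all count against "A" in the AMAF statistics, even though "A" has never lost when played immediately.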


-Magnus


_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/
