Łukasz Lew wrote:
> If the weight in RAVE formula is near 1 in one child of tree and near
> 0 in other then you basically compare RAVE value to regular average
> value, which might be comparing apples to oranges.

Yes, and this can cause problems in practice. There's been some
discussion of this before.

In positions where the RAVE values tend to be too high, the effect is
that moves with few visits will be favoured, which will then equalise
the RAVE weight again. The effect is rather like temporarily increasing
the exploration coefficient, and nothing very bad happens.

But in positions where the RAVE values are too low (which mostly means
positions where the current player is winning), the effect is worse: the
program will be reluctant to explore different moves, and this time
there is positive feedback (the RAVE weights will diverge) and so the
situation won't correct itself.

-M-
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to