Re: [computer-go] How to "properly" implement RAVE?

Mark Boon Wed, 21 Jan 2009 06:21:43 -0800


On Jan 21, 2009, at 11:53 AM, Olivier Teytaud wrote:

Here, we have a non-zero initialization of the number of wins, ofthe numbere of simulations, of the number of Rave-wins, of thenumber of Rave-losses.We have then a 0 constant for exploration, but also an exploratoryterm which is very different, and for which I am not the main author- therefore I let the main author
give an explanation if he wants to :-)
I point out that even before this exploratory term, the best UCB-like exploration-constant was 0 - as soon as the initializations ofnumbers of wins, of losses, of Rave-wins, of Rave-losses areheuristic values.

I'd like to make sure I understand what you mean exactly. You use someheuristics to intialize all the moves (or maybe some of the moves)with a certain win-loss and rave-win-loss ratios?

To a certain extent I suppose these could come from the reading of theprevious move? I think I slowly start to make sense of things...


Mark



_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] How to "properly" implement RAVE?

Reply via email to