>My first monte carlo programs used ownership info very effectively,  but it 
>can be that by using 
>AMAF this information is used already.

As a relative beginner in these matters, the more I look at AMAF,
the less I like it, and I think that has to do with AMAF ignoring
relative sequencing. By looking at all moves as if they were played
first, it ignores that some moves only make sense after certain other
moves. I haven't yet been able to pin this down with evidence,
because the "the game is lost, all moves are equally bad" effect
tends to obscure it (I guess I need to insert "resign" moves at the
right places to limit the game records to the interesting pieces, even
if that means my bot can't score the result).

If I ignore that the Go board is 2d (dimensions, not dans;-), and think
only of time (number of moves), space (board positions), and alternatives
(simulation runs), then ownership maps project this 3d spacetime multiverse
onto a 2d space histogram (likelihood of each position being black/white)
while AMAF tries to project these three dimensions onto a 2d time
histogram (likelihood of each move resulting in win/lose).

As the various tweaks (don't count moves by other player, don't count
moves already played, value early moves higher than later moves) and
remaining bot blunders demonstrate, that doesn't quite work right, in
spite of first appearances.

My (as yet unproven) hypothesis is that the issues stem from AMAF
trying to ignore the time dimension of its projection, hence you get
highly rated moves that place themselves into atari (because that would
have been the last, necessary, move to capture the opponent group,
*after* surrounding it), that would be suicide right now (but not at some
later point in many playouts), or even on an occupied position (no longer
occupied later in many playouts), that try to fill an opponent's eye (which
would make sense only if the second, nearly complete eye were prevented
first, but that doesn't always happen in the same way, hence lower rating
for each of the possible ways).

Giving priority to early moves alleviates this somewhat, as the experiments
show, but then you still don't get perfect loss or win results, even if the
board is effectively owned by one of the players, because the tweaks
have removed some data from the statistics to cover the worst holes.

There should be a variation of AMAF that records move win rates in
move sequences ("these moves are good, but this one always comes
after that one; never try this one unless you are sure you can make that
one as well"), preserving more of the structure in the time dimension
of its projection without just reproducing the whole 3d input data space.

Given that ownership maps don't suffer from this kind of trouble (unless
one collapses their information to score, which has been said to perform
badly), it is surprising that they are hard to make good use of. It might
just be a matter of asking different questions: AMAF variants try to
answer which move is likely best to make first; ownership maps answer
which positions are likely to be mine in the end. The latter is not as directly
useable, at least not as long as we are only asking about the next move.

That continues with UCT: apart from offering scaffolding for introducing
more Go knowledge, it also introduces a way of interleaving use of
playout information with continued generation of it, but it tends to do so
in a way biased towards move sequences. Again, not the kind of
question that ownership maps can offer answers to, but that just means
that there are other questions we are not asking yet.

Perhaps there is a complement to UCT/Amaf that achieves such
interleaving based on territorial information? One could allocate the
first half of simulation runs to acquiring an ownership map, then use
that information to narrow down the second half of simulations to
the more interesting parts of the board (perhaps less than half the
runs are needed to weigh the rest successfully?).

Claus

PS. Is there a location where one can find the current versions of
    the plain Amaf reference bots? I seem to be reaching the stage
    where issues I find are not always in my own code, but I don't
    know whether, eg, JrefBot 081016-2022 is the latest version).



_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to