>My first monte carlo programs used ownership info very effectively, but it >can be that by using >AMAF this information is used already.
As a relative beginner in these matters, the more I look at AMAF, the less I like it, and I think that has to do with AMAF ignoring relative sequencing. By looking at all moves as if they were played first, it ignores that some moves only make sense after certain other moves. I haven't yet been able to pin this down with evidence, because the "the game is lost, all moves are equally bad" effect tends to obscure it (I guess I need to insert "resign" moves at the right places to limit the game records to the interesting pieces, even if that means my bot can't score the result). If I ignore that the Go board is 2d (dimensions, not dans;-), and think only of time (number of moves), space (board positions), and alternatives (simulation runs), then ownership maps project this 3d spacetime multiverse onto a 2d space histogram (likelihood of each position being black/white) while AMAF tries to project these three dimensions onto a 2d time histogram (likelihood of each move resulting in win/lose). As the various tweaks (don't count moves by other player, don't count moves already played, value early moves higher than later moves) and remaining bot blunders demonstrate, that doesn't quite work right, in spite of first appearances. My (as yet unproven) hypothesis is that the issues stem from AMAF trying to ignore the time dimension of its projection, hence you get highly rated moves that place themselves into atari (because that would have been the last, necessary, move to capture the opponent group, *after* surrounding it), that would be suicide right now (but not at some later point in many playouts), or even on an occupied position (no longer occupied later in many playouts), that try to fill an opponent's eye (which would make sense only if the second, nearly complete eye were prevented first, but that doesn't always happen in the same way, hence lower rating for each of the possible ways). Giving priority to early moves alleviates this somewhat, as the experiments show, but then you still don't get perfect loss or win results, even if the board is effectively owned by one of the players, because the tweaks have removed some data from the statistics to cover the worst holes. There should be a variation of AMAF that records move win rates in move sequences ("these moves are good, but this one always comes after that one; never try this one unless you are sure you can make that one as well"), preserving more of the structure in the time dimension of its projection without just reproducing the whole 3d input data space. Given that ownership maps don't suffer from this kind of trouble (unless one collapses their information to score, which has been said to perform badly), it is surprising that they are hard to make good use of. It might just be a matter of asking different questions: AMAF variants try to answer which move is likely best to make first; ownership maps answer which positions are likely to be mine in the end. The latter is not as directly useable, at least not as long as we are only asking about the next move. That continues with UCT: apart from offering scaffolding for introducing more Go knowledge, it also introduces a way of interleaving use of playout information with continued generation of it, but it tends to do so in a way biased towards move sequences. Again, not the kind of question that ownership maps can offer answers to, but that just means that there are other questions we are not asking yet. Perhaps there is a complement to UCT/Amaf that achieves such interleaving based on territorial information? One could allocate the first half of simulation runs to acquiring an ownership map, then use that information to narrow down the second half of simulations to the more interesting parts of the board (perhaps less than half the runs are needed to weigh the rest successfully?). Claus PS. Is there a location where one can find the current versions of the plain Amaf reference bots? I seem to be reaching the stage where issues I find are not always in my own code, but I don't know whether, eg, JrefBot 081016-2022 is the latest version). _______________________________________________ computer-go mailing list computer-go@computer-go.org http://www.computer-go.org/mailman/listinfo/computer-go/