Re: [computer-go] Monte-Carlo Tree Search reference bot

Michael Williams Tue, 02 Dec 2008 21:17:14 -0800

That's going to repeat the same exact path through the tree three times, isn't it? If so, it seems like it would be more efficient to do N playouts from the leaf after each traversal of the tree.
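
A minimal sketch of the "N playouts from the leaf" idea, for illustration only. The names playouts_from_leaf, backup and run_playout are hypothetical, nodes are plain dicts, and wins are counted from a single player's perspective to keep the example short (a real tree would flip the perspective at each level).

```python
def playouts_from_leaf(leaf_state, run_playout, n):
    """Run n playouts from the same leaf and return how many were wins.

    run_playout is a hypothetical callback: it takes a position and
    returns True for a win, False for a loss.
    """
    return sum(1 for _ in range(n) if run_playout(leaf_state))


def backup(path, wins, n):
    """Add the batched result to every node on the traversed path.

    The tree is descended once, and the n playout results are
    propagated in a single pass instead of n separate passes.
    """
    for node in path:
        node["visits"] += n
        node["wins"] += wins
```

The trade-off is that all N playouts share one leaf, so the statistics along the path are updated less often than with N separate traversals.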


[EMAIL PROTECTED] wrote:

There is another speedup trick that might interest you. IIRC Lukasz Lew came up with it, but I forget what he called it. After calculating the selected move for an internal node (going through UCT and RAVE and whatnot), store that move. Then, for the next N visits to that node (where N is 5 or 10 or so), just repeat that move without having to calculate what the move would be.
- Dave Hillis
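
The trick described above might be sketched roughly as follows. This is an illustration under assumptions, not Lukasz Lew's actual code: the Node class, select_move and the pick_best callback are hypothetical names, and pick_best stands in for whatever UCT/RAVE selection the engine normally performs.

```python
class Node:
    """Tree node that remembers its last selected move."""

    def __init__(self):
        self.cached_move = None
        self.cache_uses_left = 0


def select_move(node, pick_best, reuse=5):
    """Return the stored move for the next `reuse` visits, then recompute.

    pick_best(node) is the expensive selection step; it is only called
    when the cached move has been used up.
    """
    if node.cache_uses_left > 0:
        node.cache_uses_left -= 1
        return node.cached_move
    node.cached_move = pick_best(node)
    node.cache_uses_left = reuse
    return node.cached_move
```

With reuse=5 the expensive selection runs once every six visits to a node; the statistics still get updated on every visit, only the choice is delayed.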

-----Original Message-----
From: Mark Boon <[EMAIL PROTECTED]>
To: computer-go <computer-go@computer-go.org>
Sent: Tue, 2 Dec 2008 11:17 pm
Subject: Re: [computer-go] Monte-Carlo Tree Search reference bot
I have made some minor performance improvements and this is as far as I intend to take this particular project. I might make some small changes if necessary, but most likely I'll leave this largely unchanged from now.

I had set myself as an arbitrary goal that it should do at least 20K playouts. But with real liberties, AMAF and a RAVE formula I got stuck in the 16K-17K range. According to my profiler that is mostly due to the expensive formula used to compare nodes, where it says it spends 25% of total time. The formula I'm using is:

    beta * (virtual-win-ratio + RAVE) + (1-beta) * (win-ratio + UCT)

    beta = 1 - log(parent-visits) / 20
    UCT  = exploration-factor * sqrt( log(parent-visits) / (nr-visits+1) )
    RAVE = exploration-factor * sqrt( log(parent-visits) / (nr-virtual-visits+1) )

There are probably quite a few possibilities still to tune this program with regards to playing strength and performance. But I felt it doesn't help to obscure the implementation by too many details.

The implementation of the search algorithm was entirely game-independent, until I introduced AMAF. I didn't see how to fix that, as there's no way getting around that it's based on the fact that a move is represented by a single coordinate, which is mapped to an array to store the statistical values. But strip the AMAF part, which is a block of 30 odd lines of code, and I think it can be used for other games basically as-is. I did this not because I ever see myself program another game, but because in my experience in doing so I get a cleaner separation between modules.

At 2,000 playouts, it's still quite a bit weaker than the plain MC-AMAF refbot. It wins only about 33%. But that's probably because the 1,000-2,000 playouts range is the sweet-spot for that particular type of playing engine. It doesn't scale from there, whereas the MCTS ref-bot only just gets warmed up with 2,000 playouts.

This leads me to a question. I suppose it might be of some interest to put this bot up on CGOS. But what parameters to use? The main one being the number of playouts, naturally.

Mark
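
For readers who want to experiment with the node-comparison formula quoted in this message, here is a small sketch in Python. It is not the refbot's code. The function names, the default exploration factor of 1.0, the use of the natural logarithm, and the treatment of an unvisited node's ratio as 0.0 are all assumptions made for the example.

```python
import math


def ratio(wins, visits):
    # Assumption: a node with no visits gets ratio 0.0; the email
    # does not say how that case is handled.
    return wins / visits if visits > 0 else 0.0


def node_score(wins, visits, virtual_wins, virtual_visits,
               parent_visits, exploration_factor=1.0):
    """Combined RAVE/UCT score following the formula in the email.

    Requires parent_visits >= 1 so that the logarithm is defined.
    """
    log_parent = math.log(parent_visits)  # assumed natural log
    beta = 1.0 - log_parent / 20.0
    uct = exploration_factor * math.sqrt(log_parent / (visits + 1))
    rave = exploration_factor * math.sqrt(log_parent / (virtual_visits + 1))
    return (beta * (ratio(virtual_wins, virtual_visits) + rave)
            + (1.0 - beta) * (ratio(wins, visits) + uct))
```

Since log(parent-visits) is the same for every child of a node, computing it once per parent rather than once per child is one obvious way to cut the cost the profiler reports.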

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


