[computer-go] ICML paper

David Silver Wed, 23 Apr 2008 13:58:08 -0700

Hi all,

Our paper "Sample-Based Learning and Search with Permanent and Transient Memories" has been accepted for publication at ICML 2008. Once again, I would really appreciate any feedback or comments on the paper.


http://www.cs.ualberta.ca/~silver/research/publications/files/dyna2.pdf

ICML is a technical conference on machine learning, and the paper assumes some familiarity with machine learning terminology. Having said that, I hope that the main ideas are understandable by everyone on this list! In particular I've heard several people ask questions like:


-How can we generalise between different positions during search?
-How can we learn "on the fly" about the value of different patterns?
-How can we combine learned knowledge with search?

-Are there more efficient algorithms than Monte-Carlo for updating the value of states or patterns?
-What's so special about UCT? Can we get similar or better performance by using other sample-based search algorithms?

The Dyna-2 architecture described in this paper provides an approach for answering these questions.

Hopefully this will pique your interest enough to read the paper :-)

Thanks!
-Dave

