Re: [computer-go] UCB1-Tuned distribution

Jason House Thu, 24 Jul 2008 11:26:10 -0700

On Jul 24, 2008, at 1:45 PM, John Stogin <[EMAIL PROTECTED]> wrote:

It seems that the UCB1-Tuned algorithm uses variance from a normaldistribution, however we believe it would be more optimal to usevariance from a beta distribution. Has any work been done in thisarea? Are people still using UCB1-Tuned to guide their explorationsof moves?

I removed it from my code a few weeks ago. It was partly to reducetemplate-based complexity and partly because it didn't really fit withmultiple win-rate estimators (e.g. RAVE, heuristics).

I recently derived a simple way to combine multiple multipleestimators, and I altered my code to use it.

A long time ago, I posted about using a priori knowledge for thedistribution of winning rates and deriving a beta distribution.

Thanks,
John Stogin
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] UCB1-Tuned distribution

Reply via email to