Re: [computer-go] Re: Scalbility study: low end

Hideki Kato Thu, 24 Jan 2008 16:33:01 -0800

steve uurtamo: <[EMAIL PROTECTED]>:
>it's a natural tendency to look for patterns
>in data as soon as you have any data at
>all.  some of these patterns i'd be willing to
>bet will hold up over time -- but the bayesian
>in me would say that is simply because they have
>further given evidence for my prior beliefs.
>
>requiring everyone to stay out of everyone
>else's first standard deviation will take quite a
>few more trials, and probably won't change the
>order of the existing mogos 1-12, for instance.
>
>if you were to plot these datapoints with their
>1st or 2nd std. dev. errorbars and look at the possible
>set of curves that you could fit through them, though,
>it'd give quite a funny story, i'd agree.  :)
>
>95% "confidence" is a bit misleading and overrated,
>in my opinion.


Oh, I intended to say that this is 68% of confidence interval and if 
you want 95% then ...   Thank you for pointed out.

-Hideki

>s.
>
>----- Original Message ----
>From: Hideki Kato <[EMAIL PROTECTED]>
>To: computer-go <computer-go@computer-go.org>
>Sent: Thursday, January 24, 2008 8:34:42 AM
>Subject: [computer-go] Re: Scalbility study: low end
>
>
>Heikki,
>
>The numbers of games are about 200 and their ratings' standard 
>deviations (right of Elo) are 70 to 100, right now.  To get 95% of 
>reliability, you have to double them.  Don't you think it's too early 
>to conclude any?
>
>-Hideki
>
>Heikki Levanto: <[EMAIL PROTECTED]>:
>>Everyone is looking at the top end of the scalability study
>>   http://cgos.boardspace.net/study/
>>
>>But what happens in the low end? Both programs show linear progress to
> begin
>>with, then a corner, and more (almost?) linear development.
>>
>>Fatman's curve has a clear break at 3 doublings, when it suddenly
> starts to
>>improve much slower than before. This goes on until 12 doublings,
> after which
>>we get the mysterious decline.
>>
>>Mogo's curve is pretty well linear to 4 doublings, after that there is
> more
>>variation (I suppose random), but the overall scope is clearly not
> what it
>>was below 4.
>>
>>
>>It is possible that both programs have a subtle bug that starts to
> disturb
>>results around this point, but I find it quite unlikely.
>>
>>The breaks happe at 1350 - 1550 ELO points. Isn't that about the level
> where
>>plain MC stops improving with more playouts?  Would be fun to see if
> we could
>>isolate the playout parts of those programs, and let them play pure
> MC. My
>>guess is that they would end up around this level.
>>
>>
>>Could it be that there are other limiting factors higher up? Perhaps
> Fatman
>>is hitting the next one around 12 doublings, and Mogo will follow at
> 14 or
>>15... We will see that in a few days, when the new Mogos join the
> study and
>>start producing results.
>>
>>   - Heikki
>--
>[EMAIL PROTECTED] (Kato)
>_______________________________________________
>computer-go mailing list
>computer-go@computer-go.org
>http://www.computer-go.org/mailman/listinfo/computer-go/
>
>
>
>
>
>      
> ____________________________________________________________________________________
>Looking for last minute shopping deals?  
>Find them fast with Yahoo! Search.  
>http://tools.search.yahoo.com/newsearch/category.php?category=shopping
>_______________________________________________
>computer-go mailing list
>computer-go@computer-go.org
>http://www.computer-go.org/mailman/listinfo/computer-go/
--
[EMAIL PROTECTED] (Kato)
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] Re: Scalbility study: low end

Reply via email to