I guess it really depends on what the point of the test is.  I'm trying to
understand the performance gap between my AMAF bot(s) and Don's AMAF bots.
For comparison, here's the ratings and # of simulations:

ELO
 1434  - ControlBoy    - 5000 simulations per move
 1398  - SuperDog      - 2000 simulations per move
 1059  - ReadyFreddy - 256 simulations per move
 763    - Dodo             - 64 simulations per move
<600   - all my amaf    - 5000-25000 simulations per move
<300   - ego110_allfirst- ???

Looking at the cross table with ReadyFreddy, running (that's doing 5% of the
work that my bots are), the results are 0/14, 0/20, 0/24, and 0/10.  Even
with the small samples, I'm quite certain that the performance of my bot is
way worse than any of Don's.

I'm not particularly concerned if alternate eye method #1 is marginally
better than #2 (or vice versa).  I'm reasonably confident that their
performance is similar and that their performance is better than my original
method.

I'm content for now to find out the major causes of performance gaps and
then revisit what is truly the best combo when I get around to doing quality
coding of features instead of quick hacks for testing.  Currently, both the
random move selection strategy and the game scoring strategy have come under
question.

On 9/20/07, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote:
>
> I'm going to echo Cenny's comment. Small samples like this can be very
> misleading. For this kind of test, I usually give each algorithm 5000
> playouts per move and let them play 1000 games on my computer. It takes
> about a day and a half.
>
> - Dave Hillis
>
>
> -----Original Message-----
> From: Cenny Wenner <[EMAIL PROTECTED]>
> To: computer-go <computer-go@computer-go.org>
> Sent: Tue, 18 Sep 2007 3:33 pm
> Subject: Re: [Housebot-developers] [computer-go] ReadyFreddy on CGOS
>
> By the data in your upper table, the results need to uphold their mean
> for 40 times as many trials before you even get a significant*
> difference between #1 and #2.
>
> Which are the two methods you used?
>
> On 9/18/07, Jason House <[EMAIL PROTECTED]> wrote:
> > original eye method = 407 ELO
> > alt eye method #1   = 583 ELO
> > alt eye method #2   = 518 ELO
> >
> > While both alternate methods are probably better than the original, I'm not
> > convinced there's a significant difference between the two alternate
> > methods.  The cross-tables for both are fairly close and could be luck of
> > the draw (and even which weak bots were on at the time).  I put raw numbers
> > below.  Since I made one other change when doing the alt eye method, I
> > should rerun the original with that other change as well (how I end random
> > playouts and score them to allow for other eye definitions).
> >
> > While I think the alternate eye definitions helped, I don't think they
> > accounted for more than 100-200 ELO
> >
> > vs ego110_allfirst
> > orig= 33/46 = 71%
> > #1 =  17/20 = 85%
> > #2 =  16/18 = 89%
> >
> > vs gotraxx-1.4.2a
> > orig=N/A
> > #1 = 2/8   = 25%
> > #2 = 3/19 = 16%
> >
> >
> > On 9/17/07, Jason House <[EMAIL PROTECTED] > wrote:
> > >
> > >
> > >
> > > On 9/17/07, Don Dailey <[EMAIL PROTECTED]> wrote:
> > > > Another way to test this, to see if this is your problem,  is for ME to
> > > > implement YOUR eye definition and see if/how much it hurts AnchorMan.
> > > >
> > > > I'm pretty much swamped with work today - but I may give this a try at
> > > > some point.
> > > >
> > >
> > > I'd be interested in seeing that.  It looks like my first hack at an
> > alternate eye implementation bought my AMAF version about 150 ELO (not
> > tested with anything else).  Of course, what I did isn't what others are
> > using.  I'll do another "alteye" version either today or tomorrow.  It may
> > be possible that some of my 150 was because I changed the lengths of the
> > random playouts.
> > >
> >
> >
> > _______________________________________________
> > computer-go mailing list
> > computer-go@computer-go.org
> > http://www.computer-go.org/mailman/listinfo/computer-go/
> >
>
>
> --
> Cenny Wenner
> _______________________________________________
> computer-go mailing list
> [EMAIL PROTECTED]://www.computer-go.org/mailman/listinfo/computer-go/
>
>  ------------------------------
> *Check Out the new free AIM(R) 
> Mail*<http://o.aolcdn.com/cdn.webmail.aol.com/mailtour/aim/en-us/index.htm>-- 
> Unlimited storage and industry-leading spam and email virus protection.
>
> _______________________________________________
> computer-go mailing list
> computer-go@computer-go.org
> http://www.computer-go.org/mailman/listinfo/computer-go/
>
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to