Re: [computer-go] 2008 World 9x9 Computer Go Championship in Taiwan

Vincent Diepeveen Wed, 02 Jul 2008 19:55:05 -0700

From 1997 and onwards i managed to join in the computer chess worldchamps every year.

Besides the participants Stefan MK (Shredder), Shay Bushinsky andAmir Ban (both junior)and tournament director Jaap van den Herik and Joke Hellemons who isdoing the entireorganisation from ICGA side; besides that 'hard core' addictedcomputerchess guys and woman,all of them excellent company also in restaurants, the other few whoshow up are there at most

just for a year or 3.

If we forgive sometimes that people cannot show up 1 or 2 events,then we can also add

Gerd Isenberg (who might be missing in Beijing) and David Levy.

Especially David is even better company in a restaurant, sometimeshe pays the bill.

Note that in ICGA events it is very easy to find the board (Jaap,David and Joke).

In Paderborn 1999 someone needed them, so we adviced the guy to lookon the internet and search for the most expensivehotel and most expensive restaurant; indeed in hotel Arosa the ICGAboard had dinner.

I do not blame authors to not show up at events. Showing up isexpensive, and most authors have many talents and getasked for many different type of events. Most events you have to payfor.

Most of them only show up when their engine is 'in big shape'. Youcan be sure that the strongest engines also join an event.An author knows pretty well when his engine makes a chance of doingwell. The same will be true for computer-go.The programmers who know they make a chance of winning, they all willbe there for sure.

From the above elite group of ICGA members/participants that showsup a lot at the computerchess, at least 1 is not bribable.

Maybe that's why i am looking for a job now. Even having no surenessof income as of now,it would not occur to me, not even at gunpoint, to recognize someother event as the official world championship,when it is clear that the entire organisation was officially plannedand scheduled for China.

It is not a problem when others organize events, in contradiction,when others also organize events that's even better.

It is nice to have many events, that makes a science more popular.

Yet do not forget that the real and official event is in Beijing.

That aside, every year, when the location is far away, it is tougherto participate;the struggle to find a sponsor paying my bills is very tough.Sometimes there is years i succeed in that,

other years i pay it myself.

The ticket costs i have are far larger than the amount of money ICGApays back. Add a hotel and extra travel time (asthe airplane to it needs like 10 hours or so, so you lose quickly 2days). Additionally being born in a nation where it hardlyever gets hot, i cannot stand heat very well. I figured that out inTel Aviv 2004.

In computerchess there is another cost, that most of you ignore.Hardware. Easy if you come from Japan, USA, Germany, UK or France.I come from a tiny nation. There is no highclocked 8 core machineshere that i can get from sponsors to test and play at in this tiny

nation of 16.5 million inhabitants.

Now that computer-go gets slowly more mature in this sense thatengines start to beat slowly more human beings,and not soon from now will beat 99% of all go-players, after which itwill take endless until you can conclude objectivelythat they are stronger. Of course for the average Joe on the street,he will conclude 10 years before it happens that themachine is stronger, as the go world will figure out within a year or10 from now that the human side has a major weaknessthat will cause it to lose 10 years sooner than it should: a humanbeing is in big need of money and therefore lose for money.This is partly because of arrogance. They do not prepare and playsilly against the machine, usually several 10 kyu movesget made during the game, to still let the game look interesting andespecially to give the sponsor an interestingresult. This instead of beating it bigtime. The stronger the player,the more dumber social. It will be one of those players who will lose.If the best player cannot get bribed, like Topalov refused a matchunder the conditions offered,they'll skip and wait until a next world champion is there who isbribable (why do those always have a Russian passport?).

Even winning 10 matches after by human side, or losing games to tinyGrandmasters, that doesn't matter. The first match a world topplayer loses will get all attention. The damage will be done and theaverage Ye on the street will consider the machine stronger.Ye is always right; this is because Ye follows the first marketingprinciple. The first marketing principle says:

        "It is better to be the first than the best"
        (Al Ries & Jack Trout)

For a titled player like me, comparable to about 1-4 dan professional(i'm 4 dan only and only when a game is real important to my team,in all others i'm at most 1), i needed a bunch of years to draw thatconclusion; it was rather hard to swallow.

Kasparov lost in 1997 from Deep Blue. A 10 ply searching chess enginein the opening, forward pruning in the last few plies even.Todays go programs search deeper than that, to give an example. Inthe year 2000 when eating in a fastfood Hamburger restaurant,i played a blindfolded match against a 10 ply searching program.Though very bad and from France, i beated it blindfolded.


You get the picture.

Now in 2008 i'm pretty sure machine is stronger.

Not because it is anyhow qualitatively better than me, but sometimesi make mistakes in those bizarre positions they play,and then you're dead against a machine making a single big mistake.Go is a game where that type of play is possible even better.

They create total dubious chaos at the board in a dubious manner, butnot dubious enough to not lose directly,and then when being in that chaos, the machine will toast you whenthe number of possibilities get too much.


Go is a far better game objectively to create such chaos.

Thiefery, cheating, bribing and losing for money works. In the 21stcentury, Kasparov still gotmatch offers against the machine. Both for being moneygrabbers, mayhe and Fischer rot in hell.

Sometimes there is some copycats from the middle east, who producesome sort of a machine with FPGA cards which are

very low clocked.

Now we can of course first do a technical discussion. Namely thatshowing up with low clocked fpga cards in 2004 around 30Mhzis pretty pathetic. Even if you get 3-5 million nps, that's stillpathetic considering its design; it shares that crappy design withdeep blue.

Not ordering moves in a good manner and no caches loses you soonfactor 10. It only searched 18 ply worst case in a dubious manner.

Engines in 2005 single core to quadcore already got 18 ply.

An engine in hardware has huge if you either do one of the next 2things:


a) incorporate huge amounts of knowledge
b) making a multicore hardware processor swallowing really a big surface
     that is high clocked and has caches in hardware

Deep Blue and Hydra/Brutus failed in both respects.

The fast chessprograms (so the ones with tiny evaluation function,and as far as i know my chessprogram Diep is the only one with areally bigevaluation function), when programmed in C, so not even fully inassembler, are around 1000 cycles a node.So at a todays overclocked 8 core machine of 4Ghz, their onlylimitation in speed is the memory controller.Achieving a 10-20 million nps they easily do, just limited by thememory controller.

The hardware guys who made chessprograms with near to nochessknowledge, to move ordering and other tricks you soon need 10-20cyclesa node. So a FPGA card at todays 60Mhz or so can deliver at most 6million nps.Now that would be a phenomenal speed if it was my chessprogram whichsearches 200k nps at a 3Ghz core2,yet that isn't the case. Especially for my program getting 6 millionnps in hardware isn't much.

Search is so so inefficient in hardware, what i need to do the lastfew plies with Diep, to do that in hardware is not impossible,but it eats a lot of sequential steps. Each sequential step inhardware is a cycle. Soon you're 40 to 60 cycles to do that.

Even then, you still miss RAM to cache searches. Even a 1024KBhashtable on-chip, would make a hardware chip fly.

This is the big mistake both Donninger and Hsu made.

They delivered both hell of an achievement perhaps in the eye of thehardware layman, they failed algorithmically like total laymen.

Hydra is factor 50 slower clocked on each processor, 30-60Mhz versuscpu's 3 to 4Ghz.Deep Blue was around 30Mhz clocked when cpu's were 300Mhz. Hsu wasfrom hardware viewpoint much better than Chrilly,making his CPU in VLSI, yet he for example even forgot to usekillertables in search. Something real simple and knownsince the 80s to work well. Chrilly did include killertables butforgot to create hashtables, total crucial in todays computer

chess search.

'forgetting' hashtables is really a beginnersmistake. No matter howhard it is to make them.

If you do not know how to make a transpositiontable in hardware,don't make a cpu.

A go/chess chip without transpositiontable is just a marketing chip.

It has no significance from technical viewpoint seen; it doesn'tspeedup the search, using its nps as a marketing instrument is

about the only thing it gives.

Todays cpu's you buy in the store, also are as good as its L1, L2 andeven L3 caches are.

The funny thing about Hsu is that where he is a hardware geniusgetting a chessprogram to work on a chip already in the 80s,he just cares about search depth in his thesis. Deep Blue however wasa total marketing machine. Some oldie RS6000 technology,combined with a hardware chess cpu that got 10/11 ply in opening andlater in game 12 ply. The machine had a theoretic capabilityof searching half a billion nodes per second or so. The funny thingis that just using nullmove, a technique well known to be a winnertechnique by 1995, as Frans Morsch told everyone it kicked butt forhim and won him the world title (and he deserves credits for that),

Deep Blue 1997 still isn't using nullmove.

At a 1024 processor supercomputer, using the biggest partition of 512processors, i did do several runs of Diep,without using last few plies a hashtable. That really hurts,especially when running parallel. The more cores and latency problemsyou have communicating between the different search processes, themore difficult it is to achieve a good branching factor.

Move ordering in software is real easy to do whereas in hardware andat GPU's it is real real tough.

Hydra therefore lose a factor 10 or so of there speed the last fewplies. What Deep Blue lost there we cannot even estimate.

Both Hydra and Deep Blue are hard to see as hardware chessprograms.They aren't doing entire search in hardware.

Their hardware concept is so ugly bad from branching factor viewpoint,

that both programmers decided to get the search into software as soonas possible,

doing the utmost minimum of search in hardware.

Hydra is doing mostly 2 to 3 ply searches in hardware, Deep Blue wasdoing 4 ply in hardware. Basically we must see thecpu's as 'coprocessors' therefore, which only for like 1 year givesome small boost to what is then the latest CPU.

I see Hydra as a cluster of 64 processors 2.8Ghz P4 which was strongend of 2004, start 2005.Even then, a 64 processor cluster still is good hardware. I'd behappy with a 64 processor cluster.


It achieved about 16-20 ply searches in the events i played it.

Todays programs at 4 cores, if i look to the world top, are alsogetting worst case about 18-20 ply first few moves out of book,quite a lot more after that (hydra doesn't scale there nor did deepblue, showing their hashtable weakness to full extend).

Knowing it forward prunes in hardware last few plies and is doingsomething dubious in software search,a hardware cpu of course only makes sense to produce when you clockit to 300Mhz, have 32 or more cores,and have for each core its own hashtable, at least 512KB. So you doeverything at 1 cpu.

Additionally you need more chessknowledge than todays fast programs;they objectively play real real passive chess,which wins for them because playing agressive chess (and IMHOattacking is objectively the best way to play chess),

requires a lot of chessknowledge.

Such a cpu will need at least 30 cycles to implement all algorithmicknowledge and sequential enhancements,

as well as that huge evaluation function.

So the entire search would be inside 1 cpu, with as most importantfeature of the chip having a hashtable in hardwareof at least 512KB. That gives a 300 million nps monster. It would bea real big CPU by the way and only using 1 makessense. Putting several of them in a cluster doesn't make sense. Youwant the entire search in hardware.If you're not doing that, then it is better to not make a cpu at all,except from marketing viewpoint.

Chess and also Go at those search speeds are just too dependant uponhashtable to not use it. My experiments using 460 processorsand giving each processor just 1MB of hashtable indicated that evensuch a tiny hashtable still performed very well. In fact it just lost1 ply over using a 200 fold bigger hashtable, this at a singleposition of 10 hours in total.


Would a go-cpu also be useful not using a transpositiontable?
The answer is NO.

Is it complicated to make such a hardware go-cpu?
The answer is YES.

A SHARED transpositiontable (shared over all cpu's) in hardware has 3important effects when having a lot of cores

searching of a hardware cpu (say 4+):

a) you avoid really a lot of futile parallel searches to getaccomplished, with tens of processors or even more,

     this is really important

b) transposition cutoffs give a branching factor improvement givingan exponential speedupc) storing the best move and using this in the search really improvesthe move ordering a lot,in fact it cloacks the bad move ordering that a hardware designhas a lot; it gives a huge branchingfactor improvement, far more than we get in software from itand therefore a huge exponential speedup,

Just having a transpositiontable of 512KB to 1MB in a hardware cpu,so ON chip, gives a hardware chess/go-cpu

nearly the same branching factor like you can achieve in software.

AFAIK no one ever has achieved this so far.

You can argue that hydra gets 220 million nps or so. Deep Blue onpaper got a 100 million nps or so, but that's justpaper. At least from Chrilly we know he's not lying too much abouthis nps, maybe.

With Deep Blue i've heard too many numbers and their search

depth was just too little to realistically put more than just a fewpercent of all 480 cpu's to work;if you get 10 ply in total from which 4 is hardware, you have tosplit at a depth of 6 ply all cpu's,it is very tough to get 480 hardware chips to work there. The "goodbranching factor", of just above 4, from deep bluemoving from 10 to 12 ply, is just because it can put more and moreprocessors to work at 11 and 12 ply.

Yet you can argue that Hydras 220 mln nps got used ugly bad.DeepSjeng at 4 million nps is getting the same searchdepth at a 2.4Ghz quadcore (overclocked a tad to nearly 3Ghz). I knowDeepSjeng's search is comparable in dubiousity to Hydra's.


So somewhere Hydra loses a factor 50 search efficiency.
Note that this is not uncommon.
Deep Blue lost way more than that.

That is not a surprise to me. Making a program run well on acombination of different architectures (hardware fpga cpu and at acluster),is very difficult. He had to do it all himself. And as usual when youdo not visit ICGA events and do not talk in restaurants/pubs with otherchessprogrammers at the many other computer chess events, you soonare not up to date with the latest search algorithms and tricks.

Of course once you choose for a certain setup, knowing it takes ayear or 5 to mature the hardware and software, the scene has probablychanged algorithmically bigtime by then. It is not easy then to justchange the entire search concept i guess.

However all this is not so relevant. We will not see Hydra in anyevent because of the second marketing principle:"If you cannot be first in a category, then create a new categorywhere you can be first"

        (Al Ries & Jack Trout)

The category where Chrilly and the sheikh picked to be first was:"the unbeatable chess machine".That means of course they can join never an event where otherchessprogrammers have good hardware.The risk of losing a number of games and not winning the title in an'unbeatable manner' is too big there.

In fact i'd argue it is 99%, if i take the ceiling.
That type of event is called world championship.

Sometimes people show up at supercomputers, usually the programmersfrom big nations have some great high clocked machine,

and nearly all programs are in big shape and kick butt at such event.

So the odds of winning that title is not so big for a chess machinethat gets outsearched by other opponents and has less chessknowledgethan them.

Computer go right now is at a phase, where the programmers who slowlylearned how to make an evaluation function, now get kicked buttby better search algorithms. Also not too many cores get thrown intoaction yet.

So i'd argue that the time is there that some sort of hardware/software professional should show up in computer-go,paid by some rich sponsor who wants to have a shot at being the firstbeating a strong professional player.


I'd argue it is best to talk to Chrilly for that.

For such a sponsor to get a kick butt go-machine accomplished,Chrilly is the best choice by far. Algorithmically he's good enoughto not bethat much behind the best and he has learned what to not do inhardware. Maybe next machine doesn't lose a factor 50 somewhere.

From my viewpoint by far most important advantage of Chrilly is:he's not keeping his mouth shut about what he algorithmically is doing.For science this is very important. Guys like Bushinsky and Meyer-Kahlen tell nothing about what they do inside their engine.

Chrilly is a honest person there.

Means his go playing machine might kick major butt for a short while.Of course after his machine has a reputation,he'll not show up of course at computer-go events anymore and he'llspend a few years playing different professional goplayers whose go playing capabilities will get dramaticallylobotomized by the bad oversight at the board, caused by the dollarsigns in the eyes.

However when such professionals show up in the ICGA Olympiads, thenyou'll figure out ICGA is a genius organisation.Though all amateurs spit at it, as they see person A get paid andperson B doesn't which they find unfair, meanwhile the ICGA themselvesgrabbing even more (hint: try to figure out what they charge as an'organisation fee'). All that doesn't matter. Everything is negotiableand during the event the ICGA has the one person that no other has.ICGA has Jaap van den Herik as tournament director.He is a genius in getting rivalling commercial professionals at 1table and keep the peace.


Vincent

On Jul 2, 2008, at 9:12 PM, Ian Osgood wrote:

On Jul 2, 2008, at 10:31 AM, Zach Wegner wrote:
On Tue, Jul 1, 2008 at 4:33 PM, Erik van der Werf
<[EMAIL PROTECTED]> wrote:
That's a pretty good deal!!!
http://64.68.157.89/forum/viewtopic.php?topic_view=threads&p=193819&t=21591
Why isn't there any sponsoring like this for the other tournaments?

Erik
They pretty much have to. The ICGA has achieved a rather lousy
reputation in the chess community, and very few participants are
showing up nowadays (11 on the list now). Compare that to the online
tournaments, which always have around 30 or more participants.
Personally I'd like to go and meet some other programmers, but there
are so few. And even after the subsidies it would still be very
expensive...
In my opinion, the size of the ICGA World Computer ChessChampionship event is irrelevant. More importantly, it has theprestige to consistently draw the top candidate programs. Thisyear, past champions Rybka, Junior, Shredder, and HIARCS will becompeting for the title. (The author of Zappa no longer developshis program, so I'm not surprised he dropped out. I don't know whyFritz never attends. Hydra would also be an interesting participantas one of the last custom chess supercomputers.)
By contrast, the ICGA Go events never get top candidate programparticipation, and before this year have had smaller turnouts thanthe chess event. Since the expiration of the Ing Prize, the lastevent of any kind which had such participation was the 2003 GifuChallenge (KCC Igo, Haruka, Go++, Goemate, Many Faces, GNU Go, GoIntellect, Aya, Katsunari). The size of this year's event isencouraging, but where are Go++, Haruka, HandTalk, and GNU Go? Andwhat ever happened to Wulu and GoAhead?
Ian

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] 2008 World 9x9 Computer Go Championship in Taiwan

Reply via email to