Darren Cook: <4b0c6706.7070...@dcook.org>: >> Also, on 19x19 board, current 16-core cluster version performs almost >> the same as 8-core shared memory pc such as Mac Pro, which Yamato used >> for KGS. > >Hi Hideki, >Is that difference due to a scaling limit of Zen, or is this due to the >cluster overhead? Would moving from gigabit to infiniband help, or is >the limit more to do with the lack of shared memory?
I'm right now evaluating the scaling (:-). The performance gap is perhaps due to the algorithms. Almost all cluster versions of current strong programs (MoGo, MFG, Fuego and Zen) use root parallel while shared memory computers allow us to use thread parallelism, which gives better performance. The main reason, I guess, is that the latter increses the depth of the search tree according to the number of processors (cores) while the former does not. One interesting observed thing of root parallel is that the scaling depends on the time for a move; longer time setting shows better scalability, when the time period to exchange root information is fixed. In other words, each time setting has its best number of nodes. This makes things complicated :(. The scaling limit of Zen is still unknown, though I expected that the playouts of Zen was not so random that it did not scale well, before starting this joint project with Yamato. >>T2K HPC cluster > >This seems to be a cluster specification rather than an actual machine. >Can you tell us more about how many cores you are experimenting with, >and how the programs scale? (Are all your experiments with Zen, or are >you trying to run other programs on a cluster too?) I'm running only Zen on the cluster, though I'd like to run my Fudo Go as well if I have (had?) time. Name: T2K Open Supercomputer (Todai) #Todai is an abbreviation of University of Tokyo in Japenese. Hardware: HITACHI HA8000-tc/RS425 Number of nodes: 952 Number of cores of each node: 16 #I can use up to 64 nodes; 1024 cores in total Processor: AMD Opteron 8356 (quad-core) 2.3 GHz Memory of each node: 32 GB Interconnect: Myricom Myri-10G Operating System: RedHat Enterprise Linux 5 #Flops numbers are omitted. :) http://www.cc.u-tokyo.ac.jp/service/ha8000/intro.html (in Japanese) T2K stands for Tokyo, Tsukuba and Kyoto (T, T, K). See http://www.open-supercomputer.org/ (in English) for the idea of T2K Open Supercomputer. Hideki -- g...@nue.ci.i.u-tokyo.ac.jp (Kato) _______________________________________________ computer-go mailing list computer-go@computer-go.org http://www.computer-go.org/mailman/listinfo/computer-go/