Yamashita san, Go version in AlphaZero 2017 finished the training in 34 hours according to Table S3. And it looks like AlphaZero Symmetries in AlphaZero 2018 finished the training in the same time according to Figure S1. So I think that the authors had adopted AlphaZero Symmetries in 2017 paper by mistake and retried the experiment again in 2018 paper. In order to compensate symmetries with real self-plays, they generated 8 times more games and reduced positions per game to 1/8. It is just my guess^^ - ICHIKAWA Yuji
> 2019/03/29 10:11、Hiroshi Yamashita <y...@bd.mbn.or.jp>のメール: > > Hi, > > Number of learned positions from a game record > > pos steps minibatch games > AlphaGoZero 293 ( 700,000 * 2048) / 4,900,000 3 > days > AlphaGoZero 219 (3,100,000 * 2048) / 29,000,000 256 x 40 block, 40 > days > AlphaZero 2017 137 ( 700,000 * 4096) / 21,000,000 > AlphaZero 2018 20 ( 700,000 * 4096) / 140,000,000 > ELF 2019 154 (1,500,000 * 2048) / 20,000,000 > AlphaZero(Chess) 65 ( 700,000 * 4096) / 44,000,000 > AlphaZero(Shogi) 119 ( 700,000 * 4096) / 24,000,000 > > All Network is 256 x 20 blocks, except AlphaGoZero 40 days. > > Average of game moves are > Go 220 > Chess 80 > Shogi 120 > > So I had thought learning all positions(from a game) once is nice. > But AlphaZero2018 uses only 20 positions from a game. > > > By the way, I did not received any mails since Ingo's mail(Mar 1 2019). > > Erik reported in Feb 17 2019, >> It looks like gmail is broken again for this list. I never got Remi's > > Remi also reported in Mar 24 2019. (I found this from archives.) >> I have just found out that the list is not sending emails to my free.fr > > Thanks, > Hiroshi Yamashita > _______________________________________________ > Computer-go mailing list > Computer-go@computer-go.org > http://computer-go.org/mailman/listinfo/computer-go _______________________________________________ Computer-go mailing list Computer-go@computer-go.org http://computer-go.org/mailman/listinfo/computer-go