Re: [Computer-go] AlphaGo & DCNN: Handling long-range dependency

Robert Jasiek Fri, 11 Mar 2016 00:33:51 -0800

On 11.03.2016 08:24, Huazuo Gao wrote:

Points at the center of the board indeed depend on the full board, but
points near the edge do not.

I have been wondering why AlphaGo could improve so much between the Fan Hui and Lee Sedol matches, including learning sente and showing greater signs of more global, more long-term planning. A rumour so far suggests that the time was used for more learning, but I'd be surprised if this had sufficed. So far, I have the following theories:


- deeper net

- larger parameters for convolutional patterns (instead of 5x5 and 3x3, (also) use larger parameters), or combine the earlier parameters with additional larger parameters or with an additional NN having only / mostly larger parameters

- replace or enhance top KGS games by 100,000+ pro games

- instead of / in addition to feed forward nets, use long short term memory nets (but I cannot know if this is advantageous considering presumably greater GPU time)

- instead of single position patterns, use combinations of current position and later positions, for different (dynamic) parameters of time shift, so as to model long-term effects
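
To make the first two theories and the quoted remark about centre versus edge points concrete, here is a rough sketch of how the receptive field of a stack of stride-1 convolutions grows with depth and kernel size. The layer stacks are assumptions chosen for illustration only (one 5x5 layer followed by eleven 3x3 layers, and a hypothetical larger variant); they are not claimed to be what either AlphaGo version actually used, and the function names are made up for this example.

```python
def receptive_field(kernel_sizes):
    """Side length of the input window that one output point can see,
    for a stack of stride-1, undilated convolutions with odd kernel sizes."""
    rf = 1
    for k in kernel_sizes:
        rf += k - 1  # each layer widens the window by (k - 1)
    return rf


def sees_whole_board(row, col, rf, board_size=19):
    """True if the window of side rf centred on (row, col) covers every
    point of the board. Coordinates are 0-based."""
    radius = (rf - 1) // 2
    farthest = max(row, board_size - 1 - row, col, board_size - 1 - col)
    return radius >= farthest


# Assumed stacks, for illustration only.
ASSUMED_STACK = [5] + [3] * 11   # one 5x5 layer, then eleven 3x3 layers
LARGER_STACK = [7] + [5] * 11    # hypothetical variant with larger kernels

if __name__ == "__main__":
    for name, stack in (("assumed", ASSUMED_STACK), ("larger", LARGER_STACK)):
        rf = receptive_field(stack)
        print(name, "receptive field:", rf)
        print("  centre point sees whole board:", sees_whole_board(9, 9, rf))
        print("  corner point sees whole board:", sees_whole_board(0, 0, rf))
```

Under these assumptions the first stack has a 27x27 window: enough for the centre point (9 lines from each edge) to see the whole 19x19 board, but not for a corner point, which would need a 37x37 window. Either more layers or larger kernels closes that gap, which is all the sketch is meant to show.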


--
robert jasiek
_______________________________________________
Computer-go mailing list
Computer-go@computer-go.org
http://computer-go.org/mailman/listinfo/computer-go
