I was writing code along those lines when AlphaGo debuted. When it became clear that AlphaGo had succeeded, then I ceased work.
So I don’t know whether this strategy will succeed, but the theoretical merits were good enough to encourage me. Best of luck, Brian From: Computer-go [mailto:computer-go-boun...@computer-go.org] On Behalf Of Bo Peng Sent: Tuesday, January 10, 2017 5:25 PM To: computer-go@computer-go.org Subject: [Computer-go] Training the value network (a possibly more efficient approach) Hi everyone. It occurs to me there might be a more efficient method to train the value network directly (without using the policy network). You are welcome to check my method: http://withablink.com/GoValueFunction.pdf Let me know if there is any silly mistakes :)
_______________________________________________ Computer-go mailing list Computer-go@computer-go.org http://computer-go.org/mailman/listinfo/computer-go