This actually uses your CPU to train the network.
It looks like you can solve the spiral case (the only challenging one)
with either a deep network on simple inputs or a shallow network on
more complex inputs.
The shallower networks are much faster, but they don't converge to zero
training loss, while the deep networks do.
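
To make the "simple inputs" versus "more complex inputs" distinction concrete, here is a rough sketch in plain Python. Everything in it is an assumption for illustration: the message doesn't say which features were used, so `make_spiral`, `simple_inputs` and `complex_inputs` are hypothetical names, and the particular feature list (squares, product, sines) is just one plausible choice. It only builds the data and the two input encodings; it doesn't train anything.

```python
import math
import random


def make_spiral(n_per_class=100, noise=0.0, seed=0):
    """Return (points, labels) for a two-arm spiral; labels are +1 / -1.

    Hypothetical generator: the second arm is the first rotated by pi.
    """
    rng = random.Random(seed)
    points, labels = [], []
    for label, phase in ((1, 0.0), (-1, math.pi)):
        for i in range(n_per_class):
            r = i / n_per_class * 5.0                      # radius grows along the arm
            t = 1.75 * i / n_per_class * 2 * math.pi + phase
            x = r * math.sin(t) + rng.uniform(-1, 1) * noise
            y = r * math.cos(t) + rng.uniform(-1, 1) * noise
            points.append((x, y))
            labels.append(label)
    return points, labels


def simple_inputs(x, y):
    """Raw coordinates only; a deeper network has to learn the rest."""
    return [x, y]


def complex_inputs(x, y):
    """Hand-made nonlinear features (assumed set) for a shallower network."""
    return [x, y, x * x, y * y, x * y, math.sin(x), math.sin(y)]


points, labels = make_spiral(n_per_class=100, noise=0.1)
deep_net_rows = [simple_inputs(x, y) for x, y in points]
shallow_net_rows = [complex_inputs(x, y) for x, y in points]
```

The idea is that the shallow network gets part of the nonlinearity handed to it in the inputs, while the deep one has to build it up in its hidden layers.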
Computer-go mailing list