On 3/17/15, David Silver <davidstarsil...@gmail.com> wrote:
> Reinforcement learning is different to unsupervised learning. We used
> reinforcement learning to train the Atari games. Also we published a more
> recent paper (www.nature.com/articles/nature14236) that applied the same
> network to 50 different Atari games (achieving human level in around half).

Omg, the Atari paper is an awesome paper.  Cant believe I skipped over
it the first time.  I guess I was like "Oh, it's not Go, skip that one
for now :-)" :-D

It's really amazing, it's exactly what I was hoping someone could
achieve.  Well... "exactly"... I suppose "exactly" would mean, could
learn to play http://springrts.com :-)  Perhaps, "conceptually" is a
better word.  My idea was to just give the computer generic,
unlabelled, arrays as input, representing the map and stuff; and a set
of generic, unlabelled buttons as output, ie representing 'up' 'down',
etc.  But I had no idea how to train it :-)  And now, someone has come
up with a way to train such a device :-)
_______________________________________________
Computer-go mailing list
Computer-go@computer-go.org
http://computer-go.org/mailman/listinfo/computer-go

Reply via email to