[Computer-go] AlphaGo's Endgame Mistakes

Robert Jasiek Sat, 19 Aug 2017 04:22:40 -0700

Reading Invisible, it is apparent that AlphaGo makes score-related mistakes in the endgame, ko fights or virtual ko fights (read: wasting ko threats) occurring during the early endgame if AlphaGo wins nevertheless. So we cannot say yet that they would be win-related (or winning-probability-related) mistakes. AlphaGo plays better endgame if it needs to. The score-related mistakes are easily explained in terms of traditional human go theory or more clearly in terms of formal go theory using the score-related view (larger score is better than smaller score in perfect play with perfect information).

So far, it seems unknown whether AlphaGo might also make some of those mistakes when its win is still unclear (winning probability near 50%).

Improving AlphaGo's play with respect to the score-related mistakes seems straightforward: first create moves as currently, then dynamically iterate komi increments for specific positions during the games and create a second instance of AlphaGo modified due to its improved play with tougher komi.
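
The komi iteration can be sketched roughly as follows. This is only an illustration under assumptions, not AlphaGo's actual method: the evaluator interface (move, komi) -> winning probability, the function name toughest_komi_move, the 0.9 threshold, the 1-point step and the toy position are all hypothetical. It covers only the per-position komi search, not the creation of a second instance.

```python
import math
from typing import Callable, Sequence, Tuple

# Hypothetical evaluator type: (move, komi) -> estimated winning probability.
Evaluator = Callable[[str, float], float]


def toughest_komi_move(
    moves: Sequence[str],
    evaluate: Evaluator,
    base_komi: float = 7.5,
    step: float = 1.0,
    max_extra: float = 20.0,
    threshold: float = 0.9,
) -> Tuple[str, float]:
    """Raise komi against the player while some move still wins clearly.

    Returns the move chosen under the toughest komi that keeps the best
    winning probability at or above `threshold`, together with that komi.
    """

    def best(komi: float) -> str:
        # On ties, max() keeps the first move in `moves`.
        return max(moves, key=lambda m: evaluate(m, komi))

    komi = base_komi
    move = best(komi)  # "create moves as currently"
    if evaluate(move, komi) < threshold:
        return move, komi  # win still unclear: do not toughen komi
    while komi + step <= base_komi + max_extra:
        candidate = best(komi + step)
        if evaluate(candidate, komi + step) < threshold:
            break  # tougher komi would make the win unclear
        komi += step
        move = candidate
    return move, komi


# Toy position (made-up numbers): score lead on the board after each move.
LEADS = {"safe_small": 15.5, "larger_endgame": 17.5}


def toy_evaluate(move: str, komi: float) -> float:
    """Toy evaluator: logistic in the margin, saturating at 0.99."""
    margin = LEADS[move] - komi
    return min(0.99, 1.0 / (1.0 + math.exp(-margin)))


if __name__ == "__main__":
    print(toughest_komi_move(list(LEADS), toy_evaluate))
```

In this toy position both moves look equally good at komi 7.5 because the evaluator saturates, so a purely win-related choice can settle on the smaller move. With the komi raised step by step, the sketch ends on the larger endgame move at komi 14.5.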


--
robert jasiek
_______________________________________________
Computer-go mailing list
Computer-go@computer-go.org
http://computer-go.org/mailman/listinfo/computer-go
