
This game is a great counterexample for the people who claimed that AlphaGo wasn't making mistakes when it reduced its margin from a winning position, on the grounds that it only looks at winning probability.

AlphaGo made a mistake, realized it was behind, and crumbled: once all moves are "mistakes" (they all lead to a loss), any of them looks as good as any other.

I'm very surprised and glad to see humans still have something against AlphaGo, but ultimately, this kind of error might disappear if AlphaGo trains for six more months. It made a tactical mistake, not a theoretical one.



That doesn't make sense to me. Even if the objective function is win probability, it's used to order all potential moves. Thus, given a menu of bad options, it should choose the least-bad one, not start choosing at random.

I think there's something more subtle going on.


AlphaGo only has an approximation of win probability, and to be more precise, of "win probability playing against itself". That works well in an even match against humans, but when it is far behind, AlphaGo's win probability estimate is not very good.
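A minimal sketch of the point above (all numbers and the noise model are hypothetical, just for illustration): if the value estimate carries approximation error, ordering moves by estimated win probability works fine when the moves genuinely differ, but once every move's true win probability is near zero, the differences are smaller than the estimation error and the argmax is effectively noise.

```python
import random

random.seed(0)

def estimated_win_prob(true_prob, noise=0.02):
    # Hypothetical value estimate: the true win probability plus
    # approximation error (e.g. from training only on self-play positions).
    return true_prob + random.uniform(-noise, noise)

# Even position: candidate moves differ by much more than the noise,
# so the noisy ordering almost never changes.
even_moves = {"a": 0.55, "b": 0.45, "c": 0.35}

# Lost position: every move is a near-certain loss; the true differences
# are smaller than the estimation error.
lost_moves = {"a": 0.020, "b": 0.015, "c": 0.010}

def pick(moves):
    # Choose the move with the highest *estimated* win probability.
    return max(moves, key=lambda m: estimated_win_prob(moves[m]))

even_choices = {pick(even_moves) for _ in range(100)}
lost_choices = {pick(lost_moves) for _ in range(100)}
```

In the even position the same move wins every time; in the lost position the choice flips between moves from trial to trial, which is consistent with the "all moves look equally bad" behaviour described upthread.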



