I took lessons from a Korean pro, Myungwan Kim; one of which was an analysis of
choices made in yose, which was solidly based upon a probabilistic model of
expected gains. Don't underestimate the mathematical skills of Go
professionals.
Terry McIntyre <[email protected]>
Unix/Linux Systems Administration
Taking time to do it right saves having to do it twice.
>________________________________
> From: Don Dailey <[email protected]>
>To: [email protected]
>Sent: Thursday, January 19, 2012 11:23 AM
>Subject: Re: [Computer-go] Computer Go and EGC 2012
>
>
>
>
>On Thu, Jan 19, 2012 at 10:27 AM, John Tromp <[email protected]> wrote:
>
>On Thu, Jan 19, 2012 at 9:59 AM, "Ingo Althöfer" <[email protected]> wrote:
>>
>>> What I mean is the following:
>>> The human player (in 19x19) determines how many handicap stones he gives
>>> to the bot (either 2 or 3 or 4 or 5).
>>>
>>> When he selects a difficult task (=giving high handicap) he gets high
>>> reward in case of a win. When he selects a more easy task (= giving low
>>> handicap) he gets only small reward in case of success. So, the list
>>> from the original posting (300 Euro bonus in case of a win at 5 handicap
>>> stones) is what I mean.
>>>
>>> By giving him this choice under the reward system, I want to learn
>>> what this special person thinks about his/her chances.
>>
>>Considering that the smallest handicap at which computers have beaten pros
>>is 5 or 6, few pros would want the embarrassment of choosing less.
>>Being the first pro to lose a 4 handicap game is not a particularly
>>attractive prospect:-(
>>
>>I expect they will blindly go with the 6 handicap option, mostly due
>>to professional pride,
>>but also because of the higher reward and because there is less embarrassment
>>(even if a higher chance of) losing at the higher handicap.
>>
>
>
>
>
>Of course this can be influenced by the incentive, right? I think if a
>pro were offered a million dollars to beat the computer with a handicap that
>only gives him a 10% chance of winning he would take it over a sure win that
>only returned $50.
>
>
>The expectation of winning for each stone handicap should be computed and the
>incentive for the human should be chosen to make it more attractive to play
>the higher handicap matches. This should be clearly explained to the pro
>before proceeding so that he understands that the higher stone handicap is a
>better bargain but the lower handicap is a more certain payoff.
>
>
>Unfortunately I think you point still holds unless the pro thinks like a
>scientist or mathematician. There is a factor called "risk aversion" which
>is a psychological concept and is different for each person. It says that
>people tend to reject a clear bargain if it has a less certain likelihood of
>payoff. In this case their investment is not monetary but in the form
>of embarrassment and pain, thus they are likely to be extremely "risk averse"
>as you intuit here.
>
>
>But I have a solution that should change the players
>thinking significantly and cause him to make a slightly better decision.
>The first step of course is to make it a bargain to play higher handicap
>matches. The SECOND stage of the solution is to offer him different
>incentives for losing. You can make this a "game" in which his risk
>aversiveness (is that word?) is a minor factor if he always comes away with
>something. He should get more money for losing a high handicap match but
>still not enough that it is better to lose a high handicap match than win a
>low handicap match.
>
>
>Here is an example which may say something about your own risk averseness for
>anyone who has not heard of this concept. You could also add an additional
>zero the monetary amounts in this example to see if you would change as this
>increases the risk:
>
>
>Which would you pay $10 for - if given only these 2 choices? :
>
>
> 1. 1 out 2 chance of getting $100.00
>
>
> 2. 1 out of 2000 chance of getting $100,000
>
>
>Both would be a bargain, because each are worth exactly $50 and you are asked
>only to pay $10. Some people are so risk averse they would not pay $10
>for a 50/50 chance of winning $100.00 simply because they fear losing $10.
>However most people would immediately see this a bargain or opportunity.
>
>
>If you choose option 1 you have an excellent chance (50/50) of going home $90
>richer. If you choose option 2 you will almost certainly go home $10 poorer,
>but if you win you win really big. Many people don't like the idea of
>taking a chance and losing $10 (this depends of course on their personality
>and their income) and are likely to choose option 1. Many would see
>option 2 as stupidity, a way to give away $10 even though it's actually a
>bargain.
>
>
>The fact is however that both these options have equal value, an expectancy of
>$50. In other words if you played this game every day of your life you
>could expect to average about $40 per day on average to have a $40 a day
>extra income on average or about 1 million dollars over a 70 year lifetime.
> The second option is more granular and you might end up with substantially
>more than 1 million dollars or substantially less than 1 million after your 70
>years. Which would you choose if you had to play this game every day?
>Would you go for the very steady stream of income or the big 100,000 payday
>which you might only see a few times in your lifetime? The second choice
>gives you a chance to do much better but risks doing much worse over a
>lifetime.
>
>
>Don
>_______________________________________________
>Computer-go mailing list
>[email protected]
>http://dvandva.org/cgi-bin/mailman/listinfo/computer-go
>
>
_______________________________________________
Computer-go mailing list
[email protected]
http://dvandva.org/cgi-bin/mailman/listinfo/computer-go