Yesterday I noticed an odd phenomena in Valkyria wich was caused by high selectivity and AMAF.

In this position

(;GM[1]FF[4]SZ[9]AP[SmartGo:1.4]
KM[7.5]
;B[ee];W[de];B[ed];W[df];B[ef];W[dd];B[dg];W[ge];B[dc];W[cc];B[cd];W[bd]
;B[cb];W[ce];B[bc];W[bb];B[cd];W[cg];B[cc];W[bf];B[gd];W[hd];B[hf];W[gc]
;B[gf];W[fd];B[fb];W[gb];B[fc];W[he];B[fe];W[ib];B[dh];W[ch];B[ad];W[be]
;B[di];W[ci];B[ha];W[ga];B[ie];W[id];B[if];W[fa];B[gd];W[hb];B[eg];W[eb]
;B[ec];W[da];B[ca];W[ae];B[ic])
there are only two possible moves 1) capturing the last black stone played or 2) capture the ko. Only capturing the ko wins.

In this position valkyria will first search 1) because capturing the last stone is urgent. But the search locks into to that move only because there are a strong bias against move 2) in the AMAF evaluation for Valkyria. I guess what happens with AMAF is that alternative local moves (local relative to the first move in a repeated sequence) will always be biased downwards. This is so because playing the alternative local moves after the first one is played is often inefficient because of a duplication of effort.

Then since there is nothing in the AMAF scores that indicate that move 2 is any good it is never searched, since the search is so selective that the bounds will not grow large to compensate for the bias in a reasonable time.

I do not expect your programs to have the same problem in this particular position. But the problem could be general and I am curious if you have solutions for it if you do.

I did implement a crude solution that works in this position and did not make Valkyria lose rating overnight.

I added
IF (#TotVisits > 500) AND (#Visits > 0.9*#TotVisits) THEN
  uctQ := 0.5*uctQ;

before computing
Q := beta*virQ + (1-beta)*uctQ;
the effect of this is simply that as soon as a move has been played more than 90% of the time after at least 500 moves has been played in the position then other moves has to be played. This has to the drawback that the search gets slightly inefficient when it finds forced moves. One reason I have this problem may be that I bias the Q value towards local shapes after it has been computed. I should perhaps put weight on high priority patterns by adjusting the prior value of the AMAF instead. I realized this when I read the latest easy-to-read paper from the MOGO team and I will test that as well.

-Magnus
_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Reply via email to