hi, >-----Ursprüngliche Nachricht----- >Von: [EMAIL PROTECTED] >Gesendet: Montag, 19. Jänner 2004 11:20 > >I ran into the same problem, and was unable to find any >documentation ... but >here is my guess of what the columns mean: > >sa-learn --dump data | sort -n > /tmp/asdf > >I sorted the output for a reason: > >0.995 10 0 1074305205 U*p6618-qp2sam >0.995 10 0 1074305205 sk:p6618-q >0.995 10 0 1074308138 270 >0.995 10 0 1074308138 avoiding >0.995 10 0 1074308138 elsewhere >0.995 10 0 1074310744 Forfeiture >0.995 10 0 1074310744 Notify >0.995 10 0 1074310744 g2.gif > >1st: low equals hammy, high equals spammy >2nd: roughly equal # of occurrence of that particular token > learnt as spam >3rd: roughly equal # of occurrence of that particular token > learnt as ham >4th: # of seconds since 1970 ... (a Unix tradition of > measuring time in # of seconds since 1970) >5th: the token itself. > >Disclaimer: I'm not a developer ... > >Pedro >
i agree with you, i would call the 2nd and 3rd columns 'multiplier'. if a token is matched as spam (e.g.: 0.995 10 0 1074310744 Notify) then i have to multiply the number of the 2nd column 10 with 0.995 and add it to the summary of bayes value. if the token is matched as ham then i multiply it again with third column but i decrease it from the bayes value. do you know what i mean? i hope it will be this way as i mentioned it. AND that i can add / remove tokens if i wish. regards Andrew ------------------------------------------------------- The SF.Net email is sponsored by EclipseCon 2004 Premiere Conference on Open Tools Development and Integration See the breadth of Eclipse activity. February 3-5 in Anaheim, CA. http://www.eclipsecon.org/osdn _______________________________________________ Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk