AW: [SAtalk] Dump bayes db please explain the columns

mrv Mon, 19 Jan 2004 04:16:47 -0800

hi,

>-----Original Message-----
>From: [EMAIL PROTECTED]
>Sent: Monday, 19 January 2004 11:20
>
>I ran into the same problem, and was unable to find any
>documentation ... but
>here is my guess of what the columns mean:
>
>sa-learn --dump data | sort -n > /tmp/asdf
>
>I sorted the output for a reason:
>
>0.995         10          0 1074305205  U*p6618-qp2sam
>0.995         10          0 1074305205  sk:p6618-q
>0.995         10          0 1074308138  270
>0.995         10          0 1074308138  avoiding
>0.995         10          0 1074308138  elsewhere
>0.995         10          0 1074310744  Forfeiture
>0.995         10          0 1074310744  Notify
>0.995         10          0 1074310744  g2.gif
>
>1st: low means hammy, high means spammy
>2nd: roughly the number of times that particular token
>     was learnt as spam
>3rd: roughly the number of times that particular token
>     was learnt as ham
>4th: timestamp in seconds since 1970 (the Unix tradition
>     of measuring time as seconds since 1 January 1970)
>5th: the token itself.
>
>Disclaimer: I'm not a developer ...
>
>Pedro
>
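
The column guesses quoted above can be written down as a small Python sketch. This is only an illustration under those guesses: the field names and the `parse_dump_line` helper are made up for this example and are not part of sa-learn.

```python
from datetime import datetime, timezone
from typing import NamedTuple


class TokenRow(NamedTuple):
    """One line of the dump; field names are hypothetical."""
    probability: float   # 1st column: low = hammy, high = spammy
    spam_count: int      # 2nd column: guessed count learnt as spam
    ham_count: int       # 3rd column: guessed count learnt as ham
    last_seen: datetime  # 4th column: seconds since 1970, shown as UTC
    token: str           # 5th column: the token itself


def parse_dump_line(line: str) -> TokenRow:
    # Split on whitespace at most 4 times so the token is kept whole.
    prob, nspam, nham, stamp, token = line.split(None, 4)
    return TokenRow(
        float(prob),
        int(nspam),
        int(nham),
        datetime.fromtimestamp(int(stamp), tz=timezone.utc),
        token.rstrip(),
    )


row = parse_dump_line("0.995         10          0 1074310744  Notify")
print(row.last_seen.isoformat(), row.token)
```

Converting the 4th column this way makes it easy to check that the timestamps fall in mid-January 2004, around the time the dump was taken.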


I agree with you. I would call the 2nd and 3rd columns 'multipliers'.

If a token is matched as spam
(e.g.: 0.995 10  0 1074310744  Notify)

then I would have to multiply the value in the 2nd column (10) by 0.995
and add the result to the running Bayes total.

If the token is matched as ham, I would multiply by the 3rd column
instead and subtract the result from the Bayes total.
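
To make the idea concrete, here is a minimal Python sketch of the scheme as guessed in this message. It is one reading of that guess written out, not a confirmed description of how SpamAssassin combines tokens; `guessed_contribution` is a hypothetical helper.

```python
def guessed_contribution(probability: float, spam_count: int,
                         ham_count: int, matched_as_spam: bool) -> float:
    """Contribution of one token under the guessed 'multiplier' scheme."""
    if matched_as_spam:
        # Spam match: 1st column times 2nd column, added to the total.
        return probability * spam_count
    # Ham match: 1st column times 3rd column, subtracted from the total.
    return -(probability * ham_count)


# Row from the dump: 0.995 10 0 1074310744 Notify
total = 0.0
total += guessed_contribution(0.995, 10, 0, matched_as_spam=True)
print(total)
```

Under this reading, the example row would add roughly 9.95 to the total, while a row with a non-zero 3rd column would lower it when matched as ham.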

Do you see what I mean?

I hope it works the way I have described,

and that I can add or remove tokens if I wish.

regards
Andrew




