Bart Schaefer <[EMAIL PROTECTED]> writes:

> score: ASCII_FORM_ENTRY 0.036 -> -1.660
> score: BUGZILLA_BUG -2.000 -> 0.921

BUGZILLA_BUG obviously needs to be fixed.  Maybe an eval would be best.

> score: DATE_MISSING 0.248 -> -2.140
> score: EXCUSE_16 1.345 -> -0.721
> score: FORGED_HOTMAIL_RCVD 0.530 -> -0.356
> score: FROM_AND_TO_SAME 0.877 -> -2.071
> score: FROM_NAME_NO_SPACES 0.500 -> -0.114
> score: GREEN_EXCUSE_1 3.116 -> -2.019
> score: INTL_EXEC_GUILD 0.781 -> -0.039
> score: MONEY_BACK 1.489 -> -0.239
> score: MONEY_MAKING 2.490 -> -0.687
> score: MSGID_CHARS_WEIRD 1.500 -> -2.178
> score: NO_REAL_NAME 0.632 -> -1.068
> score: X_NOT_PRESENT 0.500 -> -1.920

I'm removing X_NOT_PRESENT and MSGID_CHARS_WEIRD from both trees.  I
think FROM_NAME_NO_SPACES is a good rule that needs more tweaking.

Several are probably oddbads because they have low frequency.  I have
zero hits for DATE_MISSING.  Some others look recoverable.

Rule               spam good
ASCII_FORM_ENTRY     42    8
BUGZILLA_BUG          0  178
DATE_MISSING          0    0
EXCUSE_16           102    7
FORGED_HOTMAIL_RCVD 124    2
FROM_AND_TO_SAME     94  125
FROM_NAME_NO_SPACES 471   87
GREEN_EXCUSE_1        1    0
INTL_EXEC_GUILD       0    0
MONEY_BACK           41    0
MONEY_MAKING         19    0
MSGID_CHARS_WEIRD    11    1
NO_REAL_NAME        875  789
X_NOT_PRESENT       601  614

> How did these get exactly 1.0?  Not represented in the corpus at all?
> 
> score: FORGED_RCVD_TRAIL absent -> 1.000
> score: FROM_ADDRESS_EQ_REAL absent -> 1.000
> score: TO_ADDRESS_EQ_REAL absent -> 1.000

You're looking at HEAD.  These are new rules I added last night.

Dan

_______________________________________________________________

Don't miss the 2002 Sprint PCS Application Developer's Conference
August 25-28 in Las Vegas - 
http://devcon.sprintpcs.com/adp/index.cfm?source=osdntextlink

_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to