Re: lottery message scored hammy by bayes

John Hardin Tue, 25 Aug 2009 18:13:20 -0700

On Tue, 25 Aug 2009, Dennis German wrote:

email with this content:


CONGRATULATION YOUR EMAIL ADDRESS HAS WON YOU THE 2010 FIFA WORLDCUP LOTTERY
OPEN THE ATTACHMENT AND VIEW THE PROFILE OF YOUR WINNING FUND, ALSO CONTACT
YOUR CLAIM AGENT

received these scores

X-Spam-testscores: BAYES_00=-2.599,HTML_MESSAGE=0.001,MISSING_HEADERS=5.7,
   SUBJ_ALL_CAPS=3.1,UPPERCASE_75_100=1.528

Does this indicate that bayes needs tuning/learning?


Can you paste the output from "sa-learn --dump magic" ?
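
For anyone reading that dump, the counters of interest are nspam and nham. The Python sketch below pulls them out of the text; it assumes the usual layout in which each "non-token data" line carries its value in the third column, and the sample text and its numbers are invented for illustration, not taken from the poster's system.

```python
def parse_dump_magic(text):
    """Return {label: value} for each 'non-token data' line of the dump."""
    marker = "non-token data:"
    values = {}
    for line in text.splitlines():
        if marker not in line:
            continue
        columns, label = line.split(marker, 1)
        fields = columns.split()
        if len(fields) < 3:
            continue  # not the layout this sketch assumes; skip the line
        values[label.strip()] = int(fields[2])
    return values


# Invented sample, shaped like typical `sa-learn --dump magic` output.
SAMPLE = """\
0.000          0          3          0  non-token data: bayes db version
0.000          0       1200          0  non-token data: nspam
0.000          0      45000          0  non-token data: nham
0.000          0     130000          0  non-token data: ntokens
"""

stats = parse_dump_magic(SAMPLE)
print("spam learned:", stats["nspam"], "ham learned:", stats["nham"])
```

A ham count that dwarfs the spam count, as in the invented sample, would be consistent with spam being learned as ham, though it proves nothing on its own.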

It probably indicates that Bayes has been mistrained - somebody is training spammy messages as ham.

How do you do your Bayes training? Autolearning, or purely manual, or some combination?

How many messages are getting inappropriate Bayes scores? If a lot are, you'll probably want to turn off autolearning (if you're using it) until you analyze the problem. You may need to wipe your Bayes database and start fresh if the problem is bad enough.


If you're using autolearning, what are your learning thresholds?
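
The settings in question are bayes_auto_learn, bayes_auto_learn_threshold_nonspam and bayes_auto_learn_threshold_spam, whose stock defaults are 1, 0.1 and 12.0. As a rough Python sketch, the effective values can be read out of a config file's text like this; the config fragment shown is hypothetical, and the parser handles only simple "name value" lines, not includes or conditionals.

```python
# Stock defaults for the autolearn settings.
DEFAULTS = {
    "bayes_auto_learn": "1",
    "bayes_auto_learn_threshold_nonspam": "0.1",
    "bayes_auto_learn_threshold_spam": "12.0",
}


def autolearn_settings(config_text):
    """Return the autolearn settings, letting later lines override earlier ones."""
    settings = dict(DEFAULTS)
    for raw in config_text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments
        parts = line.split()
        if len(parts) >= 2 and parts[0] in settings:
            settings[parts[0]] = parts[1]
    return settings


# Hypothetical local.cf fragment, for illustration only.
EXAMPLE_CF = """
required_score 5.0
bayes_auto_learn 1
bayes_auto_learn_threshold_nonspam 1.5   # looser than the default
"""

for name, value in sorted(autolearn_settings(EXAMPLE_CF).items()):
    print(name, value)
```

A nonspam threshold well above the default, as in this made-up fragment, is one way low-scoring spam could end up autolearned as ham. Setting bayes_auto_learn to 0 switches autolearning off while the problem is investigated.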

If you're manually training, do you keep your corpora so that you can review and correct errors? If so, review your ham corpora and see if any spams have crept in - and if so, retrain them as spam; SA will forget that they were hammy.
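
The retraining step comes down to running sa-learn --spam on each misfiled message, and the start-fresh option mentioned earlier to sa-learn --clear. The Python sketch below only builds and prints those command lines rather than running them; the file paths are hypothetical, and the helper names are made up for this example.

```python
import shlex


def retrain_as_spam_commands(misfiled_paths):
    """One `sa-learn --spam` invocation per message found in the ham corpus."""
    return [["sa-learn", "--spam", path] for path in misfiled_paths]


def wipe_database_command():
    """Command that clears the Bayes database entirely (last resort)."""
    return ["sa-learn", "--clear"]


# Hypothetical paths to spams discovered in a ham corpus.
misfiled = ["corpus/ham/0412.eml", "corpus/ham/0977.eml"]

for cmd in retrain_as_spam_commands(misfiled):
    print(shlex.join(cmd))
```

To actually execute a command, something like subprocess.run(cmd, check=True) would do, run as the user that owns the Bayes database being corrected.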


--
 John Hardin KA7OHZ                    http://www.impsec.org/~jhardin/
 jhar...@impsec.org    FALaholic #11174     pgpk -a jhar...@impsec.org
 key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
  If someone has a gun and is trying to kill you, it would be
  reasonable to shoot back with your own gun.
                                      -- the Dalai Lama, May 15, 2001
-----------------------------------------------------------------------
 Today: the 1930th anniversary of the destruction of Pompeii
