On Tue, 25 Aug 2009, Dennis German wrote:

email with this content:

CONGRATULATION YOUR EMAIL ADDRESS HAS WON YOU THE 2010 FIFA WORLDCUP LOTTERY
OPEN THE ATTACHMENT AND VIEW THE PROFILE OF YOUR WINNING FUND, ALSO CONTACT
YOUR CLAIM AGENT

received these scores

X-Spam-testscores: BAYES_00=-2.599,HTML_MESSAGE=0.001,MISSING_HEADERS=5.7,
   SUBJ_ALL_CAPS=3.1,UPPERCASE_75_100=1.528

Does this indicate that bayes needs tuning/learning?

Can you paste the output from "sa-learn --dump magic" ?
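
The lines of most interest in that output are the message and token counts. A rough sketch of pulling them out, assuming the usual dump layout where each non-token record ends in "non-token data: <name>" (bayes_counts is just an illustrative helper name):

```shell
# Keep only the spam/ham message counts and the token count from the
# "magic" (non-token) records that sa-learn prints.
bayes_counts() {
    grep -E 'non-token data: (nspam|nham|ntokens)$'
}

# Query the live database only if sa-learn is available to this user.
if command -v sa-learn >/dev/null 2>&1; then
    sa-learn --dump magic | bayes_counts
fi
```

Run it as the same user that SA scans mail as, otherwise you may be looking at a different Bayes database.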

It probably indicates that Bayes has been mistrained - somebody is training spammy messages as ham.
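
To put a number on it, here is a minimal sketch that adds up the scores quoted above, with and without the Bayes rule. It uses only the values from that header; sum_scores is a made-up helper name.

```shell
# Sum a comma-separated list of RULE=score pairs, and also sum the same
# list with any BAYES_* rule left out.
sum_scores() {
    printf '%s\n' "$1" | tr ',' '\n' | awk -F= '
        { total += $2 }
        $1 !~ /^BAYES_/ { rest += $2 }
        END { printf "total=%.3f without_bayes=%.3f\n", total, rest }'
}

# Scores copied from the X-Spam-testscores header in the original message.
sum_scores 'BAYES_00=-2.599,HTML_MESSAGE=0.001,MISSING_HEADERS=5.7,SUBJ_ALL_CAPS=3.1,UPPERCASE_75_100=1.528'
# prints: total=7.730 without_bayes=10.329
```

So Bayes is knocking about 2.6 points off a message that the other rules already consider clearly spammy.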

How do you do your Bayes training? Autolearning, or purely manual, or some combination?

How many messages are getting inappropriate Bayes scores? If a lot are, you'll probably want to turn off autolearning (if you're using it) until you analyze the problem. You may need to wipe your Bayes database and start fresh if the problem is bad enough.

If you're using autolearning, what are your learning thresholds?
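
A quick way to check is to grep your config for the autolearn options (bayes_auto_learn, bayes_auto_learn_threshold_nonspam and bayes_auto_learn_threshold_spam). The sketch below is only that; show_autolearn_settings is an illustrative name, and you pass it whichever config files your install actually uses.

```shell
# Print any autolearn-related settings found in the given config files.
# Unreadable or missing files are silently skipped.
show_autolearn_settings() {
    grep -hE '^[[:space:]]*bayes_auto_learn(_threshold_(nonspam|spam))?[[:space:]]' "$@" 2>/dev/null
}
```

For example, show_autolearn_settings /etc/mail/spamassassin/*.cf ~/.spamassassin/user_prefs (those paths are assumptions, adjust them for your setup). If nothing is printed, the built-in defaults apply; as far as I recall those are 0.1 for nonspam and 12.0 for spam.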

If you're manually training, do you keep your corpora so that you can review and correct errors? If so, review your ham corpora and see if any spams have crept in - and if so, retrain them as spam. SA will then forget that they were hammy.
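
Retraining is just a matter of feeding the misfiled messages back through sa-learn. A hedged sketch follows: the run wrapper, the function names and the DRY_RUN switch are my own illustration (not part of SA), and the directory you pass in is whatever holds the messages you pulled out of the ham corpus.

```shell
# Print the command when DRY_RUN=1, otherwise execute it.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Relearn the messages in the given file or directory as spam.
relearn_as_spam() {
    run sa-learn --spam "$1"
}

# Last resort mentioned earlier: wipe the Bayes database and start fresh.
wipe_bayes() {
    run sa-learn --clear
}
```

Try it with DRY_RUN=1 first to see what would be run, for example relearn_as_spam /path/to/misfiled, where that path is a placeholder.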

--
 John Hardin KA7OHZ                    http://www.impsec.org/~jhardin/
 jhar...@impsec.org    FALaholic #11174     pgpk -a jhar...@impsec.org
 key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
  If someone has a gun and is trying to kill you, it would be
  reasonable to shoot back with your own gun.
                                      -- the Dalai Lama, May 15, 2001
-----------------------------------------------------------------------
 Today: the 1930th anniversary of the destruction of Pompeii
