On Tue, 25 Aug 2009, Dennis German wrote:
email with this content:
CONGRATULATION YOUR EMAIL ADDRESS HAS WON YOU THE 2010 FIFA WORLDCUP LOTTERY
OPEN THE ATTACHMENT AND VIEW THE PROFILE OF YOUR WINNING FUND, ALSO CONTACT
YOUR CLAIM AGENT
received these scores
X-Spam-testscores: BAYES_00=-2.599,HTML_MESSAGE=0.001,MISSING_HEADERS=5.7,
SUBJ_ALL_CAPS=3.1,UPPERCASE_75_100=1.528
Does this indicate that bayes needs tuning/learning?
Can you paste the output from "sa-learn --dump magic" ?
It probably indicates that Bayes has been mistrained - somebody is
training spammy messages as ham.
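
To see how much the Bayes rule is pulling the total down, the scores in the
quoted header can be summed rule by rule. This is only a minimal sketch in
Python: it assumes the "RULE=score,RULE=score" layout shown in the quoted
X-Spam-testscores header, and the function name parse_testscores is made up
for illustration (it is not part of SpamAssassin).

```python
def parse_testscores(header_value):
    """Split 'RULE=score,RULE=score,...' into a dict of rule -> float."""
    scores = {}
    # The header may be folded across lines, so drop newlines first.
    for item in header_value.replace("\n", "").split(","):
        item = item.strip()
        if not item:
            continue
        rule, _, value = item.partition("=")
        scores[rule] = float(value)
    return scores


# Header value copied from the quoted message.
header = (
    "BAYES_00=-2.599,HTML_MESSAGE=0.001,MISSING_HEADERS=5.7,\n"
    " SUBJ_ALL_CAPS=3.1,UPPERCASE_75_100=1.528"
)

scores = parse_testscores(header)
total = round(sum(scores.values()), 3)
without_bayes = round(total - scores["BAYES_00"], 3)

print("total:", total)                    # total: 7.73
print("without BAYES_00:", without_bayes)  # without BAYES_00: 10.329
```

BAYES_00 is the rule that fires when Bayes thinks a message is very unlikely
to be spam, so on an obvious lottery scam it is subtracting about 2.6 points
that it should not be.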
How do you do your Bayes training? Autolearning, or purely manual, or some
combination?
How many messages are getting inappropriate Bayes scores? If a lot are,
you'll probably want to turn off autolearning (if you're using it) until
you analyze the problem. You may need to wipe your Bayes database and
start fresh if the problem is bad enough.
If you're using autolearning, what are your learning thresholds?
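
For reference, the idea behind the two thresholds can be sketched roughly as
below. This is a simplified illustration in Python, not SpamAssassin's actual
logic: the function name is hypothetical, the defaults are taken from the
documented defaults for bayes_auto_learn_threshold_nonspam (0.1) and
bayes_auto_learn_threshold_spam (12.0), the exact comparison at the boundary
is an assumption, and the real autolearner applies further conditions and
uses a score that leaves out the Bayes rules themselves.

```python
def autolearn_decision(score, nonspam_threshold=0.1, spam_threshold=12.0):
    """Return 'ham', 'spam', or 'none' for a given autolearn score.

    Simplified: only the two thresholds are modelled here. The boundary
    handling (strict < and >) is an assumption made for this sketch.
    """
    if score < nonspam_threshold:
        return "ham"    # low enough to be learned as non-spam
    if score > spam_threshold:
        return "spam"   # high enough to be learned as spam
    return "none"       # in between: not autolearned either way


for s in (-1.5, 5.0, 15.0):
    print(s, autolearn_decision(s))
```

If the nonspam threshold has been raised well above the default, borderline
spam can slip under it and be learned as ham, which is one way a database
ends up mistrained.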
If you're manually training, do you keep your corpora so that you can
review and correct errors? If so, review your ham corpora and see if any
spams have crept in - and if so, retrain them as spam; SA will forget that
they were hammy.
--
John Hardin KA7OHZ http://www.impsec.org/~jhardin/
jhar...@impsec.org FALaholic #11174 pgpk -a jhar...@impsec.org
key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
If someone has a gun and is trying to kill you, it would be
reasonable to shoot back with your own gun.
-- the Dalai Lama, May 15, 2001
-----------------------------------------------------------------------
Today: the 1930th anniversary of the destruction of Pompeii