On Wed, 29 Jun 2011, JKL wrote:

mysql> select count(spam_count) from bayes_vars;
+-------------------+
| count(spam_count) |
+-------------------+
|               185 |
+-------------------+

mysql> select count(ham_count) from bayes_vars;
+------------------+
| count(ham_count) |
+------------------+
|              185 |
+------------------+

That's not sufficient for Bayes to start scoring messages. The minimum is 200 each of ham and spam.

It's generally considered a good idea to train misses and to try to keep the ratio to something approaching your spam:ham ratio in email by training regular email that doesn't score really high or low. I keep mine at about 3:1 spam:ham tokens by training misses and anything in the 10-80% range.

--
 John Hardin KA7OHZ                    http://www.impsec.org/~jhardin/
 jhar...@impsec.org    FALaholic #11174     pgpk -a jhar...@impsec.org
 key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
  Pork (n): (political) The manifestation of the principle that it is
  a felony to bribe a legislator, unless you are also a legislator.
-----------------------------------------------------------------------
 5 days until the 235th anniversary of the Declaration of Independence

Reply via email to