On Wed, 29 Jun 2011, JKL wrote:
mysql> select count(spam_count) from bayes_vars;
+-------------------+
| count(spam_count) |
+-------------------+
| 185 |
+-------------------+
mysql> select count(ham_count) from bayes_vars;
+------------------+
| count(ham_count) |
+------------------+
| 185 |
+------------------+
That's not sufficient for Bayes to start scoring messages. The minimum is
200 each of ham and spam.
It's generally considered a good idea to train misses and to try to keep
the ratio to something approaching your spam:ham ratio in email by
training regular email that doesn't score really high or low. I keep mine
at about 3:1 spam:ham tokens by training misses and anything in the 10-80%
range.
--
John Hardin KA7OHZ http://www.impsec.org/~jhardin/
jhar...@impsec.org FALaholic #11174 pgpk -a jhar...@impsec.org
key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
Pork (n): (political) The manifestation of the principle that it is
a felony to bribe a legislator, unless you are also a legislator.
-----------------------------------------------------------------------
5 days until the 235th anniversary of the Declaration of Independence