On Tue, 29 Mar 2011, Max wrote:

For a while we were getting spam messages that had images embedded as text rather than as an attachment. Those are marked as spam, but couldn't the random characters of the image data increase the entropy of the database and cause some less-than-definitive scores?

I'm pretty sure that the content of images does not affect Bayes.

That aside, it seems like all my ham is below 0, so would changing the cutoff to something like 2.0 be bad practice?

In general it is not recommended.

All of the score generation is done with the assumption that the threshold is 5 points. If you lower your threshold, you are risking an increase in false positives (FPs).
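
To illustrate the risk, here is a minimal Python sketch. The ham scores are made up for illustration and do not come from any real mail flow, and the helper name count_false_positives is hypothetical. It only assumes that a message is tagged as spam once its score reaches the threshold (the required_score setting).

```python
def count_false_positives(ham_scores, threshold):
    """Count ham messages whose score reaches the spam threshold."""
    return sum(1 for score in ham_scores if score >= threshold)


# Hypothetical ham scores, invented for illustration only.
# Most are low, but a few legitimate messages land between 2 and 5.
ham_scores = [-2.1, -0.5, 0.0, 1.3, 2.4, 3.8, 4.6]

# Threshold the rule scores are tuned for.
fps_at_default = count_false_positives(ham_scores, 5.0)

# Lowered threshold from the question above.
fps_at_lowered = count_false_positives(ham_scores, 2.0)

print("FPs at 5.0:", fps_at_default)
print("FPs at 2.0:", fps_at_lowered)
```

Even if all ham currently scores below 0, a legitimate message that happens to hit a few rules could fall between 2 and 5, and that is the range a lowered threshold gives up.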

--
 John Hardin KA7OHZ                    http://www.impsec.org/~jhardin/
 jhar...@impsec.org    FALaholic #11174     pgpk -a jhar...@impsec.org
 key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
   "A well educated Electorate, being necessary to the liberty of a
    free State, the Right of the People to Keep and Read Books,
    shall not be infringed."
  ...means only registered voters can read books, and only those books
  obtained with State permission from State-controlled bookstores?
-----------------------------------------------------------------------
 Today: the M1911 is 100 years old - and still going strong!
