Re: Increase in Image Spam

Amir Caspi Thu, 20 Feb 2014 11:35:26 -0800

On Feb 20, 2014, at 11:21 AM, Kris Deugau <kdeu...@vianet.ca> wrote:

> Have you tried learning one specific FN, then reprocessing that message
> to see what Bayes score it gets?  IME it will usually shift from
> BAYES_00 to at least BAYES_40 in most cases, even with a large sitewide
> DB with far more tokens than the usual per-user DB.


Well, I just tried this, and sa-learn seems to be refusing to learn the 
messages.  I've placed an example MBOX here, temporarily (I will delete this 
within the next 24-48 hours for security):

https://www.dropbox.com/s/m4fuv670wnvwa16/SA_testspam.mbox

When I run sa-learn on this mailbox, it says:

Learned tokens from 0 message(s) (0 message(s) examined)

(This is using SA 3.3.2 on a CentOS 5.10 box.)

I tried placing other spam in here and it learned those fine, so clearly 
something about these two messages is confusing sa-learn.

Anyone have an idea why sa-learn is refusing to even examine these messages?

(Note that the messages are out of order; the first one is newer than the 
second.  The older one scored Bayes_50, the newer one scored Bayes_00.)

Any thoughts are greatly appreciated, I don't know why sa-learn won't even 
touch these... and that may explain why they continue to have low scores!

--- Amir

Re: Increase in Image Spam

Reply via email to