Re: SpamAssassin false positive bayes with attachments

2014-10-06 Thread Joe Albertson
After reading your reply, I re-examined the message and found the case was an incorrect Content-Type: ~~~ Content-Type: text/plain; charset=windows-1250; name="pdfname.pdf" Content-Transfer-Encoding: base64 Content-Disposition: attachment; filename="pdfname.pdf" ~~~ So it was scanning the base64

Re: SpamAssassin false positive bayes with attachments

2014-10-06 Thread David F. Skoll
On Mon, 06 Oct 2014 21:28:02 +0200 Karsten Bräckelmann wrote: > Unless the message's MIME-structure is severely broken, these tokens > appear somewhere other than a base64 encoded attachment. Agreed, and a Qmail bounce message is a prime example of a message whose MIME structure is "severely bro

Re: SpamAssassin false positive bayes with attachments

2014-10-06 Thread Karsten Bräckelmann
On Mon, 2014-10-06 at 09:03 -0400, jdime abuse wrote: > I have been seeing some issues with bayes detection from base64 > strings within attachments causing false positives. > > Example: > Oct 6 09:02:14.374 [15869] dbg: bayes: token 'H4f' => 0.71186828264 > Oct 6 09:02:14.374 [15869] dbg: b

Re: SpamAssassin false positive bayes with attachments

2014-10-06 Thread Benny Pedersen
On October 6, 2014 3:03:30 PM jdime abuse wrote: I have been seeing some issues with bayes detection from base64 strings within attachments causing false positives. Train more data then, bayes needs more data to prevent it Example: Oct 6 09:02:14.374 [15869] dbg: bayes: token 'H4f' => 0.99

SpamAssassin false positive bayes with attachments

2014-10-06 Thread jdime abuse
I have been seeing some issues with bayes detection from base64 strings within attachments causing false positives. Example: Oct 6 09:02:14.374 [15869] dbg: bayes: token 'H4f' => 0.71186828264 Oct 6 09:02:14.374 [15869] dbg: bayes: token 'wx2' => 0.68644662127 Oct 6 09:02:14.374 [15869]