Re: Bayes Poisoning

2011-10-18 Thread Daniel McDonald
On 10/18/11 12:12 PM, "Karsten Bräckelmann" wrote: > On Tue, 2011-10-18 at 07:53 -0500, Daniel McDonald wrote: >> One of my users submitted a spam for analysis, and I was amazed at the >> efforts this troglodyte expended to poison bayes. >> Is it worth the effort to try to find huge html comme

Re: Bayes Poisoning

2011-10-18 Thread Karsten Bräckelmann
On Tue, 2011-10-18 at 07:53 -0500, Daniel McDonald wrote: > One of my users submitted a spam for analysis, and I was amazed at the > efforts this troglodyte expended to poison bayes. > Is it worth the effort to try to find huge html comments hiding junk > like this? Hmm, wait -- Bayes and HTML com

Re: Bayes Poisoning

2011-10-18 Thread Joseph Brennan
Daniel McDonald wrote: Rawbody OBFU_HTML_LONG_COMMENT /\<--.{1024,}?--\>/ Describe OBFU_HTML_LONG_COMMENT contains a ridiculously long html comment Tried with exactly that limit, 1 kb. TargetX, which is used by universities in recruiting, uses a long comment in its generated mail (I did no

Re: Bayes Poisoning

2011-10-18 Thread Bowie Bailey
On 10/18/2011 8:53 AM, Daniel McDonald wrote: > One of my users submitted a spam for analysis, and I was amazed at the > efforts this troglodyte expended to poison bayes. > Is it worth the effort to try to find huge html comments hiding junk > like this? > > Maybe something like > > Rawbody OBFU_HT

Re: bayes poisoning

2007-01-16 Thread Chris Purves
maillist wrote: I see a few emails every-now-and-then about "bayes poisoning", and am wondering what is means. From what I understand, it is some message that gets learned (only through autolearn?) that has certain characteristics that throw the bayes system off. From what I've seen there

RE: Bayes poisoning (was Re: your mail)

2006-09-27 Thread Bowie Bailey
Peter Smith wrote: > > > The messages are simply a random stream of words, with punctuation > > > scattered in them. No HTML, no URLs being advertised, no excessive > > > capitalisation, just meaningless text. > > I'm cautious about feeding these messages to sa-learn as spam, in > case it has a ne

Re: Bayes poisoning ?

2005-07-22 Thread Loren Wilton
The best thing to do is probably throw the current database away and start over. As you seem to have several users, you should have bayes working again within a very few hours, or less. You should delete the current database, reset the scores to normal (and increase the bayes_99 score to somethin