On 10/18/11 12:12 PM, "Karsten Bräckelmann" wrote:
> On Tue, 2011-10-18 at 07:53 -0500, Daniel McDonald wrote:
>> One of my users submitted a spam for analysis, and I was amazed at the
>> efforts this troglodyte expended to poison bayes.
>> Is it worth the effort to try to find huge html comme
On Tue, 2011-10-18 at 07:53 -0500, Daniel McDonald wrote:
> One of my users submitted a spam for analysis, and I was amazed at the
> efforts this troglodyte expended to poison bayes.
> Is it worth the effort to try to find huge html comments hiding junk
> like this?
Hmm, wait -- Bayes and HTML com
Daniel McDonald wrote:
rawbody OBFU_HTML_LONG_COMMENT /\<!--.{1024,}?--\>/
describe OBFU_HTML_LONG_COMMENT Contains a ridiculously long HTML comment
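
For anyone who wants to poke at the pattern outside SpamAssassin first, here is a rough Python sketch. It assumes the comment opener is the standard `<!--`, keeps the 1024-character threshold from the rule, and uses `re.DOTALL` so a comment may span lines. That flag is my assumption, and the way SpamAssassin chunks rawbody text is not modelled here. The names `LONG_COMMENT` and `has_long_comment` are made up for illustration.

```python
import re

# Python rendering of the rule's pattern. The 1024 threshold comes from the
# rule above; re.DOTALL is an assumption so that "." also matches newlines.
LONG_COMMENT = re.compile(r"<!--.{1024,}?-->", re.DOTALL)


def has_long_comment(html: str) -> bool:
    """Return True if the pattern finds a span of 1024+ characters
    between a comment opener and a later comment closer."""
    return LONG_COMMENT.search(html) is not None


# An ordinary short comment: should not match.
short = "<p>hi</p><!-- tracking id 42 --><p>bye</p>"

# One comment stuffed with 1200 characters of filler: should match.
padded = "<p>hi</p><!--" + "lorem ipsum " * 100 + "--><p>bye</p>"

# Two short comments separated by 2000 characters of ordinary text.
# The pattern does not stop at the first "-->", so this matches too.
spanning = "<!-- a -->" + "x" * 2000 + "<!-- b -->"

for name, sample in [("short", short), ("padded", padded), ("spanning", spanning)]:
    print(name, has_long_comment(sample))
```

The third sample is worth noting: because `.` can run past the first `-->`, two unrelated short comments that sit far enough apart are flagged as well, which is one way legitimate mail could trip a rule written like this.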
Tried with exactly that limit, 1 KB.
TargetX, which is used by universities in recruiting, uses a long comment
in its generated mail (I did no
On 10/18/2011 8:53 AM, Daniel McDonald wrote:
> One of my users submitted a spam for analysis, and I was amazed at the
> efforts this troglodyte expended to poison bayes.
> Is it worth the effort to try to find huge html comments hiding junk
> like this?
>
> Maybe something like
>
> Rawbody OBFU_HT
maillist wrote:
I see a few emails every now and then about "bayes poisoning", and am
wondering what it means. From what I understand, it is some message
that gets learned (only through autolearn?) that has certain
characteristics that throw the bayes system off.
From what I've seen there
Peter Smith wrote:
> > > The messages are simply a random stream of words, with punctuation
> > > scattered in them. No HTML, no URLs being advertised, no excessive
> > > capitalisation, just meaningless text.
>
> I'm cautious about feeding these messages to sa-learn as spam, in
> case it has a ne
The best thing to do is probably throw the current database away and start
over. As you seem to have several users, you should have bayes working
again within a very few hours, or less.
You should delete the current database, reset the scores to normal (and
increase the bayes_99 score to somethin