Re: How SA reactes to a bunch of garbage characters

2016-06-15 Thread Olivier
Matus, >>To a part that would do regexp rules, but not Bayes? I don't know if it >>is possible. > > someone who knoes SA internals will have to answer this one, but I doubt > it's useful, see below. I will give a look at Bayes OCR, it does inject the text OCR'ed from an image into the body of th

Re: How SA reactes to a bunch of garbage characters

2016-06-15 Thread Olivier
RW, > I stopped using OCR a long time ago because I didn't find that image > spam was particularly hard to catch. These days I find that spams with > images are mostly either pictures of Russian girls or spoofed corporate > logos. Then you need something able to detect the amount of flesh on a p

Re: Penalizing code not working

2016-06-15 Thread spamassassin
And a convenient spam purporting to come from google verifies the second part of the test. Thanks again. :) -- Dave Stiles

Re: how to write body rules to match 'tortured html' variations of text phrases?

2016-06-15 Thread David B Funk
On Thu, 16 Jun 2016, RW wrote: On Wed, 15 Jun 2016 13:40:25 -0700 (PDT) John Hardin wrote: On Wed, 15 Jun 2016, jaso...@mail-central.com wrote: and all the possible line-broken and "="-delimited variations? There's obviously a lot of them. That would have to be a rawbody rule AFAIK QP i

Re: how to write body rules to match 'tortured html' variations of text phrases?

2016-06-15 Thread John Hardin
On Wed, 15 Jun 2016, Groach wrote: Here is a real example: body __MY_PHISH_CIRCUMVENT_ATTEMPT3 /((?!account)(\xD0\xB0|a)(\xD1\x81|c){2}(\xD0\xBE|o)u(\xD5\xB8|n)t|(?!customer)(\xE1\xB4\x84|c)u(\xD1\x95|S)t(\xD0\xBE|o)mer|(?!verif(y|i))ver(\xD1\x96|i)f((\xD1\x83|y)| (\xD1\x96|i)))/i (effec

Re: how to write body rules to match 'tortured html' variations of text phrases?

2016-06-15 Thread RW
On Wed, 15 Jun 2016 13:40:25 -0700 (PDT) John Hardin wrote: > On Wed, 15 Jun 2016, jaso...@mail-central.com wrote: > > and all the possible line-broken and "="-delimited variations? > > There's obviously a lot of them. > > That would have to be a rawbody rule AFAIK QP is decoded even in the r

Re: how to write body rules to match 'tortured html' variations of text phrases?

2016-06-15 Thread Groach
On 15/06/2016 22:42, Dianne Skoll wrote: On Wed, 15 Jun 2016 13:40:25 -0700 (PDT) John Hardin wrote: That's (more or less) "Quoted Printable" encoding. AFAIK, SpamAssassin "body" rules are applied after the Content-Transfer-Encoding: has been decoded. So the QP equal signs are a red herring

Re: how to write body rules to match 'tortured html' variations of text phrases?

2016-06-15 Thread Groach
On 15/06/2016 22:42, Dianne Skoll wrote: On Wed, 15 Jun 2016 13:40:25 -0700 (PDT) John Hardin wrote: That's (more or less) "Quoted Printable" encoding. AFAIK, SpamAssassin "body" rules are applied after the Content-Transfer-Encoding: has been decoded. So the QP equal signs are a red herring

Re: how to write body rules to match 'tortured html' variations of text phrases?

2016-06-15 Thread Dianne Skoll
On Wed, 15 Jun 2016 13:40:25 -0700 (PDT) John Hardin wrote: > That's (more or less) "Quoted Printable" encoding. AFAIK, SpamAssassin "body" rules are applied after the Content-Transfer-Encoding: has been decoded. So the QP equal signs are a red herring. Regards, Dianne.

Re: how to write body rules to match 'tortured html' variations of text phrases?

2016-06-15 Thread John Hardin
On Wed, 15 Jun 2016, jaso...@mail-central.com wrote: For example, here's a body snippet from one of those 'tortured' spams - #hearthrugs-tablecloths-dishcovers-coalscuttles-a { pl= ay-during: auto; page-break-before: auto }Succes= sful women join us and become even more successful.

how to write body rules to match 'tortured html' variations of text phrases?

2016-06-15 Thread jasonsu
I've installed SA 3.4.1. I'm writing body rules to deal with some persistent spam I'm getting. plain-text match rules are simple enough. Much of the spam contains 'tortured html'. I just want to get clear about how to correctly match it. For example, here's a body snippet from one of those 't

Re: Penalizing code not working

2016-06-15 Thread SA
Thanks for the extra info and advice, RW. Noted. :) -- Dave Stiles

Re: Penalizing code not working

2016-06-15 Thread SA
Thanks, RW. That's stopped it reporting valid gmail. Now I'll have to wait to see if it reports false gmail. :) -- Dave Stiles

Re: Penalizing code not working?: Don't mix company and user email domains.

2016-06-15 Thread Tom Hendrikx
On 15-06-16 00:13, Linda A. Walsh wrote: > > > spamassas...@linkcheck.co.uk wrote: >> The code below is found in several places online and for some months I >> have been trying to get it to work, but whatever I do it flags up Fail >> even if the source is good. Typically I have been concentrati