Re: FROM header obfuscation

Kris Deugau Tue, 08 Feb 2022 07:43:16 -0800

Frido Otten wrote:

Hi All,
Recently we're seeing more spam passing our spamfilters using textobfuscating in the FROM header. The problem mainly targets users whichare using mail clients like iPhone Mail which are only displaying thedisplay name of the FROM header and not the actual email address whichwas used, bypassing DKIM measures. For example:
From: =?UTF-8?B?0KBvc3RubC5ubCDQoGFra2V0?= <a...@qbocel.com>
This is base64 encoded "Рostnl.nl Рakket" and pretends to come fromPostnl, a dutch snailmail company. However the hexadecimalrepresentation of this base64 decoded text differs from that of normalASCII:
Obfuscated:

$ printf "Рostnl.nl Рakket" | od -A n -t x1
  d0 a0 6f 73 74 6e 6c 2e 6e 6c 20 d0 a0 61 6b 6b
  65 74

Plain ASCII:

$ printf "Postnl.nl Pakket" | od -A n -t x1
  50 6f 73 74 6e 6c 2e 6e 6c 20 50 61 6b 6b 65 74

There is no way to tell the difference with the naked eye.

That depends on the font. Many variations do in fact look different,and from some of the FP-approaching "ham" I've seen that abuses this Ican only conclude that some marketing.... person has decided that thisis Necessary and Required and the tech folks can Go Suck It.

As far as I'm concerned, formatting outside of language accents oncharacters absolutely does NOT belong in either the From: name orSubject. An "a" in the From: name or Subject absolutely MUST bepresented as a US-ASCII "a", and not some extended UTF8 lookalikethat's... oooooo! in *italics*!

Naturally the spammers go to various amounts of effort to avoid the onesthat are clearly different.

Is there any way to detect this type of obfuscation with a spamassassinrule?

I have a longish list of rule groups similar to below for differentextended UTF8 ASCII-lookalike characters and words. Some are derivedfrom rules discussed on this list within the past year or so.


header  __SUSP_NAME_CHAR_01     From:name =~ /(?:\xd0[\xa0-\xbf])/
tflags __SUSP_NAME_CHAR_01 multiple maxhits 10

header __SUSP_NAME_CHAR_02 From:name =~/(?:\xef\xbc[\x80-\xbf]|\xef\xbd[\x80-\xa0])/

tflags __SUSP_NAME_CHAR_02 multiple maxhits 10
meta    __SUSP_NAME_CHAR        __SUSP_NAME_CHAR_01 + __SUSP_NAME_CHAR_02
meta    SUSP_NAME_CHAR_5        __SUSP_NAME_CHAR >= 5

describe SUSP_NAME_CHAR_5 5 or more lookalike characters in theFrom: name

score   SUSP_NAME_CHAR_5        1.5
meta    SUSP_NAME_CHAR_10       __SUSP_NAME_CHAR >= 10

describe SUSP_NAME_CHAR_10 10 or more lookalike characters in theFrom: name

score   SUSP_NAME_CHAR_10       1.75

I've used this tool:

https://www.utf8-chartable.de/

with a bit of effort to take an example character and locate the fulla-z list of entries for these rules. (Convert individual characters tohex, then flip pages until you've found the fakes. There are many groups.)

Single characters are trickier; depending on context I've added rulesfor individual lookalike characters, or whole words with mixed variants(and an exclusion for pure ASCII) as I see new runs of FNs.


-kgd

Re: FROM header obfuscation

Reply via email to