Hi,

On 08/25/2014 05:17 PM, Michael Opdenacker wrote:
>
> Is there a simple way to give a penalty to messages containing non latin
> UTF-8 characters?
>
> I'm asking because we are receiving quite a lot of Chinese junk mail
> with subjects in Chinese (or more generally non-latin) characters, but:
>
> - The body is too short for 'ok_languages' to detect and discard the
> unwanted language.
>
> - The charset is UTF-8, and therefore 'ok_locales en' doesn't mind.
>
> - I shouldn't blacklist domains such as @163.com (a major source of
> spam) because there is legitimate traffic coming from this domain, for
> example e-mails sent to the LKML, which most of us subscribe to.
>
> I'm seeing fairly elaborate solutions on the net, but it surprises me
> that an apparently simple problem doesn't have a simple solution yet.

I find it hard to believe I'm the only one getting spam in Chinese
characters ;)

How do you guys handle this kind of spam? For the moment, I blacklisted
the 163 dot com and 126 dot com domains, without feeling too much guilt.
It's not a perfect solution though, as I'm excluding a few posters on
the LKML (for example).

Michael.

-- 
Michael Opdenacker, CEO, Free Electrons
Embedded Linux, Kernel and Android engineering
http://free-electrons.com
+33 484 258 098

Reply via email to