Hi, On 08/25/2014 05:17 PM, Michael Opdenacker wrote: > > Is there a simple way to give a penalty to messages containing non latin > UTF-8 characters? > > I'm asking because we are receiving quite a lot of Chinese junk mail > with subjects in Chinese (or more generally non-latin) characters, but: > > - The body is too short for 'ok_languages' to detect and discard the > unwanted language. > > - The charset is UTF-8, and therefore 'ok_locales en' doesn't mind. > > - I shouldn't blacklist domains such as @163.com (a major source of > spam) because there is legitimate traffic coming from this domain, for > example e-mails sent to the LKML, which most of us subscribe to. > > I'm seeing fairly elaborate solutions on the net, but it surprises me > that an apparently simple problem doesn't have a simple solution yet.
I find it hard to believe I'm the only one getting spam in Chinese characters ;) How do you guys handle this kind of spam? For the moment, I blacklisted the 163 dot com and 126 dot com domains, without feeling too much guilt. It's not a perfect solution though, as I'm excluding a few posters on the LKML (for example). Michael. -- Michael Opdenacker, CEO, Free Electrons Embedded Linux, Kernel and Android engineering http://free-electrons.com +33 484 258 098