-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On Fri, Jul 25, 2003 at 11:25:17PM +0200, Chr. von Stuckrad wrote: > > body MY_CONSONANT_4 /[^aeiou]{4}/ > > describe MY_CONSONANT_4 Body contains 4 consecutive consonants. > > score MY_CONSONANT_4 0.15 > > The pattern might be dangerous for french, chinese, > or polish mails :-) Because chinese utf8 or koi8 code has > 'only consonants' of the above definition. > And e.g. frnch has lots of accented characters. > Also polish has words with SO many consonants > in a row, that even we germans have problems with > those words :-)
Well, I did give a warning about languages. In my case I don't speak Chinese, Polish, German or French, so I consider anything in those langauges spam. I only speak Spanish and English. When I'll learn French (which I intend to some day) I'll update the rule. > So the pattern may work in somebody's *private* > setup but might be a bad idea in a global setting. Yes, definitely. That's what it was meant for. - -- Daniel Carrera | OpenPGP fingerprint: Mathematics Dept. | 6643 8C8B 3522 66CB D16C D779 2FDD 7DAC 9AF7 7A88 UMD, College Park | http://www.math.umd.edu/~dcarrera/pgp.html -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.2.2 (SunOS) iD8DBQE/IavvnxE8DWHf+OcRApKFAKCa+wc57E/ii0x0mGZkEAoe7zRk3QCgn28d 17hZd1GXeLU/bawc4rGkFew= =7LLG -----END PGP SIGNATURE----- ------------------------------------------------------- This SF.Net email sponsored by: Free pre-built ASP.NET sites including Data Reports, E-commerce, Portals, and Forums are available now. Download today and enter to win an XBOX or Visual Studio .NET. http://aspnet.click-url.com/go/psa00100003ave/direct;at.aspnet_072303_01/01 _______________________________________________ Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk