On Sun, 2011-10-16 at 21:53 -0300, Christian Grunfeld wrote: > easier than that ! > you dont need to check any ratio at all ... as legitimate mails dont > have non-word characters between characters ! ^^^^^^^^ > Non spamer people don´t write subjects like that ! ^^^^^ > Spamers had to do that in order to avoid sex, porn, xxx, viagra > directly in subject (which is more or less easily detected)...but when ^^^^^^^^^^^^^^^ > they put things in between you can be 99.999% confident it is spam ! ^^^^^^^
Yup, there never ever are non-word chars between word chars in human generated legit mail... -- char *t="\10pse\0r\0dtu\0.@ghno\x4e\xc8\x79\xf4\xab\x51\x8a\x10\xf4\xf4\xc4"; main(){ char h,m=h=*t++,*x=t+2*h,c,i,l=*x,s=0; for (i=0;i<l;i++){ i%8? c<<=1: (c=*++x); c&128 && (s+=h); if (!(h>>=1)||!t[s+h]){ putchar(t[s]);h=m;s=0; }}}