On Fri, 2010-05-28 at 12:54 -0700, John Hardin wrote: > It's becoming much more prevalent. I'm seeing lots of 419 and phishing > spams as .txt and .html attachments, more rarely .doc or .rtf.
> Writing an extractor for .txt attachments is trivial, and it shouldn't be > too hard to do HTML. RTF and PDF are included, but do require external > tools be present. Err, unless I am terribly mistaken and didn't interpret the docs correctly (too lazy to test) -- SA always uses any textual part. That includes plain text and HTML "attached" to the message. It's just another MIME part anyway. It's a whole different cattle of fish with RTF... -- char *t="\10pse\0r\0dtu...@ghno\x4e\xc8\x79\xf4\xab\x51\x8a\x10\xf4\xf4\xc4"; main(){ char h,m=h=*t++,*x=t+2*h,c,i,l=*x,s=0; for (i=0;i<l;i++){ i%8? c<<=1: (c=*++x); c&128 && (s+=h); if (!(h>>=1)||!t[s+h]){ putchar(t[s]);h=m;s=0; }}}