On Fri, 26 Jun 2009, Jonas Eckerman wrote: > Theo Van Dinter wrote: > > > the convolution is a > > fingerprint that you could write a rule for and then you don't care > > what the content actually is. For example, you'd render something > > like "doc_pdf_jpg", which would make an obvious Bayes token. In the > > same way for a zip file, you could do "zip_pdf zip_jpg zip_txt", etc, > > and they'd all be different tokes. > > That's really a good idea. Put the chains of extraction in a > pseudoheader that can be tested in rules and seen as a token by bayes. > > I'm putting that in the todo for the plugin.
It would be a bit cumbersome but you could: create a "pre-filter" program/milter which would parse attachments & MIME structures, create special pseudoheaders with the analysis results in them, insert them into the message and then pass it on to SA. The full power of SA would then be available to attack the exposed info in any way that you wanted and wouldn't require any mods to SA. If you were worried about information leakage you could create a post-filter that would remove the pseudoheaders. -- Dave Funk University of Iowa <dbfunk (at) engineering.uiowa.edu> College of Engineering 319/335-5751 FAX: 319/384-0549 1256 Seamans Center Sys_admin/Postmaster/cell_admin Iowa City, IA 52242-1527 #include <std_disclaimer.h> Better is not better, 'standard' is better. B{