On Fri, Jun 19, 2009 at 3:04 AM, Jason Haar<jason.h...@trimble.co.nz> wrote: > Speaking of image/rtf/word attachment spam; is there any work going on > to standardize this so that the textual output of such attachments could > be fed back into SA?
That functionality already exists (has for almost 3 years, actually), but as in the past (list archives) the documentation hasn't improved for it. :( Here's my last(?) post about it which has some sample code and everything: http://www.nabble.com/Re:-PDFText-Plugin-for-PDF-file-scoring---not-for-PDF-images-p11595641.html