On 13.10.2009 12:13, Jonas Eckerman wrote: > John Hardin wrote: > >> There were mutterings about a generic plugin that would take an >> attachment, process it somehow (e.g. wvHtml, antiword, ps2ascii, or >> whatever was appropriate), and insert the results into the body text >> to be scanned by the regular rules. > > That sounds very much like my ExtractText plugin. It can use command > line tools or perl plugins to extract text from attachments. > > There were a bit more than mutterings about it here. :-) > >> I don't think anything has come of that yet. > > The plugin works, and we use are using it in our mail gateway. > > It's listed on the Custom Plugins wiki page, and is available at > <http://whatever.frukt.org/spamassassin.text.shtml>. > > It comes with a config for extracting text from Word, OpenXML, RTF, ODF > and PDF files. >
Old thread, but I'm trying this now when I'm hit by rtf spam. How do go you configure this? I installed the required binaries and then the ExtractText files to /etc/mail/spamassassin and created ExtractText.pre (no one told to do it but I figured that is needed) with content loadplugin Mail::SpamAssassin::Plugin::ExtractText ExtractText.pm loadplugin Mail::SpamAssassin::Plugin::ExtractText::OpenXML ExtractText/OpenXML.pm But SA -D --lint says Jun 8 20:38:57.000 [10237] warn: plugin: eval failed: Insecure dependency in eval while running with -T switch at /etc/mail/spamassassin/ExtractText.pm line 577. Jun 8 20:38:57.000 [10237] warn: config: failed to parse line, skipping, in "/etc/mail/spamassassin/ExtractText.cf": extracttext_module openxml Mail::SpamAssassin::Plugin::ExtractText::OpenXML SA 3.3.1 in Debian Lenny here. -- http://www.iki.fi/jarif/ I use PGP. If there is an incompatibility problem with your mail client, please contact me. You will step on the night soil of many countries.
signature.asc
Description: OpenPGP digital signature