On 13.10.2009 12:13, Jonas Eckerman wrote:
> John Hardin wrote:
> 
>> There were mutterings about a generic plugin that would take an
>> attachment, process it somehow (e.g. wvHtml, antiword, ps2ascii, or
>> whatever was appropriate), and insert the results into the body text
>> to be scanned by the regular rules.
> 
> That sounds very much like my ExtractText plugin. It can use command
> line tools or perl plugins to extract text from attachments.
> 
> There were a bit more than mutterings about it here. :-)
> 
>> I don't think anything has come of that yet.
> 
> The plugin works, and we are using it in our mail gateway.
> 
> It's listed on the Custom Plugins wiki page, and is available at
> <http://whatever.frukt.org/spamassassin.text.shtml>.
> 
> It comes with a config for extracting text from Word, OpenXML, RTF, ODF
> and PDF files.
> 

Old thread, but I'm trying this now that I'm being hit by RTF spam.

How do you configure this?

I installed the required binaries and then the ExtractText files to
/etc/mail/spamassassin.

I also created ExtractText.pre (no one said to do this, but I figured it
was needed) with the following content:

loadplugin Mail::SpamAssassin::Plugin::ExtractText ExtractText.pm
loadplugin Mail::SpamAssassin::Plugin::ExtractText::OpenXML ExtractText/OpenXML.pm

But spamassassin -D --lint says:

Jun  8 20:38:57.000 [10237] warn: plugin: eval failed: Insecure dependency in eval while running with -T switch at /etc/mail/spamassassin/ExtractText.pm line 577.
Jun  8 20:38:57.000 [10237] warn: config: failed to parse line, skipping, in "/etc/mail/spamassassin/ExtractText.cf": extracttext_module openxml Mail::SpamAssassin::Plugin::ExtractText::OpenXML

SA 3.3.1 in Debian Lenny here.


-- 
http://www.iki.fi/jarif/
I use PGP. If there is an incompatibility problem with your mail
client, please contact me.

You will step on the night soil of many countries.
