On Tue, 13 Oct 2009, Jonas Eckerman wrote:
John Hardin wrote:
There were mutterings about a generic plugin that would take an
attachment, process it somehow (e.g. wvHtml, antiword, ps2ascii, or
whatever was appropriate), and insert the results into the body text to be
scanned by the regular rules.
That sounds very much like my ExtractText plugin. It can use command line
tools or perl plugins to extract text from attachments.
The plugin works, and we use are using it in our mail gateway.
It's listed on the Custom Plugins wiki page, and is available at
<http://whatever.frukt.org/spamassassin.text.shtml>.
It comes with a config for extracting text from Word, OpenXML, RTF, ODF and
PDF files.
Cool! I will have to take a look at that. Thanks!
--
John Hardin KA7OHZ http://www.impsec.org/~jhardin/
jhar...@impsec.org FALaholic #11174 pgpk -a jhar...@impsec.org
key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
If Microsoft made hammers, everyone would whine about how poorly
screws were designed and about how they are hard to hammer in, and
wonder why it takes so long to paint a wall using the hammer.
-----------------------------------------------------------------------
11 days since a sunspot last seen - EPA blames CO2 emissions