On Tue, 13 Oct 2009, Jonas Eckerman wrote:

John Hardin wrote:

 There were mutterings about a generic plugin that would take an
 attachment, process it somehow (e.g. wvHtml, antiword, ps2ascii, or
 whatever was appropriate), and insert the results into the body text to be
 scanned by the regular rules.

That sounds very much like my ExtractText plugin. It can use command line tools or perl plugins to extract text from attachments.

The plugin works, and we use are using it in our mail gateway.

It's listed on the Custom Plugins wiki page, and is available at <http://whatever.frukt.org/spamassassin.text.shtml>.

It comes with a config for extracting text from Word, OpenXML, RTF, ODF and PDF files.

Cool! I will have to take a look at that. Thanks!

--
 John Hardin KA7OHZ                    http://www.impsec.org/~jhardin/
 jhar...@impsec.org    FALaholic #11174     pgpk -a jhar...@impsec.org
 key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
  If Microsoft made hammers, everyone would whine about how poorly
  screws were designed and about how they are hard to hammer in, and
  wonder why it takes so long to paint a wall using the hammer.
-----------------------------------------------------------------------
 11 days since a sunspot last seen - EPA blames CO2 emissions

Reply via email to