On Fri, 30 Jul 2010, Adam Moskowitz wrote:

I want to use spamassassin's per-user whitelisting as part of some mail processing I'm doing. I'm dealing with a lot of messages (potentially over 100,000), but doing it one-at-a-time (and I can't easily change that). spamassassin takes a long time to load and run (1.5 - 2 seconds per message), and it's performing over 50 tests per message even though for this purpose I need only 1 or 2 of those tests.

How about running a spamd on a non-default port, configured with your rules subset, and scoring your corpus using spamc against that spamd? You'll get much better performance than with a foreground spamassassin, and you could potentially multithread it.

Using straight spamassassin is a bad idea for performance reasons, as you've noticed.
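
A rough sketch of the spamd/spamc approach, in Python, in case it helps. The port number (7830), the user name, and the helper names are all made up for illustration, and it assumes a spamd is already listening on that port with the reduced ruleset and per-user configuration enabled. It relies on "spamc -c" printing score/threshold (for example "5.2/5.0") on stdout, so check that against your spamc version before trusting it.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor

# Hypothetical non-default port; spamd's default is 783.
# Assumes spamd was started separately, something like:
#   spamd -d -p 7830 --siteconfigpath=<dir holding only the rules you need>
SPAMD_PORT = 7830


def build_spamc_cmd(user, port=SPAMD_PORT, host="127.0.0.1"):
    """Build a spamc command line: check-only mode (-c) for a given user (-u)."""
    return ["spamc", "-d", host, "-p", str(port), "-u", user, "-c"]


def parse_check_output(text):
    """Parse 'spamc -c' output such as '5.2/5.0' into (score, threshold)."""
    score, threshold = text.strip().split("/")
    return float(score), float(threshold)


def score_message(path, user):
    """Feed one message file to spamc on stdin and return (path, (score, threshold))."""
    with open(path, "rb") as fh:
        proc = subprocess.run(build_spamc_cmd(user), stdin=fh,
                              stdout=subprocess.PIPE)
    return path, parse_check_output(proc.stdout.decode("ascii", "replace"))


def score_corpus(paths, user, workers=4):
    """Score many messages concurrently; each worker runs its own spamc client."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(lambda p: score_message(p, user), paths))
```

If you really are stuck handling messages one at a time, calling score_message() per message still avoids reloading the rules each time, since spamd keeps them in memory. score_corpus() is only worth it if you can batch.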

--
 John Hardin KA7OHZ                    http://www.impsec.org/~jhardin/
 jhar...@impsec.org    FALaholic #11174     pgpk -a jhar...@impsec.org
 key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
  ...much of our country's counterterrorism security spending is not
  designed to protect us from the terrorists, but instead to protect
  our public officials from criticism when another attack occurs.
                                                    -- Bruce Schneier
-----------------------------------------------------------------------
 6 days until the 275th anniversary of John Peter Zenger's acquittal
