On Sun, 17 May 2009, Michael Monnerie wrote:

> Finally measured again, it takes 1h7m to fetch from imap plus remove all markups:

I think the largest part of your problem is the "fetch" part.

The usual setup is for the training mailbox files to reside on the same server that does the spam processing, so that sa-learn only needs local file access. IMAP is in the picture only to make it easier for users to save training messages from their mail clients.

Can you run sa-learn on the same server that is physically storing the training mailbox files and just use regular file access? Or are the messages sequestered in some proprietary data jail like the MS Exchange database?
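
If local access is possible, training could look roughly like the sketch below. It only builds and runs an sa-learn command against a local path. The helper names are made up for illustration, and it assumes that a regular file is an mbox and that a directory holds one message per file (for example a Maildir "cur" directory); adjust that to however your training folders are actually stored.

```python
import os
import subprocess


def build_sa_learn_cmd(kind, path):
    """Return the sa-learn argument list for training on a local path.

    kind is "spam" or "ham"; path is a local mailbox file or directory.
    (Hypothetical helper, not part of SpamAssassin.)
    """
    if kind not in ("spam", "ham"):
        raise ValueError("kind must be 'spam' or 'ham'")
    cmd = ["sa-learn", "--" + kind]
    # Assumption: a regular file is an mbox, so sa-learn needs --mbox.
    # A directory is passed as-is and read as one message per file.
    if os.path.isfile(path):
        cmd.append("--mbox")
    cmd.append(path)
    return cmd


def learn_local(kind, path):
    """Run sa-learn on a local path; raises if sa-learn exits non-zero."""
    subprocess.run(build_sa_learn_cmd(kind, path), check=True)
```

For example, learn_local("spam", "/path/to/training/spam") would run sa-learn directly against that path with no IMAP fetch involved; the path here is a placeholder.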

--
 John Hardin KA7OHZ                    http://www.impsec.org/~jhardin/
 jhar...@impsec.org    FALaholic #11174     pgpk -a jhar...@impsec.org
 key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
  If you ask amateurs to act as front-line security personnel,
  you shouldn't be surprised when you get amateur security.
                                                    -- Bruce Schneier
-----------------------------------------------------------------------
 4 days until the 5th anniversary of SpaceshipOne winning the X-prize
