On Sun, 17 May 2009, Michael Monnerie wrote:
Finally measured again, it takes 1h7m to fetch from imap plus remove all
markups:
I think the largest part of your problem is the "fetch" part.
The way this is usually set up is the training mailbox files reside on the
same server that is doing the spam processing, so that the sa-learn
process is purely local file access. IMAP is only a part of the equation
to make it easier for users to save training messages from their mail
clients.
Can you run sa-learn on the same server that is physically storing the
training mailbox files and just use regular file access? Or are the
messages sequestered in some proprietary data jail like the MS Exchange
database?
--
John Hardin KA7OHZ http://www.impsec.org/~jhardin/
jhar...@impsec.org FALaholic #11174 pgpk -a jhar...@impsec.org
key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
If you ask amateurs to act as front-line security personnel,
you shouldn't be surprised when you get amateur security.
-- Bruce Schneier
-----------------------------------------------------------------------
4 days until the 5th anniversary of SpaceshipOne winning the X-prize